Skip to main content
Engineering, Data / ML

Model Excellence Scores: A Framework for Enhancing the Quality of Machine Learning Systems at Scale

21 March / Global
Featured image for Model Excellence Scores: A Framework for Enhancing the Quality of Machine Learning Systems at Scale
Image
Figure 1: Example ML quality dimensions (in yellow) in a typical ML system.
Image
Figure 2: Relationship among agreement, indicator, objective, use cases, and models.
Image
Figure 3: High-level view of the interaction between the MES framework and various ML systems.
IndicatorsDescriptionPossible ActionsMetric Normalization
Data QualityMeasures the quality of the input datasets used to train the model. This is a compost score for:
– Feature null
– Cross-region consistency
– Missing Partiitions
– Duplicates
– Backfill the missing partitions
– Sync the data partitions across different regions and instances
– De-duplicate the rows in the data
Each component in the composite score is normalized to the percentage scale
Dataset FreshnessMeasures the freshness of the input datasets used to train the model– Retrain with fresh input datasets
– Backfill input datasets if updated data is available
Scale-consistent
Feature and Concept DriftShift in the target and covariate distribution as well as the relationship between the two over time for a model in production– Apply weighted training or retrain the model with fresh data
– Validate the correctness of upstream feature ETL pipelines
Normalized to [0,1] by using normalized distance metric and importance weights
Model InterpretabilityMeasures the presence and confidence of robust feature explanations for each prediction generated by the model– Enable explanationsNormalized to [0,1]
Prediciton AccuracyPrediction accuracy of the model on production traffic (e.g., AUC, normalized RMSE)– Update training datasets to account for train-serve skew
– Check for feature or concept drift
Normalized to [0,1] by normalizing the accuracy metric
Table: Sample of indicators.
Min Cai

Min Cai

Min Cai is a Distinguished Engineer at Uber working on the AI/ML platform, Michelangelo. He received his Ph.D. degree in Computer Science from Univ. of Southern California. He has published over 20 journal and conference papers and holds 6 US patents.

Joseph Wang

Joseph Wang

Joseph Wang serves as a Principal Software Engineer on the AI Platform team at Uber, based in San Francisco. His notable achievements encompass designing the Feature Store, expanding the capacity of the real-time prediction service, developing a robust model platform, and improving the performance of key models. Presently, Wang is focusing his expertise on advancing the domain of generative AI.

Anupriya Mouleesha

Anupriya Mouleesha

Anupriya has been a thought partner and executor for multiple engineering initiatives within Uber spanning software networking, Data, ML platform, UberAI and product development. She currently leads Technical Program Management for Uber's Mobility and Product Platforms.

Sally Mihyoung Lee

Sally Mihyoung Lee

Sally is a Senior Staff Tech Lead Manager who leads ML Quality and ML Ops for the Uber AI/ML platform, Michelangelo. With over 15 years of experience, she is passionate about applying machine learning solutions to large scale business problems.

Posted by Min Cai, Joseph Wang, Anupriya Mouleesha, Sally Mihyoung Lee