Hasan's Post

Tutorial repository

View on GitHub
30 June 2021

Introduction_ml_in_production_week2

by Hasan

Modelling overview

Key challenges

AI systems = Code + Data

Model development is a iterative process

#### Challenges in model development

Why low average error isn’t good enough

1. Performence on disproportionality examples

  1. Naviagational queries

2. Performence on key slices of the dataset

3. Rare classes.

Establish a baseline

Unstructured data

Strucuted data

Ways to establish a baseline

Tips for getting started

Iteration proces

After doing above process continuously we can reach to our goal

Deployment constraints when picking a model

Yes. if Baseline is already established and you are sure this project will work.

No. If purpose is to establish a baseline and determine what is possible and might be worth pursuing. If the open source implementation is so complex that you will never implement that.

Sanity check for code and algorithm

Error Analysis and performance auditing

Error analysis is also an iterative process

Useful metrics for each tag

Data iteration

Data Augmentation

Check list

Can more data hurt ?

Strurctue data ?

Adding Feature.

Shifting is going on like below

(similarity) (Content description of the restaurant) cold start feature

Data iteration.

Experiment tracking

What to track

Tracking tools

From Big data to good data

Good data

3rd week class notes

tags: