# High Cost
- Obtaining data
- Labeling data
  - 30 minutes per record
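
To see how quickly a per-record labeling time adds up, here is a rough back-of-the-envelope sketch. Only the 30 minutes per record comes from the notes above; the dataset size (10,000 records) and the hourly rate (20.0) are made-up assumptions for illustration, and `labeling_cost` is a hypothetical helper.

```python
def labeling_cost(num_records, minutes_per_record=30, hourly_rate=20.0):
    """Return (total_hours, total_cost) for labeling num_records by hand.

    minutes_per_record defaults to the 30 minutes quoted in the notes;
    hourly_rate is an assumed figure, not taken from the notes.
    """
    total_hours = num_records * minutes_per_record / 60
    return total_hours, total_hours * hourly_rate


# Assumed dataset size of 10,000 records.
hours, cost = labeling_cost(10_000)
print(f"{hours:.0f} hours, ${cost:,.0f}")  # prints "5000 hours, $100,000"
```

Under these assumptions a modest dataset already needs thousands of person-hours, which is why labeling often dominates the cost.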
# Bad Quality
- Raw data quality
- Labeling quality
Some common problems with raw data are:
- noise
- bias
- low predictive power
- outdated examples → Concept Drift
- outliers
- leakage
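
Two of the problems listed above, outliers and leakage, can be screened for with very simple checks. The sketch below is one possible approach and not a complete audit: it flags outliers with the common 1.5 × IQR rule and looks for rows shared between a training and a test split, which is only one of several ways leakage can occur. The helper names (`iqr_outliers`, `split_overlap`) and the sample data are hypothetical.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def split_overlap(train_rows, test_rows):
    """Return rows present in both splits (rows must be hashable, e.g. tuples)."""
    return set(train_rows) & set(test_rows)


# Made-up example data: one implausible age, one row shared across splits.
ages = [23, 25, 27, 29, 31, 33, 35, 240]
print(iqr_outliers(ages))

train = [("a", 1), ("b", 2), ("c", 3)]
test = [("c", 3), ("d", 4)]
print(split_overlap(train, test))
```

Checks like these are cheap to run before training. Noise, bias, low predictive power, and concept drift usually need domain knowledge or monitoring over time and are not covered by this sketch.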