Instructions
Find a dataset in an area that you are interested in. For the purpose of this exercise, avoid text-heavy datasets and image datasets. Make sure the data has at least 4 features.
See Finding Datasets for a few different places where you can find open datasets.
One of them is Kaggle, a community for machine learning and data science with many datasets available for free. It leans towards AI/ML applications, but the data can just as well be used for Data V...
Spend some time exploring this dataset in a tabular format. Here are some things you could do:
- Sort Rows and Columns
- Keyword Searches
- Check for Missing or Inconsistent Data
- Calculate Simple Aggregations - Sum/Count/Mean/Frequency
- Look for Patterns and Trends
- Look for Outliers Manually - are there unusually high or low values?
- Investigate Relationships
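If you would rather explore in code than in a spreadsheet, the steps above could be sketched with pandas roughly as follows. This is only an illustration: the table, its column names (`city`, `region`, `population_k`, `avg_rent`) and all of its values are invented for the example, and the 1.5 × IQR rule is just one common convention for flagging outliers, not a requirement of this exercise.

```python
import pandas as pd

# Hypothetical toy table standing in for your dataset (all values invented).
df = pd.DataFrame({
    "city": ["Austin", "Boston", "Chicago", "Denver", "El Paso", "Fresno"],
    "region": ["South", "Northeast", "Midwest", "West", "South", "West"],
    "population_k": [960, 650, 2700, 710, 680, 540],
    "avg_rent": [1700.0, 3000.0, 1900.0, None, 1100.0, 1400.0],
})

# Sort rows: largest population first.
by_pop = df.sort_values("population_k", ascending=False)

# Keyword search: rows whose region contains "south" (case-insensitive).
south = df[df["region"].str.contains("south", case=False)]

# Check for missing data: number of missing values per column.
missing = df.isna().sum()

# Simple aggregations: mean rent per region and frequency of each region.
mean_rent = df.groupby("region")["avg_rent"].mean()
region_counts = df["region"].value_counts()

# Outliers: flag values more than 1.5 * IQR outside the middle 50% of the data.
q1, q3 = df["population_k"].quantile([0.25, 0.75])
iqr = q3 - q1
is_outlier = (df["population_k"] < q1 - 1.5 * iqr) | (df["population_k"] > q3 + 1.5 * iqr)
outliers = df[is_outlier]

# Relationships: Pearson correlation between two numeric columns
# (rows with a missing value in either column are ignored).
corr = df["population_k"].corr(df["avg_rent"])

print(by_pop, south, missing, mean_rent, region_counts, outliers, corr, sep="\n\n")
```

For your own data, you would replace the toy table with something like `pd.read_csv(...)` pointing at your downloaded file and swap in your own column names.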
Based on what you found, what are some hypotheses you have about your data? Come up with 3 statements that you think you could make based on your observations.
Submission format
Link to dataset
- Statement 1
- Statement 2
- Statement 3