Task-1

Clean and prepare a raw dataset (with nulls, duplicates, inconsistent formats). 1) Duplicates Removed: Checked and dropped any repeated rows. Dataset kept at 8,807 unique entries. 2) Nulls (Missing Data) Fixed Title: Any row without a title was dropped (essential field). Country: Filled missing with “Unknown”. Director: Filled missing with “Unknown”. Cast: Filled missing with “Not Provided”. Date Added: Filled missing with “Unknown”, then tried converting to proper date format. Rating: Filled missing with “Unrated”. Duration: Filled missing with “Unknown”. 3) Standardized Formats: Converted date_added into proper datetime format where possible. Normalized text fields: country → Title Case (e.g., united states → United States). type → Title Case (movie → Movie). rating → Uppercase (pg → PG).