Data Cleaning

Data cleaning involves handling missing values, duplicates, and inconsistent data. Key methods: dropna(), fillna(), drop_duplicates(), replace().

import pandas as pd
import numpy as np

df = pd.DataFrame({"a": [1, np.nan, 3], "b": [4, 5, np.nan]})
print(df.fillna(0))

Practice Problems (0/2 solved)

Easy

Count the number of missing values in each column and print the result.

Loading Python...

Loading Python runtime (Pyodide)...

Medium

Fill missing age values with the mean age and print the result.

Loading Python...

Loading Python runtime (Pyodide)...