How to Use
PreppyData allows you to preprocess your data easily and efficiently. Follow the detailed instructions below to get started.
Step 1: Upload Your Data
Prepare your dataset in CSV or XLSX format.
Access PreppyData - Note: The official domain address is not yet available. Please use the application directly. - Open the PreppyData application on your computer. - Click on the “Upload” button. - Select your CSV or XLSX file to upload.
Step 2: Select Preprocessing Options
Customize your data preprocessing by choosing from our list of options:
2.1 Data Encoding
Transform categorical variables into numerical format.
Options: - One-Hot Encoding: Creates binary columns for each category. - Label Encoding: Assigns a unique integer to each category.
How to Select: - Navigate to the “Data Encoding” section. - Choose One-Hot Encoding or Label Encoding by checking the corresponding box.
2.2 Outlier Detection
Identify and handle outliers in your data.
Options: - Z-Score: Detects outliers based on standard deviations from the mean. - Interquartile Range (IQR): Uses quartiles to identify outliers. - Local Outlier Factor (LOF): Detects anomalies based on local density.
How to Select: - Go to the “Outlier Detection” section. - Select the desired method(s) by checking the appropriate boxes.
2.3 Missing Value Handling
Address missing data in your dataset.
Options: - Mean Imputation: Replaces missing values with the mean. - Median Imputation: Uses the median to fill missing values. - Deletion: Removes rows or columns with missing values.
How to Select: - Access the “Missing Value Handling” section. - Choose your preferred method.
Note: If you do not select any options, default methods will be applied automatically.
Step 3: Apply Preprocessing
After selecting your options, click on the “Apply” button.
The application will process your data based on your selections.
A progress indicator will show the processing status.
Step 4: Download Processed Data
Once preprocessing is complete, you can review a summary of the changes.
Click on the “Download” button.
Save your processed data in your preferred format (CSV or XLSX).