API Reference

1. Data Cleaning Endpoint

Description Cleans the dataset according to user-defined options, including handling missing values, detecting outliers, and removing duplicates.

Endpoint POST /cleaning

Request Format

Content-Type: application/json Authorization: Bearer <your-api-token>

1{
2   "dataset_url": "https://example.com/mydata.csv",
3   "cleaning_options": {
4      "handle_missing": "mean",
5      "detect_outliers": true,
6      "remove_duplicates": true
7   }
8}

Response Format

1{
2   "status": "success",
3   "cleaned_dataset_url": "https://preppydata.com/cleaned/mydata_cleaned.csv",
4   "summary": {
5      "rows_removed": 5,
6      "missing_values_handled": true,
7      "outliers_detected": 12
8   }
9}
1{
2   "status": "error",
3   "message": "Dataset URL is invalid or inaccessible."
4}

Usage Example

 1curl -X POST https://api.preppydata.com/v1/cleaning \
 2-H "Content-Type: application/json" \
 3-H "Authorization: Bearer YOUR_API_TOKEN" \
 4-d '{
 5   "dataset_url": "https://example.com/mydata.csv",
 6   "cleaning_options": {
 7      "handle_missing": "mean",
 8      "detect_outliers": true,
 9      "remove_duplicates": true
10   }
11}'

2. Feature Selection Endpoint

Description Selects the most important features from the dataset using a user-defined algorithm.

Endpoint POST /feature-selection

Request Format

Content-Type: application/json Authorization: Bearer <your-api-token>

1{
2   "dataset_url": "https://example.com/mydata.csv",
3   "selection_method": "random_forest",
4   "num_features": 5
5}

Response Format

1{
2   "status": "success",
3   "selected_features": ["feature_1", "feature_3", "feature_5", "feature_7", "feature_9"],
4   "dataset_url_with_selected_features": "https://preppydata.com/processed/mydata_selected.csv"
5}
1{
2   "status": "error",
3   "message": "Selection method is invalid or unsupported."
4}

Usage Example

1curl -X POST https://api.preppydata.com/v1/feature-selection \
2-H "Content-Type: application/json" \
3-H "Authorization: Bearer YOUR_API_TOKEN" \
4-d '{
5   "dataset_url": "https://example.com/mydata.csv",
6   "selection_method": "random_forest",
7   "num_features": 5
8}'

3. Encoding Endpoint

Description Converts categorical data into One-hot, Label, or Target encoding.

Endpoint POST /encoding

Request Format

Content-Type: application/json Authorization: Bearer <your-api-token>

1{
2   "dataset_url": "https://example.com/mydata.csv",
3   "encoding_type": "one_hot",
4   "columns_to_encode": ["column1", "column2"]
5}

Response Format

1{
2   "status": "success",
3   "encoded_dataset_url": "https://preppydata.com/processed/mydata_encoded.csv"
4}
1{
2   "status": "error",
3   "message": "Column names are invalid or missing in the dataset."
4}

Usage Example

1curl -X POST https://api.preppydata.com/v1/encoding \
2-H "Content-Type: application/json" \
3-H "Authorization: Bearer YOUR_API_TOKEN" \
4-d '{
5   "dataset_url": "https://example.com/mydata.csv",
6   "encoding_type": "one_hot",
7   "columns_to_encode": ["column1", "column2"]
8}'

Error Codes

Code

Message

Description

400

Bad Request

The request parameters are invalid or missing.

401

Unauthorized

The API token is invalid.

500

Internal Server Error

An unexpected error occurred on the server.