Data Analysis
General Analysis
This endpoint allows users to upload a file and receive a comprehensive analysis of the dataset, covering data structure, quality, statistical analysis, correlations, and distributions.
Endpoint: POST /general-analysis
Request Parameters
File Upload
file
(required): The file to be processed. Supported formats include CSV.
Example Request
Analysis Components
- Data Structure Analysis: Provides an overview of the dataset, including the number of columns, rows, shape, column names, data types, memory usage, and duplicate records.
- Data Quality Analysis: Evaluates the dataset for missing values, unique value counts, constant columns, and highly imbalanced columns.
- Statistical Analysis: Computes key statistics for numerical columns, including mean, median, standard deviation, minimum, and maximum values. Also calculates skewness and kurtosis for assessing distribution.
- Correlation Analysis: Identifies correlations between numerical features and flags highly correlated features (based on a customizable threshold, default 0.9).
- Distribution Analysis: Analyzes the distribution of numerical and categorical features, including histograms, categorical frequency distributions, and outlier detection using the IQR method.