Data Analysis
Descriptive Statistics
This endpoint allows users to upload a file and receive descriptive statistics of the dataset, including numerical and categorical statistics, and optionally generate graphs like histograms, boxplots, or bar charts.
Endpoint: POST /descriptive-statistics
Request Parameters
File Upload
file (required): The file to be processed. Supported formats include CSV, Excel, etc.
Columns Selection
columns (optional): Comma-separated list of columns to include in the analysis. If omitted, all columns are included.
Exclude Columns
exclude_columns (optional): Comma-separated list of columns to exclude from the analysis.
Numerical Statistics
num_stats (optional): Comma-separated list of numerical statistics to compute. Default values are ["mean", "median", "std", "min", "max", "percentiles", "skew", "kurtosis"]. Available options: mean, median, std, min, max, percentiles, skew, kurtosis.
Categorical Statistics
cat_stats (optional): Comma-separated list of categorical statistics to compute. Default values are ["count", "unique", "top", "frequency", "mode"]. Available options: count, unique, top, frequency, mode.
Graphs
include_graphs (optional): Boolean specifying whether graphs are included in the response. Default is false.
graph_types (optional): Comma-separated list of graph types to include in the analysis. Default is ["Histogram"]. Available options: Histogram, Boxplot, Bar.
Example Request
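The original example request is not included here, so the following is only a minimal sketch of how a client might call the endpoint. It assumes the file is sent as multipart/form-data, that the other parameters travel as form fields (they could equally be query parameters), that the boolean is encoded as "true"/"false", and that the response is JSON. The helper names and the base URL are hypothetical, and the upload helper relies on the third-party requests package.

```python
from typing import Dict, Iterable, Optional


def build_form_fields(
    columns: Optional[Iterable[str]] = None,
    exclude_columns: Optional[Iterable[str]] = None,
    num_stats: Optional[Iterable[str]] = None,
    cat_stats: Optional[Iterable[str]] = None,
    include_graphs: bool = False,
    graph_types: Optional[Iterable[str]] = None,
) -> Dict[str, str]:
    """Turn Python values into the comma-separated fields the endpoint documents."""
    list_params = {
        "columns": columns,
        "exclude_columns": exclude_columns,
        "num_stats": num_stats,
        "cat_stats": cat_stats,
        "graph_types": graph_types,
    }
    # Leave out parameters that were not given so the server applies its defaults.
    fields = {
        name: ",".join(values)
        for name, values in list_params.items()
        if values is not None
    }
    # Assumption: the boolean is sent as the lowercase string "true" or "false".
    fields["include_graphs"] = "true" if include_graphs else "false"
    return fields


def post_descriptive_statistics(base_url: str, file_path: str, fields: Dict[str, str]):
    """Upload the file together with the form fields (hypothetical helper)."""
    import requests  # third-party; imported lazily so build_form_fields works without it

    with open(file_path, "rb") as handle:
        response = requests.post(
            f"{base_url}/descriptive-statistics",
            files={"file": handle},  # "file" is the documented upload parameter
            data=fields,
            timeout=30,
        )
    response.raise_for_status()
    return response.json()  # assumption: the response body is JSON


# Example usage (base URL and file name are placeholders):
# fields = build_form_fields(num_stats=["mean", "median"], include_graphs=True,
#                            graph_types=["Histogram", "Boxplot"])
# result = post_descriptive_statistics("http://localhost:8000", "data.csv", fields)
```

Keeping the field construction separate from the upload makes it easy to inspect exactly what would be sent before contacting the server.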
Analysis Components
- Numerical Statistics: Computes statistical values for numerical columns. This includes mean, median, standard deviation, minimum, maximum, percentiles, skewness, and kurtosis.
- Categorical Statistics: Provides statistics for categorical columns, such as count, unique values, top value, mode, and frequency distribution.
- Graphs: Generates optional visualizations (e.g., histograms, boxplots, or bar charts) for numerical or categorical data.
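To make the numerical and categorical components concrete, here is a rough sketch of how such statistics could be computed with the Python standard library. It is not the service's implementation: the function names are hypothetical, and the choices of sample standard deviation, quartiles as the reported percentiles, moment-based skewness, excess kurtosis, and treating "top" and "mode" as the same most frequent value are all assumptions, since the endpoint description does not define them.

```python
import statistics
from collections import Counter
from typing import Dict, Sequence


def numerical_summary(values: Sequence[float]) -> Dict[str, object]:
    """Summarise one numerical column; needs at least two values."""
    n = len(values)
    mean = statistics.fmean(values)
    # Central moments (dividing by n) used for skewness and kurtosis.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    # Assumption: "percentiles" means the 25th, 50th and 75th percentiles.
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "mean": mean,
        "median": statistics.median(values),
        "std": statistics.stdev(values),  # assumption: sample std (n - 1)
        "min": min(values),
        "max": max(values),
        "percentiles": {"25": q1, "50": q2, "75": q3},
        "skew": m3 / m2 ** 1.5 if m2 > 0 else 0.0,
        # Assumption: excess kurtosis, so a normal distribution scores 0.
        "kurtosis": m4 / m2 ** 2 - 3.0 if m2 > 0 else 0.0,
    }


def categorical_summary(values: Sequence[str]) -> Dict[str, object]:
    """Summarise one categorical column; needs at least one value."""
    counts = Counter(values)
    # Ties are broken by first appearance in the data.
    most_frequent, _ = counts.most_common(1)[0]
    return {
        "count": len(values),
        "unique": len(counts),
        "top": most_frequent,
        "frequency": dict(counts),  # assumption: full frequency distribution
        "mode": most_frequent,
    }
```

For instance, numerical_summary([1, 2, 3, 4, 5]) yields a mean of 3.0 and quartiles of 2.0, 3.0, and 4.0, while categorical_summary(["a", "b", "a", "c"]) reports "a" as the most frequent value with three unique categories.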