Data Analysis
NLP Analysis
This endpoint provides natural language processing (NLP) analysis on the uploaded text data, covering named entity recognition, text similarity, language detection, text classification, sentiment analysis, keyword extraction, topic modeling, and text summarization.
Endpoint: POST /nlp-analysis
Request Parameters
File Upload
file
(required): The file containing the text data to be analyzed. Supported formats include CSV.text_column
(required): The name of the column containing the text data.analysis_type
(optional): The specific type of analysis to perform (ner
,similarity
,language
,classification
,sentiment
,keywords
,topics
,summary
). If omitted, all analyses will be performed.num_keywords
(optional, default=5): The number of keywords to extract during keyword extraction.num_topics
(optional, default=3): The number of topics for topic modeling.summary_length
(optional, default=3): The number of sentences in the text summary.
Example Request
Analysis Components
- Named Entity Recognition (NER): Identifies entities like organizations, people, locations, dates, and products.
- Text Similarity: Calculates cosine similarity between text samples.
- Language Detection: Detects the language of the text data.
- Text Classification: Classifies text as positive, negative, or neutral based on sentiment scores.
- Sentiment Analysis: Uses VADER to analyze sentiment scores (positive, negative, neutral, compound).
- Keyword Extraction: Extracts keywords based on term frequency.
- Topic Modeling (LDA): Discovers topics and related keywords from the text.
- Text Summarization: Generates a brief extractive summary of the text data.