Depression Project

Web application combining data analysis, machine learning, advanced statistics and visualization to identify depression-related risk factors.

Project overview

A data analysis and machine learning project focused on studying sociodemographic and behavioral variables related to depression. The app combines CRUD data management, modeling, analytical visualization, and semantic search.

Current state

Web application developed with data management and analytical interface.
Classification, regression, and clustering models implemented.
Analytical visualizations and statistical exploration integrated.
Embedding-based semantic search added.

Tech stack

PythonFlaskPandasScikit-learnMongoDBHTML/CSS

Tags & Code

Data ScienceMachine LearningFlaskMongoDB

Private code (academic project)

Vision

Build a unified platform to explore and analyze depression-related data.
Highlight relationships between behavioral, sociodemographic, and risk variables.
Combine statistical exploration, machine learning models, and visualization in a usable web interface.

Architecture

Flask web application with CRUD management of sociodemographic data.
Data processing and storage pipelines using MongoDB Atlas.
Analytical modules: classification, regression, clustering, and dimensionality reduction (PCA).
Result visualization: analytical charts, confusion matrices, statistical exploration.
Embedding-based semantic search to enrich data exploration.

Roadmap

Phase 1: structuring and managing sociodemographic data.
Phase 2: statistical analysis, correlations, and PCA exploration.
Phase 3: machine learning implementation and result visualization.
Phase 4: semantic search integration and analytical interface improvement.

Engineering decisions

Flask for a lightweight architecture suited to a web analytical project.
MongoDB Atlas for storage flexibility and easy pipeline integration.
Clear separation between data management, statistical analysis, modeling, and visualization.
Semantic search to go beyond a simple data viewing interface.

Possible improvements

Improve the UI to make exploration more intuitive.
Strengthen comparison between machine learning models.
Further document pipelines and analytical assumptions.

Lessons learned

A relevant data project requires as much rigor in data management as in modeling.
Visualization plays a key role in making results interpretable.
An analytical web interface makes a scientific project far more usable.

Screenshots

Depression project preview

← All projects Contact me