HT
Projects/Depression Project
Completed2025Design, web development, data analysis, and modeling

Depression Project

Web application combining data analysis, machine learning, advanced statistics and visualization to identify depression-related risk factors.

Depression Project

Project overview

A data analysis and machine learning project focused on studying sociodemographic and behavioral variables related to depression. The app combines CRUD data management, modeling, analytical visualization, and semantic search.

Current state

  • Web application developed with data management and analytical interface.
  • Classification, regression, and clustering models implemented.
  • Analytical visualizations and statistical exploration integrated.
  • Embedding-based semantic search added.

Tech stack

PythonFlaskPandasScikit-learnMongoDBHTML/CSS

Tags & Code

Data ScienceMachine LearningFlaskMongoDB

Private code (academic project)

Vision

  • Build a unified platform to explore and analyze depression-related data.
  • Highlight relationships between behavioral, sociodemographic, and risk variables.
  • Combine statistical exploration, machine learning models, and visualization in a usable web interface.

Architecture

  • Flask web application with CRUD management of sociodemographic data.
  • Data processing and storage pipelines using MongoDB Atlas.
  • Analytical modules: classification, regression, clustering, and dimensionality reduction (PCA).
  • Result visualization: analytical charts, confusion matrices, statistical exploration.
  • Embedding-based semantic search to enrich data exploration.

Roadmap

  • Phase 1: structuring and managing sociodemographic data.
  • Phase 2: statistical analysis, correlations, and PCA exploration.
  • Phase 3: machine learning implementation and result visualization.
  • Phase 4: semantic search integration and analytical interface improvement.

Engineering decisions

  • Flask for a lightweight architecture suited to a web analytical project.
  • MongoDB Atlas for storage flexibility and easy pipeline integration.
  • Clear separation between data management, statistical analysis, modeling, and visualization.
  • Semantic search to go beyond a simple data viewing interface.

Possible improvements

  • Improve the UI to make exploration more intuitive.
  • Strengthen comparison between machine learning models.
  • Further document pipelines and analytical assumptions.

Lessons learned

  • A relevant data project requires as much rigor in data management as in modeling.
  • Visualization plays a key role in making results interpretable.
  • An analytical web interface makes a scientific project far more usable.

Screenshots

Depression project preview

Depression project preview