Dimitrios M.

Data Engineer & Scientist

Dimitrios is a Senior Data Engineer and Data Scientist with a PhD in Mathematical Neuroscience from The Rockefeller University.

He specializes in neural networks, data modeling, and dynamical systems, with extensive expertise in ETL development, clinical data modeling, and large-scale data processing across healthcare, life sciences, and AI sectors.

He has delivered complex data solutions for leading organizations, including SAP, EPAM (Odysseus Inc.), and Femtec Health, building scalable data pipelines on Databricks, AWS, and Snowflake, and implementing predictive models to improve healthcare outcomes.

With a unique blend of scientific rigor and engineering precision, Dimitrios translates advanced theory into production-ready, intelligent data systems.

Main expertise

  • Python
    Python 8 years
  • SQL
    SQL 8 years
  • Data Science 8 years

Other skills

  • R (programming language)
    R (programming language) 4 years
  • Machine Learning
    Machine Learning 4 years
  • Matlab
    Matlab 3 years
Dimitrios

Dimitrios M.

Greece

Get started

Selected experience

Employment

  • Senior Data Engineer / ETL Developer

    Odysseus Inc (part of EPAM Systems) - 2 years 11 months

    Odysseus Inc, a subsidiary of EPAM Systems, provides data engineering and analytics solutions for the healthcare and pharmaceutical industries, focusing on OMOP Common Data Model (CDM) integrations.

    • Led the development and optimization of ETL pipelines that transformed healthcare data into the OMOP Common Data Model (CDM) for multiple global clients.
    • Designed and implemented data architecture solutions to enhance scalability and maintainability across distributed environments.
    • Collaborated with analysts and data scientists to ensure data quality, validation, and standardization for clinical research use cases.
    • Improved existing Python ETL orchestration libraries, contributing to performance and reliability enhancements.
    • Worked with Databricks, Snowflake, Spark, Hadoop, AWS, and Azure Data Factory to build automated data ingestion pipelines.
    • Supported deployment processes, version control, and CI/CD workflows using Bitbucket and containerized environments.

    Technologies:

    • Technologies:
    • AWS AWS
    • Databricks Databricks
    • Python Python
    • SQL SQL
    • Azure Data Factory Azure Data Factory
    • Bitbucket Bitbucket
    • Data Analytics
    • Snowflake Snowflake
    • ETL ETL
    • Hadoop Hadoop
  • Senior Data Scientist

    Femtec Health - 2 years 6 months

    Femtec Health is a US-based digital healthcare platform specializing in personalized women’s health solutions using AI, predictive analytics, and clinical data modeling.

    • Established and led Femtec’s data science team of four, defining technical direction and mentoring data scientists.
    • Engineered large-scale ETL processes to integrate and harmonize healthcare claims data from multiple US sources into the OMOP CDM.
    • Designed and implemented predictive models for early detection of gynecological conditions using Spark, Redshift, and AWS EMR.
    • Applied OHDSI frameworks for model training and external validation across datasets with diverse medical coding standards.
    • Developed machine learning algorithms (Random Forest, Lasso, Ridge, Gradient Boosting, Naïve Bayes, kNN) and unsupervised models for clustering microbiome data.
    • Built interactive dashboards and visualizations using Seaborn, Plotly, and Cufflinks for executive reporting.

    Technologies:

    • Technologies:
    • Docker Docker
    • AWS AWS
    • Python Python
    • SQL SQL
    • R (programming language) R (programming language)
    • Scikit-learn Scikit-learn
    • Data Analytics
    • ETL ETL
    • Machine Learning Machine Learning
    • Plotly Plotly
    • Redshift Redshift
    • PySpark PySpark
    • Seaborn Seaborn
    • AWS EMR AWS EMR
  • Senior Data Scientist

    Vivante Health (Cylinder) - 8 months

    Vivante Health is a digital health startup specializing in the management of chronic gastrointestinal diseases through the use of AI and predictive analytics.

    • Developed predictive and classification models for gastrointestinal conditions using claims and wearable data.
    • Built and optimized data pipelines in GCP and BigQuery, ensuring secure and efficient ingestion of medical records.
    • Collaborated on feature engineering, model validation, and deployed Google Cloud AI- models into production systems.
    • Contributed to the creation of automated clinical data pipelines using Python, SQL, and Apache Spark.

    Technologies:

    • Technologies:
    • Apache Spark Apache Spark
    • Python Python
    • SQL SQL
    • Google Cloud Google Cloud
    • Pandas Pandas
    • BigQuery BigQuery
    • Data Analytics
    • Machine Learning Machine Learning
  • Research Assistant

    The Rockefeller University - 6 years 6 months

    The Rockefeller University, located in New York, is a premier biomedical research institution dedicated to advancing neuroscience, biology, and physics.

    • Conducted PhD research in Mathematical Neuroscience, modeling neural network dynamics using nonlinear systems and stochastic processes.
    • Proposed a mathematical framework for adaptive, self-reconfigurable neural dynamics, which was published in Journal of Statistical Physics (2017).
    • Developed neural network models that simulated input-dependent computations in the visual cortex and nonlinear wave propagation in the cochlea.
    • Analyzed large-scale ECoG and multi-electrode array recordings, applying advanced statistical and computational modeling techniques.
    • Authored multiple publications in peer-reviewed journals and contributed to theoretical neuroscience research communities.

    Technologies:

    • Technologies:
    • Python Python
    • TensorFlow TensorFlow
    • Keras Keras
    • Matlab Matlab
    • Neural Network

Education

  • Doctor Of PhilosophyMathematical Neuroscience

    The Rockefeller University · 2014 - 2019

  • MSc.Mathematics

    Brown University · 2010 - 2012

  • BSc.Mathematics

    National and Kapodistrian University of Athens · 2004 - 2008

Find your next developer within days, not months

In a short 25-minute call, we would like to:

  • Understand your development needs
  • Explain our process to match you with qualified, vetted developers from our network
  • You are presented the right candidates 2 days in average after we talk

Not sure where to start? Let’s have a chat