NEW
Proxify is bringing transparency to tech team performance based on research conducted at Stanford. An industry first, built for engineering leaders.
Learn more
Dimitrios M.
Data Engineer & Scientist
Dimitrios is a Senior Data Engineer and Data Scientist with a PhD in Mathematical Neuroscience from The Rockefeller University.
He specializes in neural networks, data modeling, and dynamical systems, with extensive expertise in ETL development, clinical data modeling, and large-scale data processing across healthcare, life sciences, and AI sectors.
He has delivered complex data solutions for leading organizations, including SAP, EPAM (Odysseus Inc.), and Femtec Health, building scalable data pipelines on Databricks, AWS, and Snowflake, and implementing predictive models to improve healthcare outcomes.
With a unique blend of scientific rigor and engineering precision, Dimitrios translates advanced theory into production-ready, intelligent data systems.
Main expertise
- Python 8 years

- SQL 8 years
- Data Science 8 years
Other skills
- R (programming language) 4 years

- Machine Learning 4 years

- Matlab 3 years

Selected experience
Employment
Senior Data Engineer / ETL Developer
Odysseus Inc (part of EPAM Systems) - 2 years 11 months
Odysseus Inc, a subsidiary of EPAM Systems, provides data engineering and analytics solutions for the healthcare and pharmaceutical industries, focusing on OMOP Common Data Model (CDM) integrations.
- Led the development and optimization of ETL pipelines that transformed healthcare data into the OMOP Common Data Model (CDM) for multiple global clients.
- Designed and implemented data architecture solutions to enhance scalability and maintainability across distributed environments.
- Collaborated with analysts and data scientists to ensure data quality, validation, and standardization for clinical research use cases.
- Improved existing Python ETL orchestration libraries, contributing to performance and reliability enhancements.
- Worked with Databricks, Snowflake, Spark, Hadoop, AWS, and Azure Data Factory to build automated data ingestion pipelines.
- Supported deployment processes, version control, and CI/CD workflows using Bitbucket and containerized environments.
Technologies:
- Technologies:
AWS
Databricks
Python
SQL
Azure Data Factory
Bitbucket
- Data Analytics
Snowflake
ETL
Hadoop
Senior Data Scientist
Femtec Health - 2 years 6 months
Femtec Health is a US-based digital healthcare platform specializing in personalized women’s health solutions using AI, predictive analytics, and clinical data modeling.
- Established and led Femtec’s data science team of four, defining technical direction and mentoring data scientists.
- Engineered large-scale ETL processes to integrate and harmonize healthcare claims data from multiple US sources into the OMOP CDM.
- Designed and implemented predictive models for early detection of gynecological conditions using Spark, Redshift, and AWS EMR.
- Applied OHDSI frameworks for model training and external validation across datasets with diverse medical coding standards.
- Developed machine learning algorithms (Random Forest, Lasso, Ridge, Gradient Boosting, Naïve Bayes, kNN) and unsupervised models for clustering microbiome data.
- Built interactive dashboards and visualizations using Seaborn, Plotly, and Cufflinks for executive reporting.
Technologies:
- Technologies:
Docker
AWS
Python
SQL
R (programming language)
Scikit-learn
- Data Analytics
ETL
Machine Learning
Plotly
Redshift
PySpark
Seaborn
AWS EMR
Senior Data Scientist
Vivante Health (Cylinder) - 8 months
Vivante Health is a digital health startup specializing in the management of chronic gastrointestinal diseases through the use of AI and predictive analytics.
- Developed predictive and classification models for gastrointestinal conditions using claims and wearable data.
- Built and optimized data pipelines in GCP and BigQuery, ensuring secure and efficient ingestion of medical records.
- Collaborated on feature engineering, model validation, and deployed Google Cloud AI- models into production systems.
- Contributed to the creation of automated clinical data pipelines using Python, SQL, and Apache Spark.
Technologies:
- Technologies:
Apache Spark
Python
SQL
Google Cloud
Pandas
BigQuery
- Data Analytics
Machine Learning
Research Assistant
The Rockefeller University - 6 years 6 months
The Rockefeller University, located in New York, is a premier biomedical research institution dedicated to advancing neuroscience, biology, and physics.
- Conducted PhD research in Mathematical Neuroscience, modeling neural network dynamics using nonlinear systems and stochastic processes.
- Proposed a mathematical framework for adaptive, self-reconfigurable neural dynamics, which was published in Journal of Statistical Physics (2017).
- Developed neural network models that simulated input-dependent computations in the visual cortex and nonlinear wave propagation in the cochlea.
- Analyzed large-scale ECoG and multi-electrode array recordings, applying advanced statistical and computational modeling techniques.
- Authored multiple publications in peer-reviewed journals and contributed to theoretical neuroscience research communities.
Technologies:
- Technologies:
Python
TensorFlow
Keras
Matlab
- Neural Network
Education
Doctor Of PhilosophyMathematical Neuroscience
The Rockefeller University · 2014 - 2019
MSc.Mathematics
Brown University · 2010 - 2012
BSc.Mathematics
National and Kapodistrian University of Athens · 2004 - 2008
Find your next developer within days, not months
In a short 25-minute call, we would like to:
- Understand your development needs
- Explain our process to match you with qualified, vetted developers from our network
- You are presented the right candidates 2 days in average after we talk
