Felipe A.

Data Scientist

Felipe is a highly skilled Data Scientist with over seven years of experience across fintech, proptech, edtech, and consultancy. He combines strong technical expertise in machine learning with the ability to effectively communicate complex concepts to stakeholders.

His technical proficiency includes working with advanced Data Science and ML tools such as Snowflake, dbt, Airflow, and MLflow. A career highlight was his role at Cambridge University, where he developed and taught an advanced online data science course, showcasing both his subject-matter expertise and ability to simplify complex topics. Additionally, at Outra, he played a key role in securing a multi-million-dollar contract with Zoopla.

Felipe’s unique blend of deep technical knowledge and strong communication skills positions him as a standout professional in the field of Data Science.

Main expertise

  • Pytest
    Pytest 2 years
  • AWS
    AWS 3 years
  • Bash
    Bash 4 years

Other skills

  • Agile
    Agile 4 years
  • PyTorch
    PyTorch 2 years
  • Asana
    Asana 1 years
Felipe

Felipe A.

United Kingdom

Get started

Selected experience

Employment

  • Lead Data Scientist

    Rylee - 10 months

    • Worked at Rylee, an e-commerce platform that provided customers with product insights and market analysis to improve strategies on Bol.com and Amazon.
    • Earned Databricks certification in Generative AI, demonstrating expertise with RAG and Agent models.
    • Developed a large-scale sales forecasting model using Rylee’s internal data and scraped data from Bol.com to predict product sales and identify bestsellers.
    • Built an API with Flask and AWS Lambda to handle product queries, delivering insights and sales forecasts.
    • Designed and implemented ETL pipelines using dbt, Airflow, and Spark to automate feature engineering, including an asynchronous solution for efficient data retrieval from the Bol.com API while respecting rate limits and parallelizing at scale; also automated data reconciliation across multiple sellers.
    • Applied PyTorch and Spark to optimize high-performance machine learning models.

    Technologies:

    • Technologies:
    • AWS AWS
    • ChatGPT API ChatGPT API
    • Data Science
    • ETL ETL
    • NumPy NumPy
    • Pandas Pandas
    • Python Python
    • SQL SQL
    • XGBoost XGBoost
    • TensorFlow TensorFlow
    • Scikit-learn Scikit-learn
    • Git Git
    • Machine Learning Machine Learning
    • Apache Spark Apache Spark
  • Lead Data Scientist

    Homemove - 3 months

    • Worked at Homemove, a comprehensive platform offering moving-related services, including surveys, removals, and mortgages, integrated within a single app.
    • Developed an LLM-powered negotiation tool that allowed users to obtain quotes and negotiate prices via an AI chatbot, automatically added customers to the CRM, and alerted the sales team upon successful negotiation, using OpenAI Assistant and GPT models.
    • Led a scalable data transformation initiative, leveraging Snowflake for cloud data warehousing and Sigma for BI and visualization.
    • Designed and implemented ETL pipelines from scratch using Snowflake, Python, SQL, dbt, and Airflow to automate data ingestion and transformation.
    • Built a predictive modeling solution to reduce marketing costs and improve targeting by identifying high-potential home movers.
    • Applied PyTorch and Snowpark for advanced machine learning to optimize high-performance models.
    • Delivered a predictive model that was planned for use in attracting investment during Homemove's Series A funding.

    Technologies:

    • Technologies:
    • Pytest Pytest
    • AWS AWS
    • ChatGPT API ChatGPT API
    • Data Science
    • ETL ETL
    • Keras Keras
    • Matplotlib Matplotlib
    • NumPy NumPy
    • Pandas Pandas
    • Python Python
    • Plotly Plotly
    • PyTorch PyTorch
    • SQL SQL
    • SQLAlchemy SQLAlchemy
    • Streamlit Streamlit
    • XGBoost XGBoost
    • TensorFlow TensorFlow
    • Scikit-learn Scikit-learn
    • Git Git
    • Snowflake Snowflake
    • Machine Learning Machine Learning
    • Apache Spark Apache Spark
  • Data Science Instructor and Course Developer

    Cambridge University & FourthRev - 8 months

    • Worked as a Data Science specialist at FourthRev, creating and teaching an advanced online data science course for Cambridge University students.
    • Developed and delivered a comprehensive curriculum covering Neural Networks, NLP for AI, Unsupervised Learning, and advanced Decision Tree algorithms including XGBoost.
    • Applied hands-on machine learning implementation from scratch and employed innovative teaching methods to enhance student engagement and learning outcomes.
    • Demonstrated deep technical expertise in data science and machine learning, earning recognition from academic peers for effective teaching and curriculum design.

    Technologies:

    • Technologies:
    • Pytest Pytest
    • Data Science
    • Matplotlib Matplotlib
    • Neural Network
    • NumPy NumPy
    • Pandas Pandas
    • Plotly Plotly
    • XGBoost XGBoost
    • Scikit-learn Scikit-learn
    • Machine Learning Machine Learning
  • Senior Data Scientist

    Outra - 2 years

    • Worked at Outra, a data-driven property insight company, specializing in delivering household-level data to optimize client services.
    • Migrated the platform from Dataiku to a custom in-house Intelligence Fabric using MLflow, Airflow, Snowflake, GitHub Actions, AWS, and DBT for data engineering.
    • Developed two major predictive models forecasting household listing and sale/rent timelines, enabling a multi-million-pound partnership with Zoopla.
    • Applied LLMs for code documentation, coding assistance, and interactive chatbots for dashboards and customer-facing data.
    • Built ETL/ELT pipelines to transform raw data and prepare it for modeling.
    • Created visualizations and maps using KeplerGI, Seaborn, and Dataiku to help non-technical users interpret complex data.

    Technologies:

    • Technologies:
    • Pytest Pytest
    • AWS AWS
    • ChatGPT API ChatGPT API
    • Data Science
    • ETL ETL
    • Keras Keras
    • Matplotlib Matplotlib
    • Neural Network
    • NumPy NumPy
    • Pandas Pandas
    • Python Python
    • Plotly Plotly
    • PyTorch PyTorch
    • SQL SQL
    • SQLAlchemy SQLAlchemy
    • Streamlit Streamlit
    • XGBoost XGBoost
    • TensorFlow TensorFlow
    • Scikit-learn Scikit-learn
    • Git Git
    • Apache Airflow Apache Airflow
    • Snowflake Snowflake
    • dbt dbt
    • Machine Learning Machine Learning
    • Apache Spark Apache Spark
  • Senior Data Scientist

    Belmont Green - 2 years 6 months

    • Worked at Belmont Green, a specialist mortgage lending company turned bank, providing financial and mortgage solutions to financially affected customers.
    • Created a conversion model using survival analysis techniques and managed the project from inception to production.
    • Built machine learning algorithms and statistical models for time series data, focusing on retention, lifetime value, and expected loss models.
    • Owned projects end-to-end, ensuring proofs of concept were implemented and deployed successfully in production.
    • Implemented machine learning algorithms for cashflow, early redemption, default, pre-payment, and conversion models using Python and R.
    • Applied clustering and segmentation techniques to analyze product usage and customer behavior for marketing and strategic purposes.

    Technologies:

    • Technologies:
    • Data Science
    • Keras Keras
    • Matplotlib Matplotlib
    • Neural Network
    • NumPy NumPy
    • Pandas Pandas
    • Plotly Plotly
    • PyTorch PyTorch
    • SQLAlchemy SQLAlchemy
    • XGBoost XGBoost
    • Scikit-learn Scikit-learn
    • Machine Learning Machine Learning

Education

  • Standalone courseMachine Learning Specialization

    Stanford University · 2023 - 2023

  • Standalone courseMachine Learning

    Massachusetts Institute of Technology · 2021 - 2022

  • BSc.Business Management with maths

    Kingston University · 2013 - 2016

  • BSc.Civil Engineering

    Adolfo Ibanez University · 2011 - 2013

Find your next developer within days, not months

In a short 25-minute call, we would like to:

  • Understand your development needs
  • Explain our process to match you with qualified, vetted developers from our network
  • You are presented the right candidates 2 days in average after we talk

Not sure where to start? Let’s have a chat