Oscar C.

Data Engineer

Oscar er en højt specialiseret Senior Data Engineer med 13 års kommerciel erfaring. Han har arbejdet i forskellige brancher som AdTech, FinTech, HealthTech og Enterprise Software, hvor han har demonstreret sin ekspertise på tværs af forskellige domæner.

Oscar har fået værdifuld erfaring med at arbejde i både USA og Holland. Hans tekniske færdigheder omfatter brug af Golang, Python, BigQuery, Apache Spark på Databricks (AWS) og Scala til at skabe robuste softwaresystemer.

En af Oscars stolteste bedrifter var at udvikle en patenteret idé med USPTO, som han med succes bragte på markedet. Dette projekt viser hans innovation og evne til at bygge bro mellem koncept og kommercialisering.

Ud over sin tekniske ekspertise har Oscar demonstreret enestående evner som teamleder gennem hele sin karriere, hvilket yderligere understreger hans evne til at levere resultater af høj kvalitet i komplekse projekter.

Hovedekspertise

  • Apache Spark
    Apache Spark 10 år
  • AWS
    AWS 10 år
  • BigQuery
    BigQuery 5 år

Andre færdigheder

  • MySQL
    MySQL 13 år
  • PostgreSQL
    PostgreSQL 13 år
  • ETL
    ETL 11 år
Oscar

Oscar C.

Guatemala

Match med udvikler her

Udvalgt oplevelse

Beskæftigelse

  • Senior MLOps Engineer

    Sago Mini - 2 måneder

    • Promoting Machine Learning models to Production on GCP

    Teknologier:

    • Teknologier:
    • Python Python
    • Vertex AI Vertex AI
  • Tech Lead / MLOps & Optimization Platform

    Occidental Petroleum (Oxy) - 4 måneder

    Summary: Led design and delivery of Oxy’s Optimization Pillar MLOps platform in AWS ● Designed and implemented MLOps platform integrating AWS (S3, ECS/Fargate, SageMaker, Lambda) with Oxy’s ODAP Lakehouse ● Packaged and deployed Python optimization models (Gurobi/Pyomo) with CI/CD pipelines in Azure DevOps + MLflow ● Built PySpark ingestion pipelines from Kabal APIs, SQL Server, and PI systems into ODAP, ensuring governance and schema validation ● Collaborated with data scientists and IT to enable predictive maintenance and vessel scheduling optimization use cases ● Mentored engineers and defined role skill matrices across MLOps, DevOps, Backend, and QA

    Teknologier:

    • Teknologier:
    • AWS AWS
    • Python Python
    • Machine Learning Machine Learning
  • Senior Backend Developer

    Reddit - 5 måneder

    Summary: Backend development in Golang and Python ● Developed new integrations with Notification Platform to send emails to 300k+ users ● Implemented concurrency in email send increasing performance by 98% ● Implemented Spam filters in email send increasing performance further by 46% ● Developed client support for Business Experiences team to tap into Notification Platform, making progress towards goal of deprecating integrations with legacy Mailroom messaging. ● Code reviews and various team activities

    Teknologier:

    • Teknologier:
    • Golang Golang
    • Apache Kafka Apache Kafka
  • Senior Data Engineer

    Curinos - 8 måneder

    • Led data product development on the Databricks Lakehouse platform, ensuring efficient data handling and analysis;

    • Migrated data from MySQL and PostgreSQL databases using AWS Database Migration Service (DMS) to streamline data management;

    • Developed Data Pipelines using Delta Live Tables (DLT) for real-time and batch processing of data;

    • Created a Code Generation tool to automatically generate Scala code for Databricks, enhancing development speed and accuracy;

    • Proficient in Databricks, Scala, and Python, with a strong focus on scalable data engineering solutions.

    Teknologier:

    • Teknologier:
    • MySQL MySQL
    • PostgreSQL PostgreSQL
    • AWS AWS
    • Databricks Databricks
    • Python Python
    • Scala Scala
    • Data Engineering
  • Senior Data Engineer

    Clevertech - 2 flere år 11 måneder

    • Developed a Reporting API for analyzing large-scale advertising campaigns (Golang, BigQuery)
    • Created an Advanced Query Tool in Golang for complex SQL queries, reducing processing time by 50%
    • Implemented Data Modeling for forecasting TV Ads performance to extrapolate impressions, increasing revenue by 20%
    • Debugged and improved complex queries in BigQuery, reducing overall query complexity
    • Enhanced collaboration with the Data Science team by serving as an interface with the Backend team

    Teknologier:

    • Teknologier:
    • Golang Golang
    • SQL SQL
    • Data Engineering
    • BigQuery BigQuery
  • Co-Founder and CTO

    Sciencesheet - 1 år 8 måneder

    • Developed Codegen for ML pipelines (Spark, Scala, Python), accelerating data science processes by 10x
    • Invented and patented Codegen technology for processing millions of spreadsheet rows in Spark using Excel formulas
    • Launched a startup from idea to market within one year
    • Increased market reach by developing plugins for Google Sheets and Microsoft Excel
    • Successfully developed the AWS Backend using Sagemaker Autopilot, Lambda, EC2, SNS, and SES

    Teknologier:

    • Teknologier:
    • AWS AWS
    • Python Python
    • AWS Lambda AWS Lambda
    • Scala Scala
    • AWS EC2 AWS EC2
    • Hadoop Hadoop
    • Microsoft Excel Microsoft Excel
  • Data Scientist / Engineering Manager

    PayPal (Xoom) - 3 flere år 9 måneder

    • Tech Lead for Data Science and Engineering team (Spark, Scala, Python)
    • Managed a team of five data scientists and data engineers
    • Developed a Locations indexer, doubling the speed of finding bank branches in India
    • Increased market coverage for the Sendmoney product by supporting FP&A analyses in Spark instead of Excel
    • Enhanced the effective reach of push notifications by 20% through segmentation analyses in Spark

    Teknologier:

    • Teknologier:
    • Apache Spark Apache Spark
    • Python Python
    • Scala Scala
    • Data Engineering
    • Team Leading
    • Microsoft Excel Microsoft Excel
  • Cloud Engineer

    Mendix - 3 flere år 5 måneder

    • Developed the Mendix Cloud platform using a Mendix code generation tool, streamlining the development process;

    • Architected robust security protocols for the Mendix Enterprise Cloud Platform, ensuring data protection and compliance;

    • Automated parallel firewall installation and configuration across thousands of cloud nodes, enhancing security and operational efficiency;

    • Reverse-engineered Mendix code generation to reproduce applications using the open-source WebDSL language, expanding platform versatility and open-source integration.

    Teknologier:

    • Teknologier:
    • AWS AWS
    • Python Python
    • Data Engineering
  • Summer Intern

    Google - 3 måneder

    • Conducted data mining on a Git repository containing 70 Apache projects, extracting valuable insights for analysis;

    • Presented the project findings at ApacheCon US in Atlanta, showcasing expertise and contributing to the open-source community.

    Teknologier:

    • Teknologier:
    • Apache Spark Apache Spark
    • Data Engineering
    • Git Git

Uddannelse

  • MSc.Computer Science

    Delft University of Technology · 2009 - 2011

  • MSc.Management and Technology

    Delft University of Technology · 2007 - 2009

  • MSc.Management and Technology

    Delft University of Technology · 2007 - 2009

  • BSc.Computer Science

    Universidad Francisco Marroquín · 1997 - 2002

Find din næste udvikler inden for få dage, ikke måneder

Book en 25-minutters samtale, hvor vi:

  • udfører behovsafdækning med fokus på udviklingsopgaver
  • Forklar vores proces, hvor vi matcher dig med kvalificerede, godkendte udviklere fra vores netværk
  • beskriver de næste trin for at finde det perfekte match på få dage

Lad os snakke om det