Arian F.

Data Engineer

Arian is a Data Engineer with more than four years of professional experience. He specializes in designing and optimizing data pipelines using Databricks, SQL, Apache Spark, Python, AWS, and Airflow. He builds scalable, cloud-based solutions and ensures efficient data processing across systems.

One of his standout accomplishments was the optimization of his company’s cloud infrastructure. By conducting in-depth analysis and adjusting resource configurations and utilization, Arian was able to cut costs by 30%, earning widespread recognition for his contributions.

Known for his meticulous attention to detail and thoughtful problem-solving approach, Arian delivers robust, high-performance data solutions across every project.

Main expertise

  • Git
    Git 4 years
  • YAML
    YAML 3 years
  • Fact Data Modeling 3 years

Other skills

  • Jira
    Jira 4 years
  • Microsoft Excel
    Microsoft Excel 3 years
  • AWS S3
    AWS S3 3 years
Arian

Arian F.

Kosovo

Get started

Selected experience

Employment

  • Senior Data Engineer

    Valtech - 1 year 1 month

    • Working with a large global retail client to rebuild and improve their data platform, focusing on practical use of Databricks and Azure tools;
    • Building and maintaining ETL pipelines in Databricks, handling data from source systems through to final tables;
    • Developing a set of employee and HR KPIs that help stakeholders track hiring, retention, and other workforce trends. These KPIs are actively used by leadership to support planning and decision-making;
    • Coordinating with technical and non-technical teams to ensure data is structured correctly and available when needed.

    Technologies:

    • Technologies:
    • AWS AWS
    • Databricks Databricks
    • Apache Spark Apache Spark
    • Python Python
    • SQL SQL
    • AWS S3 AWS S3
    • Azure Azure
    • Azure Data Factory Azure Data Factory
    • DevOps DevOps
    • Data Engineering
    • Git Git
    • ELT
    • Apache Airflow Apache Airflow
    • Data Analytics
    • Data Modeling
    • Pytest Pytest
    • Dimensional modeling
    • Fact Data Modeling
    • YAML YAML
    • PySpark PySpark
    • GitHub Actions GitHub Actions
  • Senior Data Engineer

    Starbucks - 9 months

    • Migrated production pipelines from Alteryx to Databricks, improving end-to-end performance and making the workflows easier to run and maintain;
    • Led the move from Databricks Hive Metastore to Unity Catalog, standardizing governance and access without disrupting existing workloads;
    • Led the build of a new employee movement tracking product end-to-end + data model where I delivered clean, consistent data that powers the dashboard.

    Technologies:

    • Technologies:
    • Databricks Databricks
    • Python Python
    • SQL SQL
    • Azure Data Factory Azure Data Factory
    • Data Engineering
    • ELT
    • Apache Airflow Apache Airflow
    • Data Analytics
    • Data Modeling
    • Fact Data Modeling
    • Data Governance
    • PySpark PySpark
    • Data Quality
  • Data Engineer - Consultant

    XponentL Data - 9 months

    • Modernized the data infrastructure for a global oil and gas company by optimizing the data architecture to enhance efficiency and improve accessibility.
    • Led the end-to-end development of Key Performance Indicators (KPIs), from gathering requirements to deploying solutions in production environments.
    • Ensured that KPIs were fully aligned with business objectives and served as critical tools for driving organizational success.
    • Implemented best practices for data management and governance, supporting the long-term scalability and reliability of the data ecosystem.

    Technologies:

    • Technologies:
    • AWS AWS
    • Databricks Databricks
    • Apache Spark Apache Spark
    • Python Python
    • SQL SQL
    • AWS Lambda AWS Lambda
    • AWS S3 AWS S3
    • Azure Azure
    • Microsoft Power BI Microsoft Power BI
    • Azure Data Factory Azure Data Factory
    • Pandas Pandas
    • DevOps DevOps
    • Data Engineering
    • Jira Jira
    • Git Git
    • ELT
    • Apache Airflow Apache Airflow
    • Data Analytics
    • Snowflake Snowflake
    • Data Modeling
    • Dimensional modeling
    • Fact Data Modeling
    • YAML YAML
    • PySpark PySpark
  • Python Lecturer

    Creative Hub - 10 months

    • Designed and delivered engaging Python lectures for a data science bootcamp, with a focus on data analysis and introductory machine learning concepts.
    • Developed practical lesson plans that emphasized hands-on learning and real-world applications of Python.
    • Guided students through foundational Python programming, ensuring clarity in core data science and machine learning principles.
    • Provided personalized support to students, addressing individual challenges and fostering confidence in applying Python skills to projects.

    Technologies:

    • Technologies:
    • Project Management
    • Python Python
    • Data Engineering
  • Data Enginner

    Raiffeisen Tech - 2 years 1 month

    • Achieved a 30% reduction in AWS costs through strategic optimization and resource management.
    • Led the migration of a data lake to Databricks, ensuring seamless integration and improved performance.
    • Managed end-to-end data engineering processes, including data extraction, transformation, and loading (ETL) pipelines.
    • Leveraged cloud environments for efficient data storage, maintenance, and scalability.
    • Developed detailed data products tailored to clients’ requirements, earning recognition for delivering high-quality solutions.

    Technologies:

    • Technologies:
    • AWS AWS
    • Databricks Databricks
    • Apache Spark Apache Spark
    • Python Python
    • SQL SQL
    • AWS Lambda AWS Lambda
    • AWS S3 AWS S3
    • NumPy NumPy
    • DevOps DevOps
    • Data Engineering
    • AWS Athena AWS Athena
    • Jira Jira
    • Git Git
    • ELT
    • Apache Airflow Apache Airflow
    • Data Analytics
    • Data Modeling
    • Amazon CloudWatch Amazon CloudWatch
    • Pytest Pytest
    • Dimensional modeling
    • Fact Data Modeling
    • Apache Iceberg Apache Iceberg
    • YAML YAML
    • Salesforce Salesforce
    • PySpark PySpark
    • Microsoft Excel Microsoft Excel
  • Data Engineer/Analyst

    Vianova AI - 1 year 7 months

    • Developed the company’s first patient reporting system, converting raw datasets into actionable insights.
    • Utilized SQL and Python to extract, clean, process, and validate real-world patient data.
    • Designed and implemented data transformation workflows to ensure accuracy and reporting consistency.
    • Created visualizations and reports to present data in a user-friendly format, supporting ongoing analysis and decision-making.

    Technologies:

    • Technologies:
    • Flask Flask
    • Postman Postman
    • Python Python
    • SQL SQL
    • NumPy NumPy
    • Pandas Pandas
    • Data Engineering
    • Jira Jira
    • Git Git
    • ELT
    • Data Analytics
    • Data Modeling
    • Microsoft Excel Microsoft Excel

Education

  • BSc.Computer Engineering

    University of Prishtina · 2017 - 2021

Find your next developer within days, not months

In a short 25-minute call, we would like to:

  • Understand your development needs
  • Explain our process to match you with qualified, vetted developers from our network
  • You are presented the right candidates 2 days in average after we talk

Not sure where to start? Let’s have a chat