Elvijs Pārpucis - Senior Data Engineer

Q: Where is Elvijs based?

Elvijs is based in Riga, Latvia.

Q: What languages does Elvijs speak?

Elvijs speaks the following languages: Latvian (Native), English (Advanced), Lithuanian (Elementary).

Q: How many years of experience does Elvijs have?

Elvijs has at least 10 years of experience. During this time, Elvijs has worked in at least 4 different roles and for 4 different companies. The average length of individual experience is 2 years and 7 months. Note that Elvijs may not have shared all experience and may actually have more.

Q: What is Elvijs's latest experience?

Elvijs's most recent position is Senior Data Engineer at SolutionLab.

Q: What companies has Elvijs worked for in recent years?

In recent years, Elvijs has worked for SolutionLab, Deloitte, and Zabbix.

Q: Which industries is Elvijs most experienced in?

Elvijs is most experienced in industries like Information Technology and Professional Services.

Q: Which business areas is Elvijs most experienced in?

Elvijs is most experienced in business areas like Business Intelligence and Information Technology.

Q: Which industries has Elvijs worked in recently?

Elvijs has recently worked in industries like Information Technology and Professional Services.

Q: Which business areas has Elvijs worked in recently?

Elvijs has recently worked in business areas like Business Intelligence and Information Technology.

Riga, Latvia

Experience

Dec 2024 - May 2026

1 year 6 months

Lithuania

Senior Data Engineer

SolutionLab

Architected and built scalable ETL/ELT data pipelines using Python, SQL, and Spark (EMR/Databricks) to process large-scale batch and streaming datasets for analytics and reporting systems.
Designed and optimized cloud data warehouse solutions (Snowflake, Redshift, BigQuery, Azure ADX), including schema design, data modeling (star/snowflake/medallion), and query performance tuning.
Developed and maintained real-time and batch data ingestion pipelines using Kafka, Kinesis, Azure Data Factory, AWS Glue, and Pub/Sub to support high-volume, low-latency data processing.
Designed and implemented real-time event streaming pipelines using Apache Flink and Kafka to enable live user behavior tracking and support personalized feature delivery.
Implemented workflow orchestration and scheduling systems using Apache Airflow, Prefect, AWS Step Functions, and Kubernetes-based jobs for reliable data operations.
Established enterprise-grade data governance, quality, and observability frameworks using Great Expectations, Soda, Monte Carlo, Prometheus, Grafana, and ELK stack.
Designed and maintained data models, warehouses, and analytical marts supporting BI dashboards and reporting tools such as Power BI and Tableau.
Led system optimization, cost tuning, and performance engineering efforts, including query optimization, storage efficiency, and compute cost reduction across distributed cloud systems.

Sep 2021 - Dec 2024

3 years 4 months

United Kingdom

Data Engineer

Deloitte

Designed and implemented scalable end-to-end data pipelines (ETL/ELT) using Python, SQL, and Apache Spark (EMR/Databricks) to process high-volume transactional and behavioral data.
Built and optimized AWS-based data architectures (S3, Glue, Lambda, Kinesis, Redshift) enabling reliable ingestion, storage, transformation, and analytics at scale.
Developed and maintained orchestrated workflows using Apache Airflow and AWS Step Functions, ensuring fault-tolerant, scheduled, and observable data processing pipelines.
Designed robust data models (star schema, dimensional modeling, analytical data marts) to support BI dashboards, reporting, and self-service analytics across business teams.
Implemented Infrastructure as Code (Terraform/CloudFormation) and integrated CI/CD pipelines (Jenkins, CircleCI) to automate deployment and improve system reliability.
Established data quality frameworks, monitoring, logging, and alerting systems, improving data accuracy, pipeline observability, and production stability.
Partnered with product, engineering, analytics, and ML teams to standardize data access, support feature engineering, and enable machine learning and analytics workloads.
Performed SQL performance tuning and big data optimization, improving query efficiency and reducing processing time on large-scale datasets across cloud warehouses (Redshift/Snowflake/BigQuery).

Nov 2018 - Aug 2021

2 years 10 months

Latvia

Data Scientist

Zabbix

Led end-to-end development of machine learning and statistical models in Python (pandas, scikit-learn), covering problem framing, feature engineering, training, evaluation, and production deployment.
Designed and executed structured experiments (A/B testing, hypothesis testing, causal inference methods) to evaluate model effectiveness and optimize data-driven decision-making.
Performed deep exploratory data analysis (EDA) on large, messy datasets to uncover patterns, detect data quality issues, and identify key predictive drivers.
Engineered and optimized data pipelines using SQL and big data frameworks (Spark, Hadoop) to support scalable feature generation and high-volume data processing.
Developed and deployed deep learning models (CNN, RNN, LSTM) using TensorFlow, Keras, and PyTorch for complex predictive and pattern recognition tasks.
Developed and maintained messaging-based data ingestion pipelines using RabbitMQ, enabling asynchronous, scalable, and reliable data movement across distributed systems.
Collaborated with cross-functional teams (product, engineering, marketing, stakeholders) to integrate ML solutions into production systems and deliver measurable business impact.

Oct 2015 - Sep 2018

3 years

Latvia

Data Analyst

SIDC Group Ltd

Leveraged SQL, Python, Excel, and Power Query to analyze large-scale datasets covering SKU performance, cost structures, service levels, and reporting KPIs across multiple markets.
Designed and built ETL data pipelines using Python and Apache Spark to enable reliable processing of large-scale structured enterprise datasets for reporting, analytics, and business intelligence.
Designed and maintained data validation frameworks for supplier invoices, ensuring financial accuracy, operational efficiency, and reduced discrepancies.
Managed and optimized relational and distributed data platforms, including PostgreSQL and Hive, to support high-volume reporting and large-scale historical data processing.

Industry Experience

See where this freelancer has spent most of their professional time.

Experienced in Information Technology and Professional Services.

Information Technology

Professional Services

Business Area Experience

See which departments and functions this freelancer has contributed to most.

Experienced in Business Intelligence and Information Technology.

Business Intelligence

Information Technology

Summary

Senior Data Engineer & Data Scientist with 10+ years of experience designing and building large-scale distributed data systems, real-time streaming architectures, and cloud-native AI/ML data platforms across fintech, healthcare, SaaS, and e-commerce domains. Strong expertise in Python, SQL, Apache Spark, Flink, Kafka, Airflow, Databricks, Snowflake, TensorFlow, PyTorch, AWS, Azure, and GCP, with deep experience in ETL/ELT pipeline development supporting both batch and streaming workloads. Skilled in modern data lake and analytics ecosystems including Trino, ClickHouse, HDFS, and S3-compatible storage, focused on performance, scalability, and reliability. Experienced in containerized and cloud-native infrastructure using Kubernetes, Docker, CI/CD pipelines, and GitOps (Argo CD). Proven ability to deliver end-to-end solutions across data engineering, machine learning systems, MLOps workflows, and real-time analytics that drive business growth, operational efficiency, and data-driven decision-making.

Skills

Programming Languages: Python (pandas, NumPy, Matplotlib), SQL, Java, Bash, Shell Scripting, YAML
Data Engineering & Pipelines: ETL/ELT Design, Batch & Real-Time Streaming, Data Modeling, Data Warehousing, Lakehouse Architectures, Data Integration, Data Transformation
Big Data & Distributed Systems: Apache Spark, Flink, Kafka, RabbitMQ, Hadoop (Hive, HDFS), Trino, ClickHouse, Databricks, Airflow, dbt, Prefect
Analytics Tools: Tableau, Power BI, Looker, Google Data Studio
Cloud Platforms & Data Services: AWS (S3, Glue, Lambda, Redshift, Athena), Azure (Data Factory, Synapse), GCP (BigQuery, Pub/Sub, Cloud Storage)
Databases & Storage: PostgreSQL, MySQL, Cassandra, Redis, Snowflake, Data Lakes (S3, HDFS, Ceph)
DevOps & CI/CD: Docker, Kubernetes, Terraform, Helm, Argo CD, Jenkins, Infrastructure as Code (IaC), CI/CD
Monitoring & Reliability: Prometheus, Grafana, ELK Stack, CloudWatch, Logging, Alerting, Performance Tuning
AI/ML Data Engineering & Tools: Feature Engineering, ML Pipelines, AI Data Pipelines, SageMaker, Git, Agile, SDLC, Jira, Confluence, AI-Assisted Tools (Cursor, Claude, GitHub Copilot)

Languages

Latvian

Native

English

Advanced

Lithuanian

Elementary

Education

Sep 2011 - Jun 2015

Riga Technical University

Bachelor’s Degree · Computer Science · Riga, Latvia

Statistics

Experience

Total positions: 4

Experience in Information Technology: 7 y

Avg length: 2 y 7 m

Longest experience: 3 y 3 m

Global Experience

Countries worked in: 3 (Latvia, Lithuania, United Kingdom)

Primary country: Latvia

Expertise

Recent roles: Senior Data Engineer, Data Engineer, Data Scientist

Main industries: Information Technology, Professional Services

Main business areas: Business Intelligence, Information Technology

Qualifications

Highest degree: Bachelor

Profile

Created: June 2026

Frequently asked questions

What roles would Elvijs be best suited for?

Based on recent experience, Elvijs would be well-suited for roles such as: Senior Data Engineer, Data Engineer, Data Scientist.

What is Elvijs's education?

Elvijs holds a Bachelor's degree in Computer Science from Riga Technical University.

What is the availability of Elvijs?

Elvijs is immediately available full-time for suitable projects.

What is the rate of Elvijs?

Elvijs's rate depends on the specific project requirements. Please use the Meet button on the profile to schedule a meeting and discuss the details.