-
Data Engineer, Upwork
March 2025 – Present
Built a flood prediction product using geospatial (point-in-polygon) operations with Spark, Java, Geotrellis, and AWS EMR Step Functions, boosting CPU utilization by 32%. Developed a scalable address interpolation algorithm with Python Dask and AWS Redshift, increasing address coverage by 44% through an automated two-pointer approach. Designed a centralized data repository and ingestion framework for 40 data products into Snowflake and Databricks using AWS Step Functions. Optimized Snowflake data uploads with Python multithreading, file splitting, and compression via AWS S3 external stages, improving upload speed by 74% and eliminating legacy cloud EMR costs.
Tech stack: Spark, Java, Geotrellis, AWS EMR, Python Dask, AWS Redshift, Snowflake, Databricks, AWS
-
Machine Learning Researcher, University of Alabama
April 2025 – Present
Designing and implementing a scalable sparse autoencoder pipeline for classifying and reconstructing particle collision events using minimally processed detector image data. Supervisor: Dr. Sergei V. Gleyzer.
Tech stack: Python, PyTorch, NumPy, MLflow, HPC, SLURM
-
Open-Source Software Engineer, Probabl.ai
April 2025 – July 2025
Contributing to Probabl.ai’s “skore” project to enhance data-science testing and visualization capabilities.
Tech stack: Python, Pandas, NumPy, scikit-learn, pytest
-
Data Analytics & DevOps Engineer, Nordblock
September 2024 – February 2025
Deployed monitoring infrastructure with Prometheus, Grafana, and InfluxDB for carbon-neutral Bitcoin mining operations across Nordic regions. Implemented LSTM autoencoder anomaly detection for power consumption patterns, reducing equipment downtime by 25%. Built an ELK stack pipeline ingesting logs from AWS S3 into Elasticsearch with Kibana visualization for energy-to-heat conversion analysis and rapid troubleshooting. Orchestrated scalable analytics workflows using Apache Spark Structured Streaming to process Kafka data streams, with Airflow managing ETL job scheduling and automated model retraining pipelines via Kubernetes.
Tech stack: Python, Linux, Docker, Kubernetes, Prometheus, Grafana, InfluxDB, ELK Stack, Apache Spark, Kafka, Airflow, AWS
-
Technical Student, CERN (European Organization for Nuclear Research)
June 2023 – July 2024
Architected and maintained systems for a 6,000-user global CMS experiment collaboration, optimizing publication, personnel, and institute management for reliability and scalability. Delivered on-demand user support and enhanced the iCMS-web ecosystem to streamline global workflows. Developed iCMS-teams, a web service for registering new collaborators in the CMS experiment. Contributed to iCMS-statistics, which provides configurable analytics on the CMS collaboration via tables, plots, and exportable data. Built microservices and web apps using Python (Django, Flask APIs), Java (Spring Boot), and Vue.js. Automated periodic database synchronization with cron jobs using Bash and Python scripts. Optimized resource allocation for EOS storage and implemented CI/CD pipelines with GitLab and OpenShift for seamless deployments. Witnessed server migrations from CentOS 7 to AlmaLinux 9 after CentOS 7 reached end-of-life.
Tech stack: Python, Django, Flask, Java (Spring Boot), JavaScript, TypeScript, Vue.js, Databases (PostgreSQL, MariaDB, OracleDB), Linux, Docker, Bash, Shell scripting, Git, GitLab CI/CD, Helm, OpenShift
-
Data Engineering Intern, Roni Analytics
February 2023 – May 2023
Contributed as part of the co-founding team of a startup, developing a crypto analysis tool incorporating real-time metrics from Ethereum-based layer 1 blockchains. Designed and optimized Spark SQL queries on Hadoop HDFS systems to process large-scale blockchain datasets for efficient analysis and backtesting workflows. Worked on system monitoring and backend integration, leveraging Docker and Kubernetes.
Tech stack: Python, Apache Spark, Hadoop HDFS, Docker, Kubernetes
-
Software Engineering Intern, CERN-HSF (Google Summer of Code)
May 2022 – August 2022
Developed and published an Etherpad plugin via NPM to enable collaborative file-sharing across the CS3 Science Mesh platform using Golang REST APIs.
Tech stack: JavaScript, Node.js, Golang, Docker
-
Software Engineering Intern, bcoin (Summer of Bitcoin)
May 2022 – August 2022
Adapted Bitcoin Core's compact block relay (BIP152) for the bcoin library; implemented end-to-end tests that raised test coverage by 40%.
Tech stack: JavaScript, Node.js, C++, Mocha
-
Software Engineering Intern, Public Lab (Google Summer of Code)
May 2021 – August 2021
Updated a spectrometry data-analysis library for cross-browser WebRTC camera switching; added mapping integrations and boosted test coverage by 30%.
Tech stack: JavaScript, Ruby on Rails, WebRTC, Leaflet.js, Cypress
-
Software Engineering Intern, moja Global
June 2021 – September 2021
Prototyped a Vue.js dashboard interfacing with Flask APIs; containerized it with Docker and managed infrastructure with Terraform on GCP.
Tech stack: Vue.js, Python, Flask, Docker, Terraform, GCP, AWS, Azure