Hi👋, I'm Harshil Gupta.

I love being a 

About Me

A Brief Intro

A BTech(Bachelors in Technology) graduate from Netaji Subhas University of Technology with major in Instrumentational Engineering and minor in Robotics and Artificial Intelligence.
I like to work with new and exciting tech stacks and I constantly work to improve my knowledge and enhance my skillset.

Codeforces CodeChef LeetCode

Check out my Resume

Work Experience

Software Engineer at NatWest Group

July 2023 - Present

Onboarded 5+ data products using OBDEF (One Bank Data Engineering Framework (OBDEF) built on Python/Scala, improving data quality via AWS Deequ checks on in-transit/at-rest data.

Migrated on-premise Oracle RDBMS to AWS S3 using Airflow DAGs (Directed Acyclic Graphs) for real-time and batch ingestion of data on AWS, reducing storage costs by upto 25%.

Developed transformation logic for complex nested data using SQL and conducted performance tuning with optimization of data pipelines, reducing processing time and improving system performance.

Responsible for managing AWS EMR (Elastic MapReduce) configuration, resulting in a significant reduction of AWS compute costs by nearly 40%.

Managed the ETL (Extract-Transform-Load) process, extracting data in JSON format from Kafka and persisting it as Parquet on S3, benefiting from efficient columnar storage, reduced storage costs, and faster query performance.

Currently part of a team focused on integrating AWS DataZone at the bank to streamline data management processes, improve data accessibility, and ensure robust governance and security measures. This integration includes leveraging AWS services such as EMR, Athena and Glue to enhance overall data capabilities.

Performed rigorous testing for the data products built. This includes load testing of Kafka topics through automation scripts, as well as system testing, regression testing, and performance testing of already existing SQL transformation scripts, Airflow DAG code, and AWS configurations.

Used Hashicorp's Terraform as an IaaC tool (infrastructure as code) to manage AWS cloud infrastructure.

Software Engineering Intern at Royal Bank of Scotland

August 2022 - June 2023

Assisted the Data and Analytics team to build an MLOPS (Machine Learning Operations) framework for their new customer conversation intelligence (CCI) model.

Developed monitoring scripts for the CCI model for smooth execution and assisted the work team in gathering label data for model validation.

Subject Matter Expert at Chegg

Sep 2021 - Dec 2022

Registered Subject: Computer Science

Teaching Assistant at Coding Ninjas

Jul 2021 - Sep 2021

Responsible for mentoring 100+ students in debugging their code. Helping students with the concepts of data structures and algorithms with an average rating of 4.87 out of 5.0.

Let's talk

Contact