Bachelor's Degree
California State University
August 2020 - May 2023
Majored in computer science, specialized in Data Artificial Intelligence.
Minor in Mathematics
I am, essentially, a curious data-driven software engineer with dreams to carve out a career path that allows me to utilize and expand my skillset as a Data Engineer, Data Scientist and ML Engineer. Right from childhood, I have had a deep-seated interest in automating things. Once, when a family member got seriously injured at a high-risk job, I remember wondering why no one thought about getting such risk-prone jobs done by a machine. My foray into the field of Computer Science has been to broaden my knowledge about intelligent systems. I am very good at Math and Statistics. I also thoroughly enjoy working with diverse Machine Learning models. The best part is converting a hypothetical model created on a whiteboard to an actual coding of the model.
Majored in computer science, specialized in Data Artificial Intelligence.
Minor in Mathematics
Sep 2022 - May 2023
Student Assistant
Sacramento state University
•Maintain excellent customer service and positive attitude towards guest, customers, clients and co-workers.
• Prepared signs, posters, and mailings and assisted with other tasks for events at Sac State University.
• Maintain all serving schedules; ensure that all food items are served per menu specifications in a safe and appropriate manner following department policies and procedures.
June 2022 - Aug 2022
Data Engineer Intern
McKinney, Texas
• Built a migration pipeline from Oracle database to ADLS gen 2 using Stream Sets pipelines which helped in scaling database for future.
• Maintain optimal data processing pipeline architecture using PySpark and Azure Data Factory to reduce time complexity by 3 hours.
• Spearheaded key performance indicator (KPI) development for projects by making a detailed analysis and reporting of past project data and customer feedback using python, increasing average customer satisfaction rating by 8%.
•Migrated 237k data points from My SQL to SnowFlake using Stream Sets data loader with a 99.9% success rate, reducing data processing time by 50%.
•Implemented advanced monitoring techniques during the migration process to ensure zero loss of vital information and maintained full-time availability of business-critical data.
SnowFlake, Stream Sets,My SQL
•Developed automated SQL scripts to generate daily and weekly sales reports, reducing manual processing time by 80%.
•Scheduled pipeline using Apache Airflow DAGs triggered every day and weekly.
Python, SQL, Airflow
@Chico State University
• After two days of intense data wrangling, analysis, and presentation design, each team is allowed a few minutes and no more than two slides to impress a panel of judges.
Our group won Best Visualization Award
Python, Pandas, Keras
• Trained CNN, DNN, logistic regression and SVC to classify whether the sentiment of a drug review is positive with an accuracy of 92%.
• SMOTE technique for sampling the imbalanced dataset which increased the accuracy of the model by 8%.
Python, TensorFlow, Keras
• Data Ingestion, cleaning, validation, and processing data using ADF.
• Visualized key statistics like customer trends and average tip per mile driven by neighborhood and by time in user friendly dashboards
• Data Orchestration using Airflow for scheduling pipeline on time.
Docker, Microsoft Azure, Data Factory, Tableau