About Me

Hello! 👋 I’m a data scientist with a passion for creating innovative solutions to complex problems. Currently, I am pursuing a Masters in Information Systems Management - Business Intelligence & Data Analytics at Carnegie Mellon Univeristy.

I hold a B.Tech in Computer Science Engineering with a minor in Economics from Shiv Nadar University, and I previously worked as a Machine Learning R&D Engineer at Hewlett Packard Enterprise.

My research interests lie in the intersection of Natural Language Processing (NLP) and Human Computer Interaction (HCI) to build intelligent interactive systems that can communicate with humans effectively and enable individuals to conduct and analyze digital longitudinal data studies in an efficient and ethical way.

When I’m not immersed in my work, you’ll likely find me at the gym, planning my next hiking trip, or playing tennis with my friends!

Interests

  • Data Science
  • Computational Social Science
  • Natural Language Processing
  • Machine Learning
  • Data Engineering

Education

  • Masters in Information Systems Management - Business Intelligence & Data Analytics, 2024-2025
    Carnegie Mellon University, Pittsburgh, PA
  • B.Tech in Computer Science and Engineering, 2018-2022
    Shiv Nadar University, India

Recent News (scroll for more)



Professional Experience

Hewlett Packard Enterprise
Machine Learning R&D Engineer, HPE Compute
Aug 2022 – May 2024
Bangalore, India
  • Developed ML & data warehousing solutions within the Data Engineering & Analytics team to effectively store and manage HPE Infosight server data using the ELK stack.
  • Developed & deployed a custom Named Entity Recognition model with an ETL pipeline to overhaul a legacy Customer Advisory (CA) recommendation system, reducing the average percentage error of the system from 11% to 2.5%, reducing TAT by 40% and elevating self-resolutions significantly.
  • Contributed to core Python backend development for HPE iLO server simulator (via Docker, Postgres, Kubernetes, Nginx), delivering a robust and scalable simulation environment to enhance HPE server technology testing and validation.
  • Actively engaged in data analysis and dashboard creation to highlight outcomes of ML-driven initiatives.
R&D Intern, HPE Storage
Jan – Jul 2022
Bangalore, India
  • Designed IO generation tests for diverse storage arrays (HPE 3PAR & Primera) & host OSs via Python IDART framework.
  • Optimized IDART test script automation for HP-UX & IBM-AIX OSs, resulting in a 60% reduction in testing time across stand-alone and MultiOS environments.
KPMG India
Data Science Intern, KPMG Digital Lighthouse
Jun - Aug 2021
Bangalore, India
  • Analyzed phishing URL patterns to devise a Levenshtein distance-based algorithm to enhance phishing URL detection accuracy by 30% for the KPMG Digital Signals Insights Platform (DSIP).
  • Introduced multilingual capabilities in the fraud detection pipeline to detect and flag punycode phishing URLs.
  • Built & integrated a PII microservice module into the DSIP platform to ensure GDPR compliance for key customers.
Profinch Solutions
Data Warehousing Intern
Jun - Aug 2020
Bangalore, India
  • Designed and implemented a 360° Customer dashboard that provides a holistic bank-wide overview of a customer to drive targeted cross-selling/upselling opportunities in core banking systems for key clients.
  • Utilized Hadoop, Hive and Tableau for big data management & data visualization.


Research & Teaching Experience

The Alan Turing Institute
Researcher, Turing Data Study Group
May 2022
Birmingham, UK
  • Worked on a research study with the Rolls Royce R&D division to identify causes for turbine manufacturing failure under the guidance of Dr. Kit Windows-Yule at the University of Birmingham.
  • Analyzed semi-structured high-dimensional manufacturing data and co-designed a hyperparameter-tuned Gradient-boosted CatBoost classifier model with a prediction accuracy of 77%, outperforming alternative methods.
  • Co-authored a report underscoring the academic impact of the study, soon to be published by the Alan Turing Institute.
Department of Computer Science, Shiv Nadar University
UG Teaching Assistant
Aug 2021 - May 2022
Greater Noida, India
  • CSD355: Foundations of Data Science in the Spring 2022 semester under Prof. Sonia Khetarpaul.
  • CSD101: Introduction to C Programming in the Monsoon 2021 semester under Prof. Harish Karnick.
Department of Mathematics, Shiv Nadar University
UG Research Assistant, Human-Computer Interaction
Nov 2021 - May 2022
Greater Noida, India
  • Designed an assistive framework for the automation of remote contextual inquiries and subjective respondent data analysis using conversational agents and NLP-driven coding methodologies.
  • Designed a custom NER model using language CNNs, performed sentiment polarity extraction and subjective theme extraction from natural language textual responses.

Publications

Achievements

Clubs & Community Building

  • Chair, ACM-W Shiv Nadar University Chapter 2020-2021
    • Launched and led the ‘Women in Tech’ talk series to promote mentorship and skill development for female students.
    • Started and led a special interest group for Machine Learning under ACM, SNU Chapter to promote peer learning and targeted skill development among students.
    • Organized HackData 5.0, Shiv Nadar University’s annual 48-hour inter-college hackathon.
  • Head of App Development, Google Developer Student Clubs Shiv Nadar University 2020-21
    • Led a team of 30 student volunteers to create impactful mobile apps for the student community and university stakeholders.
  • App Development Lead, Surge - Shiv Nadar University Sports Fest 2021

  • Summer Volunteer, ERIDE (Empowering Rural India with Digital Education) at IIM Bangalore NSRCEL 2018

  • Member, Women Who Code Bangalore

  • Member, Zomato’s Feeding India Shiv Nadar University Chapter

  • Sponsorship Team, TEDx Shiv Nadar University 2019

  • Shiv Nadar University Acapella Club - Synergy 2019