hi , i am
Saumya
Gupta .

AI Research Engineer/Data Scientist

download resume

about me

I am Saumya Gupta, an AI researcher with a passion for harnessing cutting-edge technology to solve complex problems. With a Master's degree in Artificial Intelligence from Northeastern University and extensive experience in AI research and development, I specialize in domains such as Natural Language Processing and generative AI.

As an AI Researcher at the Institute of Experiential AI, I develop Transformer models to predict alternative splicing in genetics, advancing bioinformatics. At Razorpay and Rebel Foods, I built backend APIs and applied AI to deploy generative systems and optimize training.
My AI journey began with a sign language detection system to empower specially-abled individuals, igniting my passion for transformative tech. Since then, I’ve built multilingual RAG systems, mastered LLMs, and delved into geometric deep learning and diffusion models. I thrive on solving challenges, mentoring, and shaping AI’s future through impactful innovation.
A tech enthusiast and lifelong learner, I’m always exploring new horizons and ready for the next adventure!

email

gupta.saumy@
northeastern.edu

education

2023 - 2025

master of Artificial Intelligence

northeastern university

Boston, massachusetts, USA

GPA: 4.0
Subjects Taken: Algorithms, Natural Language Processing , Foundations of AI , AI for Human Computer Interaction , Machine Learning, Programming Design Paradigms, AI for Human Computer Interaction

2017 - 2021

bachelor of computer science

vellore Institute of technology

Vellore, Tamil Nadu , India

GPA: 3.8
Subjects Taken: Algorithms, Artificial Intelligence, Natural Language Processing, Machine Learning, Computer Vision, Statistics, Calculus, Data Visualization, Databases, OOPs, Graph theory, Linear Algebra, Operating Systems, Distributed Systems

skills

Python Logo
Python
Python Logo
Python
Pytorch Logo
Pytorch
Pytorch Logo
Pytorch
TensorFlow Logo
Tensorflow
TensorFlow Logo
Tensorflow
Golang Logo
Golang
Golang Logo
Golang
Java Logo
Java
Java Logo
Java
Node.js Logo
NodeJS
Node.js Logo
NodeJS
JavaScript Logo
JavaScript
JavaScript Logo
JavaScript
C++ Logo
C++
C++ Logo
C++
Linux Shell Script Logo
Shell Script
Linux Shell Script Logo
Linux Shell Script
PostgreSQL Logo
PostgreSQL
PostgreSQL Logo
PostgreSQL
Neo4j Logo
Neo4j
Neo4j Logo
Neo4j
VectorDB Logo
VectorDB
VectorDB Logo
VectorDB
MongoDB Logo
MongoDB
MongoDB Logo
MongoDB
Docker Logo
Docker
Docker Logo
Docker
Kubernetes Logo
Kubernetes
Kubernetes Logo
Kubernetes
Apache Spark Logo
Apache Spark
Apache Spark Logo
Apache Spark
Apache Airflow Logo
Apache Airflow
Apache Airflow Logo
Apache Airflow
Hadoop Logo
Hadoop
Hadoop Logo
Hadoop
Azure Logo
Microsoft Azure
Azure Logo
Microsoft Azure
AWS Logo
AWS
AWS Logo
AWS
Machine Learning Logo
Machine Learning
Machine Learning Logo
Machine Learning
Deep Learning Logo
Deep Learning
Deep Learning Logo
Deep Learning
NLP Logo
Natural Language Processing
NLP Logo
Natural Language Processing
Computer Vision Logo
Computer Vision
Computer Vision Logo
Computer Vision
Data Analysis Logo
Large Language Models
Data Analysis Logo
Large Language Models
RAG Logo
Retrieval-Augmented Generation
Data Analysis Logo
Retrieval-Augmented Generation
dsa Logo
Data Structures and Algorithms
dsa Logo
Data Structures and Algorithms
Data Analysis Logo
Data Visualization
Data Analysis Logo
Data Visualization
Statistics Logo
Statistics
Statistics Logo
Statistics
Calculus Logo
Calculus
Calculus Logo
Calculus
OS Logo
Operating Systems
OS Logo
Operating Systems
DBMS Logo
Database Mangement Systems
DBMS Logo
Database Mangement Systems
distributed system Logo
Distributed Systems
istributed system Logo
Distributed Systems
devops Logo
Devops
devops Logo
Devops
mlops Logo
MLOps
mlops Logo
MLOps
web dev Logo
Full Stack Software Engineering
web dev Logo
Full Stack Software Engineering

experience

  • July 2024 - Present

    Research Associate , AI Research Coop

    Institute of Experiential AI at Northeastern University

    Boston, USA

    I am developing Large Language Models to predict alternative splicing, leveraging multi-GPU training to handle imbalanced genetic datasets and improve the understanding of splicing events in pre-mRNA sequences. Additionally, I am applying geometric deep learning principles to refine Latent Diffusion Models, focusing on symmetric latent spaces for enhanced uncertainty quantification and advancing the reliability of AI models.

  • January - April 2024

    Khoury Graduate Teaching Assistant

    Khoury College of Computer Sciences, Northeastern University

    Boston, USA

    I guide students in Foundations of AI through interactive tutorials, clarifying concepts, and fostering their learning journey. By assisting professors with course development, grading, and creating a collaborative environment, I promote teamwork and inclusivity to ensure students excel in AI fundamentals.

    January - April 2024

    Graduate Teaching Assistant

  • April 2022 - August 2023

    Software Development Engineer

    Razorpay

    Bangalore, India

    I led Golang development for the Optimizer - Payments Team, delivering customer-focused solutions to streamline payment operations. I enhanced payment gateway configurations through rule-based prioritization, containerized microservices with Docker, and deployed them on Kubernetes using Helm. I also implemented robust integration tests, ensuring platform reliability. Notably, I introduced a one-click Paytm wallet feature, boosting user engagement, and mentored an intern to develop a charge-back prediction model using an ANN, achieving impactful results.

  • July 2021 - April 2022

    Software Development Engineer

    Rebel Foods (Formely Faasos)

    Bangalore, India

    I led back-end development for the In-order team, managing critical microservices for kitchen staff, food orders, and inventory. I designed scalable schemas for storing order and staff data and implemented a key feature enabling validated order cancellations, reducing revenue loss and ensuring system integrity.

    July 2021 - April 2022

    Software Development Engineer

  • May 2020 - July 2020

    AI Research Intern

    Indian Institute Of Information Technology

    Prayagraj, India

    I conducted a comparative study on GANs for text-to-image synthesis using the Caltech bird dataset, devising a novel evaluation method with t-SNE to analyze overlaps between original and generated images. This research provided nuanced insights into GAN performance and advanced methodologies in AI-driven image synthesis.

  • April 2020 - May 2020

    Computer Vision Intern

    Ocean Energy

    Mumbai, India

    I developed a system for detecting traffic rule violations from video inputs, identifying vehicles breaching traffic signals, extracting license plate data, and converting it to text. The system integrated with a database for organized information storage, enhancing traffic monitoring and enforcement.

    April 2020 - May 2020

    Computer Vision Intern

projects

Project 1 Image

Shroom: Hallucination Detection in LLMs

This project tackles the critical challenge of detecting hallucinated outputs in LLMs, which are factually inaccurate despite being grammatically correct. Focusing on tasks like definition modeling, paraphrasing, and machine translation, I used clustering with task-specific metrics and trained a Siamese network with BERT embeddings to evaluate LLM performance. Additionally, I experimented with prompting techniques using the LLaMA model. This comprehensive approach improves the reliability and trustworthiness of AI-generated text across domains.

View Project
Project 2 Image

Ayurveda RAG: Knowledge Graph RAG

The Ayurveda RAG project combines the ancient wisdom of Ayurveda with the power of modern AI to create an intelligent retrieval-augmented generation system. This system is designed to answer complex Ayurveda-related questions, even those containing Hindi or Sanskrit terms, with accuracy and depth. By leveraging a Neo4j-powered knowledge graph and a FAISS index for efficient retrieval, the project ensures precise, context-aware responses. This innovation bridges traditional healthcare knowledge with cutting-edge AI, making Ayurvedic wisdom more accessible and actionable for a global audience.

View Project
Project 2 Image

Semantic Segmentation Of Images : Autonomous Driving Vehicles

Performed semantic segmentation on the Indian driving dataset using deep convolutional networks - FCN8, UNET, LINKNET, PSPNET, and DEEPLABV3+. Conducted a thorough performance analysis with mean Intersection over Union (IOU) score and mean F1 score as metrics on validation dataset, showcasing superior results of IOU of 0.77 and Mean F1 as 0.78 for DEEPLABV3+. Remarkable outcomes were also observed for DeeplabV3+ and Linknet architectures.

View Project
Project 2 Image

Picto Phrases: Image Caption Generator

I utilized InceptionNetV3 for generating image features in combination with Bidirectional LSTM for image captioning on the Flickr8k dataset. Employing beam search with factors 3 and 5, I further explored predicted captions to enhance the results. The generated captions successfully captured the primary context of the images, achieving a CORPUS – BLEU score of 0.435339.

View Project
Project 2 Image

Sarcasm Detection on News Headlines

I conducted a performance comparison among Decision Tree, SVM, Random Forest, and a basic LSTM model for sarcasm detection using the Kaggle News Headlines Dataset. The accuracy results revealed that the LSTM model achieved 90%, Random Forest achieved 89%, and SVM achieved 86%. This exploration provided insights into the working mechanisms of these diverse models and their efficiency in comprehending the relationships between words in a sequence to identify sarcasm.

View Project
Project 2 Image

Online Job Search Platform

Developed a website featuring registration for Candidates and Recruiters. Candidates can upload resumes, parsed for skills, education, and experiences, stored in the database. Recruiters can log in, post jobs, and view applicants sorted by matching skills. Candidates can filter jobs and view applied jobs and registered companies. Technologies used: HTML, CSS, Bootstrap, JavaScript, jQuery, MongoDB, Node.js. Adhered to solid principles.

View Project

contact me

gupta.saumy@northeastern.edu