SV

Srikar Veluvali

IIIrd Year Student At Keshav Memorial Institute of Technology

About Me

Hey! I'm Srikar Veluvali, currently pursuing a Bachelor of Technology in Information Technology at Keshav Memorial Institute of Technology with a CGPA of 9.67.

I'm currently a Research Intern at Microsoft in Bengaluru. Previously, I was an HPC Software Engineering Intern at DRDL (DRDO).

I've built projects like Astor AI, a medical chatbot fine-tuned on Llama 3 with Retrieval-Augmented Generation achieving 400+ downloads, and a Heart Health Web App using the MERN stack and machine learning with Google's Gemini AI for personalized plans, nominated for the People's Choice Award in the Google Gemini API Developer Competition.

When I'm not coding or tinkering with tech, I enjoy solving problems on Leetcode (>800 problems, rating 1713) or exploring the latest in generative AI.

Academic Performance

B.Tech - Information Techology

Year 1

Semester 1

9.84

Semester 2

9.79

Year CGPA

9.81

Year 2

Semester 1

9.62

Semester 2

9.86

Year CGPA

9.74

Year 3

Semester 1

9.26

Semester 2

9.37

Year CGPA

9.31

Year 4

Semester 1

N/A

Semester 2

N/A

Year CGPA

N/A

Overall CGPA

9.62

Skills & Expertise

Work Experience

Research Intern

Microsoft | Jan 2025 - Present

  • Researched optimization techniques to enhance the efficiency and scalability of kernels for large language models, minimizing latency and maximizing throughput
  • Parsed, restructured, and improved the infrastructure of compiler intermediate code using graph transformations to boost performance
  • Conducted performance profiling and benchmarking to validate improvements and ensure stability
  • Collaborated with research and engineering teams to integrate optimized GPU kernel solutions into production workflows

CUDA Programming Intern

Defence Research Development Laboratory (DRDO) | Jun 2024 - Dec 2024

  • Contributed to the development and optimization of CUDA programs for Computational Fluid Dynamics (CFD) simulations
  • Achieved a 25% reduction in processing time and enhanced computational efficiency on an NVIDIA RTX 4080 GPU
  • Collaborated with a multidisciplinary team of engineers to implement high-performance computing solutions
  • Improved overall simulation speed and accuracy through optimization of CUDA code
  • Worked on high-performance computing and parallel processing techniques to improve simulation efficiency

Featured Projects

Searchly: AI-Powered Product Recommendation System

Developed Searchly, an AI-driven e-commerce assistant built on a robust microservices architecture with an agentic workflow that orchestrates data scraping, embedding, retrieval, and UI interactions. Features include "Aivy" natural-language chat, personalized recommendations via HuggingFace embeddings and Pinecone vector search, real-time product data scraping, JWT-based authentication, and a responsive React/Tailwind UI.

Flask
Python
MongoDB
Pinecone
HuggingFace Embeddings
Groq API
BeautifulSoup
React
Tailwind CSS

Astor AI: A Chatbot for Medical Queries

Built a medical chatbot using the Llama 3 model with Retrieval Augmented Generation (RAG) for more accurate responses and faster query handling, optimized for local system use.

React.js
Flask
LLMs
Generative AI

Heart Health Web Application

Developed a MERN stack web app with Machine Learning for heart disease prediction, offering personalized diet and exercise plans and integrating Google's Gemini AI for diet suggestions.

MERN
Machine Learning
Google Gemini AI
Flask

OCR Entity Extraction with BERT

Developed an Optical Character Recognition (OCR) system using PaddleOCR combined with a fine-tuned BERT model to extract and classify entities from text in images.

Optical Character Recognition
BERT
Python
Machine Learning

Battle Engine (Remaster)

Developed a Pokemon-style RPG game using Java and Object-Oriented Programming concepts. Players engage in turn-based battles and progress through various levels with different strategies.

Java
OOPS

Dataset - Extraction, Analysis, and Visualization

A Python-based data analysis project that explores the Video Game Sales Analysis dataset to answer 15 key questions related to video games. Presented at PRAKALP 2023.

Python
Data Analysis
Visualization
Tableau

LUNA and LEVI

AI-powered navigation assistants. LUNA is voice-based, while LEVI is text-based, designed to assist users with website navigation, answering queries, playing music, and more.

AI
Python

Battle Engine

A text-based command-line game in C, simulating battles between the player and various bots. The game features strategic combat and a secret battle with a special challenge.

C
Command-line

Education & Certifications

Bachelor of Technology in Information Technology

Keshav Memorial Institute of Technology, Expected 2026

Class 12

Sri Chaitanya Junior College, 2022

Class 10

Dilsukhnagar Public School, 2020

Get in Touch