Pratham Chopra

AI/ML Engineer & Data Scientist

Master's student at Northeastern University specializing in Applied Machine Intelligence. Top 250 globally in RSNA Intracranial Aneurysm Detection Kaggle Competition. Passionate about leveraging AI to solve complex real-world problems in healthcare and beyond.

About Me

Enthusiastic Computer Science graduate specializing in AI and ML. Currently pursuing my Master's in Applied Machine Intelligence at Northeastern University (Expected 2027), building upon my B.Tech in Computer Science with AI&ML specialization from Jain University (CGPA: 8.76/10). Experienced in machine learning models, data analysis, and algorithm optimization. Skilled in problem solving and proficient in Python. Excited to apply my expertise to innovative ventures in healthcare AI, computer vision, and NLP.

Featured Projects

🏆 Kaggle Competition

RSNA Intracranial Aneurysm Detection

Developed production-ready 3D medical imaging pipeline for aneurysm detection. Processed 4,000+ DICOM brain scans with optimized preprocessing. Built EfficientNet3D ensemble with SE attention and focal loss, reducing runtime from 25 min to under 30 seconds per series.

PyTorch MONAI 3D CNNs Medical Imaging
Top 300 Global
50x Faster
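
The focal loss named in the aneurysm detection project can be illustrated with a minimal, framework-free sketch. This is not the competition code: the function name `binary_focal_loss` and the defaults `gamma=2.0` and `alpha=0.25` are assumptions chosen for illustration. It only shows how the loss down-weights easy examples so that rare positives contribute more to training.

```python
import math

def binary_focal_loss(p, y, gamma=2.0, alpha=0.25, eps=1e-7):
    """Focal loss for a single prediction.

    p: predicted probability of the positive class, y: true label (0 or 1).
    gamma and alpha are illustrative defaults, not values from the project.
    """
    p = min(max(p, eps), 1.0 - eps)             # clamp to avoid log(0)
    p_t = p if y == 1 else 1.0 - p              # probability given to the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
    # (1 - p_t) ** gamma shrinks the loss of well-classified examples
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

easy = binary_focal_loss(0.95, 1)  # confident and correct
hard = binary_focal_loss(0.10, 1)  # confident and wrong
```

With `gamma=0` and `alpha=0.5` the expression reduces to half the usual binary cross-entropy, which is a convenient sanity check.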
🏥 Full-Stack Application

Healthcare Queue Management System

Built comprehensive healthcare queue management system using FastAPI and PostgreSQL. Implemented real-time patient tracking, appointment scheduling, and automated queue optimization algorithms to reduce wait times.

FastAPI PostgreSQL Python REST API
Real-time Tracking
Automated Queue
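
The project description does not say which queue optimization algorithm was used. As one possible illustration, a priority queue that serves the most urgent patient first and breaks ties by arrival order could look like the sketch below. The class `PatientQueue`, its methods, and the urgency scale (lower number means more urgent) are all hypothetical.

```python
import heapq
from itertools import count

class PatientQueue:
    """Hypothetical priority queue: lower urgency number is served first."""

    def __init__(self):
        self._heap = []
        self._arrival = count()  # tie-breaker that preserves check-in order

    def check_in(self, name, urgency):
        heapq.heappush(self._heap, (urgency, next(self._arrival), name))

    def call_next(self):
        """Return the next patient's name, or None if the queue is empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]

queue = PatientQueue()
queue.check_in("A", urgency=3)
queue.check_in("B", urgency=1)
queue.check_in("C", urgency=3)
order = [queue.call_next() for _ in range(3)]
```

The arrival counter keeps patients with equal urgency in first-come, first-served order.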
📚 RAG Application

Research Paper Analysis Chatbot

Developed full-stack AI app with Streamlit using RAG architecture. Integrated LangChain and FAISS for semantic search across 50+ arXiv papers per query with 99% accuracy. Reduced research reading time by 5+ hours per session.

LangChain FAISS Streamlit RAG
75% Efficiency
99% Accuracy
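
The semantic search step of a RAG pipeline can be sketched without any libraries. The example below is an assumption-laden stand-in for what FAISS does at scale: it ranks documents by cosine similarity between a query embedding and document embeddings. The three-dimensional vectors and the names `paper_a`, `paper_b`, `paper_c` are made up. A real system would use embeddings from a language model.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def top_k(query, docs, k=2):
    """Return the names of the k documents most similar to the query."""
    ranked = sorted(docs.items(), key=lambda kv: cosine(query, kv[1]), reverse=True)
    return [name for name, _ in ranked[:k]]

# Toy embeddings, purely illustrative
docs = {
    "paper_a": [1.0, 0.0, 0.0],
    "paper_b": [0.7, 0.7, 0.0],
    "paper_c": [0.0, 0.0, 1.0],
}
result = top_k([1.0, 0.1, 0.0], docs, k=2)
```

The retrieved passages would then be placed in the LLM prompt as context for answering the query.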
🎙️ Deep Learning

Speech Emotion Recognition

Engineered LSTM model achieving 85.34% test accuracy (96.18% training accuracy) on emotion classification. Integrated noise addition and pitch stretching for data augmentation. Used TESS, RAVDESS, CREMA-D, and SAVEE datasets.

LSTM TensorFlow Audio Processing Deep Learning
85.34% Accuracy
4 Datasets
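
The noise addition used for augmentation can be sketched as follows. This is a minimal illustration, not the project's code: the function `add_noise`, the 20 dB signal-to-noise ratio, and the synthetic sine wave standing in for an audio clip are assumptions.

```python
import math
import random

def add_noise(signal, snr_db, rng):
    """Add Gaussian noise to a signal at a target signal-to-noise ratio (dB)."""
    signal_power = sum(s * s for s in signal) / len(signal)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [s + rng.gauss(0.0, sigma) for s in signal]

rng = random.Random(0)  # fixed seed so the augmentation is reproducible
clean = [math.sin(2 * math.pi * 5 * t / 1000) for t in range(1000)]
noisy = add_noise(clean, snr_db=20, rng=rng)
```

Training on both the clean and the noisy copies exposes the model to recording conditions that the original datasets may not cover.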
💬 NLP

Multi-Modal Document Intelligence System

Built full-stack AI application integrating document analysis and database querying. Implemented NLP capabilities for automatic SQL query generation from natural language, improving database query efficiency by 40%.

NLP SQL Python LLMs
40% Faster
30% More Access
📊 Data Analysis

College Scorecard Data Visualization

Comprehensive analysis of Department of Education's College Scorecard data. Created interactive visualizations comparing graduation rates, earnings, and debt across institutions. Built actionable insights for policy recommendations.

Pandas Matplotlib Seaborn Data Viz
Interactive Dashboard
🏙️ Urban Analytics

NYC 311 Service Request Analysis

Analyzed NYC 311 service request data with a focus on noise complaints. Identified temporal and spatial patterns, developed predictive models for resource allocation, and provided actionable policy recommendations for city planning.

Python Pandas Geospatial Analysis ML
Policy Impact
⚕️ Medical AI

ECG Signal Digitization System

Developed automated system for digitizing ECG signals from paper-based records. Implemented computer vision algorithms for signal extraction and deep learning models for arrhythmia classification.

OpenCV Signal Processing Deep Learning Medical AI
Automated Processing
🏢 Enterprise Systems

Walmart Big Data Architecture Analysis

Comprehensive evaluation of Walmart's enterprise information architecture and big data systems. Analyzed data warehouse design, real-time processing capabilities, and scalability strategies for retail analytics.

Big Data Architecture Data Warehouse Analytics
Enterprise Scale
🔒 Security

Real-Time Fraud Detection System

Built machine learning pipeline for real-time fraud detection using ensemble methods. Implemented behavioral pattern recognition with anomaly detection algorithms, prioritizing high precision to keep false positives low.

Scikit-learn Anomaly Detection Ensemble Methods Real-time
High Precision

Research & Publications

AR in Fashion Industries

Authors: Dwaj Ranka, Pratham Chopra, Ranvir M Mehta

IEEE Xplore, 2022 - 4th IEEE International Conference

Developed a virtual trial room program using OpenCV and Augmented Reality for real-time cloth simulation. The application identifies background and subject using color palette analysis and thresholding techniques.

Robotics and AI in Industry 4.0

Authors: Dwaj Ranka, Neell Ravindra Ambere, Pratham Chopra, Ranvir M Mehta

Research Paper

Examined the incorporation of RPA and AI technologies within Industry 4.0. Explored integration with Neural Networks, Text Mining, and NLP for data extraction, classification, and process optimization.

LAI (LIFE LIKE AI): Voice Assistant with Emotional Response

Authors: Dwaj Ranka, Neell Ravindra Ambere, Pratham Chopra, Ranvir M Mehta

Research Paper

Proposed a voice assistant with emotional intelligence built on ML and audio preprocessing. The system captures user emotions and generates contextually relevant responses using sentiment analysis.

Certifications

Microsoft AI-900

Microsoft

January 2024

Post Graduate Program in Data Science and AI

IIIT-B

2024

Generative AI with LLMs

DeepLearning.AI (Coursera)

August 2023

Machine Learning with Python

Cognitive Class (IBM)

September 2023

Prompt Engineering

Futurense Technologies

February 2024

Data Analysis with Pandas and Python

Udemy

January 2023

Data Visualization and Storytelling

Futurense Technologies

January 2024

Data Warehousing and Business Intelligence

UC Irvine (Coursera)

December 2022

Introduction to AR and ARCore

Daydream (Coursera)

July 2023

Technical Skills

Programming Languages

Python SQL HTML CSS

Machine Learning Frameworks

PyTorch TensorFlow Scikit-learn Keras MONAI XGBoost

AI & NLP

LangChain LLMs HuggingFace Ollama RAG Architecture FAISS

Computer Vision

OpenCV CNNs 3D CNNs Medical Imaging Image Processing

Data Analysis & Visualization

Pandas NumPy Matplotlib Seaborn Power BI Plotly

Deep Learning

RNNs/LSTMs Transformers GANs Autoencoders Transfer Learning Fine-tuning

Backend & Databases

FastAPI PostgreSQL MySQL REST APIs SQLAlchemy

MLOps & Tools

Git Docker WSL2 Jupyter Streamlit MLflow

Specialized Skills

Agentic AI Model Context Protocol Ensemble Methods Medical AI Signal Processing Prompt Engineering

Research & Development

Kaggle Competitions Research Papers Prototyping A/B Testing Model Optimization

Experience & Education

2025 - 2026 (Expected)

Master's in Applied Machine Intelligence

Northeastern University, Boston, MA

  • Advanced coursework in Data Visualization, Enterprise Information Architecture
  • Research focus: Healthcare AI, Computer Vision, and NLP applications
  • Projects: College Scorecard Analysis, NYC 311 Analytics, Healthcare Queue Systems
January - May 2024

Data Science Intern

Futurense Technologies, Bangalore

  • Cleaned and analyzed large datasets from Indian census, housing and healthcare sectors
  • Visualized data to support healthcare policy, highlighting regions lacking hospital beds
  • Analyzed Seattle Airbnb data to extract insights on pricing, amenities, and user reviews
  • Ensured data accuracy and integrity while working with large-scale datasets
2020 - 2024

B.Tech in Computer Science (AI&ML)

Jain (deemed to be) University, Bangalore

CGPA: 8.76/10

  • Specialized in Artificial Intelligence and Machine Learning
  • Published 1 research paper on IEEE Xplore
  • Conducted research for 3 papers
  • Competed in Kaggle competitions, achieving Top 300 globally

Technical Blogs

Getting Started with Python for Data Analysis

A comprehensive guide to beginning your journey in data analysis with Python. Learn about essential libraries like Pandas, NumPy, and Matplotlib, and how to set up your environment for success.

Best Practices for Data Cleaning and Preprocessing

Essential techniques for ensuring your data is accurate, consistent, and ready for analysis. Learn how to handle missing values, outliers, and data transformation strategies.

Get In Touch

Let's Connect

Email

prathamchopra.me@gmail.com

Phone

+1 (781) 805-0647