Namratha Tiptur Manjunath

Data Engineer & ML Enthusiast

Building intelligent systems and transforming data into insights

Seeking Co-op/Internship | Summer 2026

Roles Available

Data Engineer ML Engineer Data Analyst Data Scientist Software Engineer

Duration

Summer 2026 (May - August)

Full-time • On-site/Remote

Recent Activities

KNARV Team at Graduate Case Competition Fall 2025

Graduate Case Competition Fall 2025

Syracuse University - Martin J. Whitman School of Management

Fall 2025

Excited to share that our team, KNARV, was selected to compete in the Graduate Case Competition Fall 2025 at Syracuse University - Martin J. Whitman School of Management.

We worked on a real business challenge for Spatchcock Funk, a lifestyle food brand. Our team analyzed their audience, digital presence, and growth opportunities, presenting strategic recommendations with data-backed insights and measurable KPIs.

This experience strengthened my interest in data-driven strategy and business innovation. Grateful for the opportunity to apply analytics to real-world challenges.

About

Namratha Tiptur Manjunath

I'm a data engineer and ML enthusiast, currently pursuing my Master's at Syracuse University. I work with healthcare data, build ETL pipelines, and design machine learning models. Before grad school, I spent a year as a data engineer at Carelon Global Solutions in Bangalore, processing terabytes of data daily.

I enjoy the problem-solving aspect of data work. Whether it's optimizing SQL queries, building pipelines, or training ML models, I like figuring out how to make things work better. I also work on side projects and participate in case competitions to keep learning.

I'm looking for opportunities to apply what I've learned in real-world settings, whether that's through internships, co-ops, or collaborative projects.

Education

Dec 2026

Master of Science, Information Systems

Syracuse University

Coursework: Machine Learning, Data Analytics, Database Management, Business Intelligence

Aug 2023

Bachelor's of Technology, Computer Science and Engineering

Reva University

GPA: 3.5/4.0

Coursework: Object Oriented Programming, Data Structures, Machine Learning for Data Analytics, Computer Networks, Augmented and Virtual Reality, Digital Logic Design, Python for Data Analysis, Cryptography and Network Security, Artificial Intelligence, Database Management System, Java, Embedded System Design, Mobile Application Development, Cloud Computing and Big Data, Probability and Statistics, Computer Organization and Architecture, Data Science

Experience

Aug 2023 - Aug 2024

Associate Software Engineer (Data Engineering & Analytics)

Carelon Global Solutions · Bangalore, India

  • Architected and optimized ETL pipelines using Informatica and Python (Pandas, NumPy) to integrate 20+ data sources, processing 1TB+ daily with 99.5% data accuracy
  • Reduced data processing time by 40% through implementation of parallel processing and SQL query optimization techniques in MySQL
  • Developed automated data quality monitoring using Python scripts, identifying and resolving anomalies that improved data quality by 30%
  • Built 15+ SQL queries and Python analytics scripts for generating automated reports, reducing manual reporting time by 25 hours/week
  • Collaborated with cross-functional analytics teams using Tableau dashboards to deliver actionable insights, supporting 10+ strategic business decisions
  • Applied data transformation and cleansing rules to standardize healthcare datasets, ensuring compliance with data governance standards
Sep 2022 - Nov 2022

Data Analyst Intern

Belgian Waffle Factory · Bangalore, India

  • Extracted and transformed 50K+ records using SQL (MySQL) and Tableau Prep Builder, creating a unified data model for business analytics
  • Developed 8 interactive Tableau dashboards tracking sales trends, inventory metrics, and operational KPIs, enabling data-driven decision-making for management
  • Automated weekly reporting processes using SQL and Python, reducing report generation time by 60%
  • Implemented data validation scripts in Python to ensure 99% data integrity across critical business metrics
  • Performed exploratory data analysis (EDA) in Excel and Python to identify revenue optimization opportunities worth ₹50K+ quarterly

Projects

US County Health Outcomes Dashboard

Interactive Tableau dashboard analyzing health outcomes and prevention measures across US counties with 29+ health metrics, maps, and dynamic parameters. Built comprehensive visualizations tracking health disparities, prevention measures, and public health indicators across 3,000+ US counties. Implemented dynamic filtering and parameter controls for interactive exploration of county-level health data, enabling data-driven insights for public health research and policy decisions.

ContentCrew – Multi-Agent Content Creation System

Multi-agent content creation system built with crewAI featuring collaborative AI agents for research, writing, editing, fact-checking, and content strategy. Designed and implemented a sophisticated multi-agent workflow where specialized AI agents collaborate to produce high-quality content. Each agent has distinct roles and capabilities, working together to research topics, draft content, perform quality edits, fact-check information, and develop content strategies. Built with Python, crewAI framework, and integrated with OpenAI APIs to enable seamless agent collaboration and content generation workflows.

Cogni Research – AI-Powered Research Assistant

AI-powered research assistant using LangGraph, Claude, and MCP that automatically researches topics, synthesizes information, and generates comprehensive reports with citations. Built an intelligent research pipeline that leverages LangGraph for workflow orchestration, Anthropic's Claude for advanced reasoning, and Model Context Protocol (MCP) for enhanced capabilities. The system performs automated research across multiple sources, synthesizes findings, and generates well-cited research reports. Implemented with FastAPI, Streamlit for the interface, ChromaDB for vector storage, and integrated Tavily for web search capabilities.

REL
GRD
ATT
COV
CON

RAG Observatory – LLM Evaluation & Monitoring System

Designed end-to-end Retrieval-Augmented Generation (RAG) observability framework with custom evaluation metrics to assess LLM reliability and detect hallucinations. Implemented 5 evaluation metrics: document relevance scoring, query coverage analysis, answer grounding rate, attribution accuracy, and response consistency. Built real-time monitoring dashboard using Streamlit to visualize retrieval quality, grounding trends, and query performance across 1,000+ test queries. Achieved 15% improvement in hallucination detection through attribution scoring innovation and semantic similarity analysis. Logged and aggregated query-level data using Python and JSON to identify systemic weaknesses and optimize retrieval strategies.

PCOS Subtype Discovery & Clustering Robustness Analysis

Reproduced and extended Nature Medicine (2025) research to identify 4 clinically distinct PCOS subtypes using unsupervised learning on multi-dimensional clinical data. Evaluated 5 clustering algorithms (K-means, Hierarchical, DBSCAN, GMM, Spectral) using silhouette scores, ARI, and bootstrap validation. Quantified model robustness via 100 bootstrap iterations and multi-seed analysis, achieving 82% stability and 0.73 ARI score. Implemented uncertainty-aware classification, flagging 27% ambiguous cases to improve clinical decision confidence. Validated cross-dataset generalization using external PCOS cohorts, demonstrating 78% subtype consistency.

Healthcare Analytics Portfolio

Designed 10+ advanced SQL analytics projects including care fragmentation analysis, diagnosis code drift detection, and health equity metrics. Built CDC PLACES data dashboard in Tableau visualizing depression prevalence, health risk behaviors, and social determinants across 3,000+ US counties. Created AI job landscape dashboard using BLS employment data, analyzing 50K+ job postings with treemaps, word clouds, and trend analysis.

Technical Skills

Programming

Python, Java, SQL, HTML, CSS, R Programming

ML & AI

Scikit-learn, NumPy, Pandas, TensorFlow, PyTorch, Keras, XGBoost

Data Analytics & BI Tools

Tableau, QuickSight, Power BI, Excel, R-Studio, Jupyter Notebook

Data Management & ETL

MySQL, MongoDB, Informatica, ETL pipeline development

Cloud & Tools

AWS (S3, EC2), GCP (BigQuery), Salesforce, Git, Anaconda

Soft Skills

Leadership, Problem-solving, Communication, Collaboration, Strategic Thinking

Certifications

Google Data Analytics Professional Certificate

Data Analytics & Business Intelligence with Advance EXCEL

IBM Data Science Professional Certificate (Coursera)

In progress

Get in Touch

I'm always open to discussing new opportunities and interesting projects.