Mani Smaran Nair

Machine Learning Engineer

LinkedIn | GitHub

About

Highly skilled Machine Learning Engineer and MSc Data & Computer Science candidate with expertise in designing and deploying advanced Large Language Model (LLM) and Natural Language Processing (NLP) solutions. Proven ability to leverage Transformers, RAG, and production-ready ML pipelines (PyTorch, Hugging Face) to deliver scalable, high-impact AI applications. Adept at bridging research and deployment, driving innovation in applied NLP, conversational AI, and speech technology.

Work Experience

Research Assistant

DKFZ German Cancer Centre

Jan 2025 - Jul 2025

Heidelberg, Baden-Württemberg, DE

Conducted advanced research in deep neural networks and variational inference to improve model generalization for regulatory region tasks at a leading cancer research center.

  • Integrated a Foundational model using Borzoi-generated embeddings to train a deep neural network, significantly improving model generalization across diverse regulatory region tasks.
  • Developed a modular guide class (AmortizedNormal) to seamlessly combine LLM embeddings with variational inference in NumPyro, streamlining complex model architectures.
  • Performed scalable Stochastic Variational Inference (SVI) on over 20,000 DNA sequences, boosting ELBO convergence speed by ~30%.

Machine Learning Engineer – GenAI Systems

Freudenberg Innovation

Aug 2024 - Jul 2025

Weinheim, Baden-Württemberg, DE

Led the development and deployment of GenAI systems, focusing on real-time RAG pipelines and LLM-based solutions for knowledge extraction.

  • Built and deployed a real-time Retrieval-Augmented Generation (RAG) pipeline with FastAPI, Sublime, and LangChain, retrieving enterprise knowledge from over 10,000 records.
  • Collaborated cross-functionally on internal GenAI initiatives, including LLM-based solutions, to enhance knowledge extraction capabilities.
  • Conducted user interviews, documented workflows, and facilitated workshops with cross-functional teams to gather requirements and ensure alignment for AI applications.
  • Developed comprehensive test cases and automated evaluation pipelines for pilot AI applications, ensuring robust performance and reliability.
  • Synthesized complex technical and business feedback into structured reports to guide strategic deployment and iterative improvements.

Research Assistant

Heidelberg University

Jul 2024 - Feb 2025

Heidelberg, Baden-Württemberg, DE

Analyzed patient-question interactions and enhanced predictive accuracy through advanced probabilistic modeling and data visualization techniques.

  • Analyzed 15,000 patient-question interactions using advanced probabilistic response models to derive critical insights.
  • Improved predictive accuracy by 18% compared to baseline cognitive models, demonstrating enhanced model performance.
  • Utilized Seaborn and Matplotlib visualizations to conduct comprehensive model performance comparisons and present findings effectively.

Research Assistant

Central Institute of Mental Health

Aug 2023 - Dec 2023

Mannheim, Baden-Württemberg, DE

Managed and automated fMRI data preprocessing pipelines for a large patient cohort, significantly enhancing efficiency and data consistency.

  • Processed fMRI data from over 15,000 patients leveraging BIDS pipelines, ICA, and BET for precise brain extraction.
  • Automated critical data preprocessing pipelines, reducing data preparation time by 50% and ensuring high consistency for research studies.

Education

Data and Computer Science

Heidelberg University

Sep 2023

Heidelberg, Baden-Württemberg, DE

Courses

  • Natural Language Processing
  • Machine Learning
  • Deep Learning
  • Artificial Intelligence
  • Probabilistic programming languages
  • Software Engineering
  • Computer Vision
  • LLM

Computer Science & Engineering

Presidency University

Sep 2019 - Jun 2023

Bangalore, Karnataka, IN

Courses

  • Machine learning
  • DBMS
  • SQL

Certificates

Getting started AWS machine learning

Coursera

Dec 2021

Sentiment Analysis using NLTK

Coursera

Dec 2021

XOBIN Bootcamp

XOBIN

Dec 2019

Projects

Fullstack RAG Chatbot using Langchain

Jan 2023 - Jun 2024

Developed a fullstack Retrieval-Augmented Generation (RAG) chatbot leveraging Langchain for intelligent document querying.

Food Captioning Using Diffusion-Augmented COCO Dataset

Jan 2023 - Jun 2024

Implemented a food captioning model using a Diffusion-Augmented COCO Dataset to enhance image understanding and descriptive capabilities.

Skills

Programming & Automation

  • Python
  • R
  • Shell scripting
  • REST APIs
  • Workflow orchestration

AI/ML Frameworks

  • PyTorch
  • TensorFlow
  • NumPyro
  • LangChain
  • LangGraph
  • JAX
  • Deep Learning

AI Systems

  • RAG pipelines
  • LLM fine-tuning
  • Agent-based automation
  • Predictive modeling

Data Handling

  • Pandas
  • NumPy
  • SQL
  • NoSQL
  • Data preprocessing & visualization
  • Matplotlib
  • Seaborn

Development Practices

  • Agile/Scrum
  • CI/CD
  • Git
  • Containerized environments (Docker)

Mentoring & Collaboration

  • Cross-functional training
  • AI adoption advocacy
  • Technical workshops

Interests

Hobbies

  • Reading books
  • Football
  • Trekking