
Soham Waghmare
GenerativeAI • Deep Learning • Cloud
building wifui
ML / GenAI


Py • Ts • Js • CSS
GramLearn
Master English effortlessly with GramLearn: Your conversational grammar coach, now with offline RAG-based LLM pipeline.

Py • Js • CSS
Sanjeevani
Medicinal Leaf Recognition System. A ground up VGG16-based model for identifying medicinal leaves.

Py
Face Authentication with Accessory Detection
A secure face authentication system with accessory detection. Implements Mediapipe, FaceNet, Dlib, Roboflow Models, and YOLO.

Py
Flight Booking Chatbot
An intelligent CLI chatbot that facilitates flight searches and bookings, leveraging advanced function-calling techniques and LLMs, integrating RapidAPI.
Tech Stack
AI
LanggraphLangchainTransformersOllamaWhisperOpenCVPyTorchCNN
Backend
FlaskFastAPISocketIOWebRTC
Databases
FAISSMongoDBPostgreSQLMySQLSQLite3
Cloud
AWSAzureGCPVercelRender
Tools & DevOps
NeoVimGitDockerLinuxBashTerraformAnsibleJenkins
Development

Rust
WifUI
A blazing fast, lightweight Terminal User Interface (TUI) for managing Wi-Fi connections on Windows. Keyboard-centric and efficient.

Ts • Js • CSS
GeminiMeetings
Seamless collaboration for devs and productive teams. Using WebRTC and Socket.IO for real-time communication.

Ts • Js • CSS
GeminiWear
Experience the epitome of fashion with an exclusive E-Commerce store for a premium clothing brand.

Ts • Js • CSS
Formbuddy
One stop solution to all your form filling needs. Stores your data in your own Google Drive.
Others
A simple and fast app that delivers the latest news from various sources in one place.
Experience
- Style Transfer & SLM Fine-tuning: Fine-tuned llama3-8b for semantic text translation to Dutch.
- Interactive Flight Booking Chatbot: Developed an intelligent chatbot that facilitates flight searches and bookings, leveraging advanced function- calling techniques and LLMs, handling real-time queries with JSON-based workflows.
- Large-Scale Translation (Project Indus): Spearheaded the translation of English data into Hindi and 3 dialects (Bhojpuri, Maithili, Dogri) for TechM’s SFT project. Achieved translation speeds of up to 30,000 rpm using parallel processing.
- AI Voice Coach Development: Created an AI-based voice comparison algorithm using metrics like pitch, volume, and fluency. Integrated advanced speech analysis technologies like Whisper, spaCy, and librosa, providing real-time language assessment.
- Spearheaded collaboration efforts at a startup, orchestrating a team to conceptualize and execute the design and development of the company’s website with chatbot utilizing ReactJS.
- Integrated the chatbot with 48 backend APIs and executed Google Authentication to fortify the Login/Signup processes for enhanced security.
- Oversaw the hosting of the website on AWS EC2 with SSL encryption, CI/CD, taking charge of managing AWS cloud servers, encompassing backup strategies and seamless migration processes.
- Managed domain and DNS configuration via Cloudflare to optimize website performance and bolster security measures.
Achievements

Copyright - Print Medium Summarization Framework With Advanced Language Parsing
Diary: 13323/2024-CO/SW

