Active Research
Khasi TTS
Speech synthesis in active iteration — including studio-grade recording collaboration with Radio Punjab for smooth takes, clean frequency response, and natural voice quality in training data.
Flagship Platform by Medharvix Systems Private Limited
Bhasaflow builds AI access for underserved Indian languages, starting with Khasi. We turn language research into real systems for translation, speech, OCR, and open resources.
Khasi-first. Research-driven. Built for real public use.
About Bhasaflow
Bhasaflow is a research-led language AI platform focused on practical impact for communities that are typically left behind by mainstream models. It bridges languages, people, and AI systems through deployable translation models, speech technology research, OCR pipelines, and open language resources.
The long-term vision is clear: AI useful for every Indian, without language barriers.
Flagship Model
Khasi machine translation model, fine-tuned from NLLB.
BLEU Score on Khasi MT: 48
Fine-tuning Foundation: NLLB
Current Flagship Generation: V4
Research and Traction
Datasets and Resources
Curated sample set for rapid experimentation and evaluation of Khasi-English MT.
Open Dataset ↗
Parallel corpus for model training, alignment tasks, and translation benchmarking.
Open Corpus ↗
Monolingual Khasi corpus for linguistic modeling and downstream language tasks.
Open Corpus ↗
Live Product Experience
Experience Bhasaflow translation in production through the official Hugging Face Space.
Open Bhasaflow Translator
Speech and OCR Roadmap
Launching 7 June
Automatic speech recognition stack for Khasi is nearing release.
Launching 25 June
Optical character recognition pipeline under focused development.
Recognition
Bhasaflow is under second-phase evaluation as Team Techno Tuners, validating real-world relevance of our Khasi-focused language systems.
Ecosystem
Team Techno Tuners, Medharvix Systems, academic partners, and regional collaborators together shape this language AI ecosystem.
Student collaborators contribute as individuals; this is not a formal university partnership or endorsement.
Institutional and Media Collaborators
With Radio Punjab, we collaborate on text-to-speech data and recording workflows: controlled studio conditions, smooth session flow, and capture that preserves high-frequency detail and warm voice character — so models learn from audio that sounds good in the real world, not just on paper.
Build Language Equity With Us
Innovators and contributors can reach us through the dedicated contact page. Form submissions are saved to Firebase once it is configured.