About Navana.ai

At Navana, we are at the forefront of developing Voice AI solutions for Indic languages, helping large enterprises grow and innovate.

We build real-time voice AI systems that power critical customer interactions for leading enterprises. Our stack is built for low-latency, high-throughput workloads and is deployed both in our cloud and on customer infrastructure. As we scale, we are investing in the platform on which our speech and language models (STT, TTS, and SLMs) are trained, deployed, and run in production.


Core Responsibilities

  1. Build the MLOps stack from the ground up - experiment tracking, model registry, pipeline orchestration, artifact management, and data/model versioning.
  2. Evaluate and select tooling across categories (e.g., MLflow / W&B, Kubeflow / Argo Workflows / Airflow, DVC / LakeFS) and own the implementation.
  3. Design training pipelines that run reliably across hyperscalers (AWS / GCP / Azure) and neo-cloud GPU providers.
  4. Productionize and optimize STT, TTS, and SLM inference in customer on-prem environments, within tight latency budgets and on constrained hardware.
  5. Own GPU infrastructure end to end - drivers, CUDA, MIG partitioning, and mixed-GPU scheduling.
  6. Establish the MLOps practices that the rest of the company will build on.

Must-Have Qualifications

Nice-to-Have