Job Details

Machine Learning
Remote
Full time
May 2

Software Engineer ML (Production / Speech & Audio)

$5,000

An international product IT project is looking for a full-time Software Engineer ML (Production / Speech & Audio). Salary from $3500 to $5000. Hybrid format with subsequent remote work.

We are a product team creating an intelligent cloud telephony ecosystem for the US and Canadian markets. Our product is a fault-tolerant platform with millions of traffic turnovers. ML is not an auxiliary feature for us, but the foundation of the system, operating in real-time. We are looking for an engineer who thoroughly understands the internal architecture of audio models and is ready to be responsible for their operation in a high-load production environment. Responsibilities: - Development of the AMD (Answering Machine Detection) system: retraining and tuning models for real-time call classification (distinguishing humans from answering machines/IVR). - Full-cycle development: from collecting and "dirty" labeling of audio data to deployment and threshold calibration in production. - Integration into the Core product: transferring ML components into the backend infrastructure (C# / SIP / RTP stack) via ONNX Runtime. - Latency optimization: fighting for milliseconds in audio streaming conditions. - Deep Analysis: identifying errors and analyzing complex edge cases in real call scenarios. - Research (R&D): experiments with noise reduction, VAD, and new speech processing architectures.

We expect: - 2+ years of experience in production ML (when your model actually worked with users). - Practical experience with Speech/Audio: understanding how audio features and modern sound processing architectures work. - Engineering approach (QA-mindset): you are genuinely interested in "digging" into data anomalies and stress-testing the system. - Understanding of classics and modern approaches: Fine-tuning, Transfer Learning, and the ability to work with metrics (Precision/Recall, ROC-AUC, Calibration). - Ability to work end-to-end: from raw files to optimized inference. What's important: - Engineering autonomy: we value those who find problems themselves and bring solutions to production. - Background: we highly welcome candidates who came to ML from Backend or QA; we value code and testing culture. - Readiness for dynamics: the project is growing, there are many tasks, and they directly impact the business. Will be a plus: - Experience in Speech/Audio domain (ASR, VAD, Audio Classification). - Understanding of VoIP specifics and stream data processing. - Experience with MLOps and model monitoring tools.

Conditions: - Mandatory offline onboarding in Tashkent (2-3 months) for product immersion, followed by full remote work. - Real production tasks in an international, high-load product. - Opportunity for professional growth and compensation review as tasks become more complex. - Work in a team with strong engineering expertise and no bureaucracy.

MFCC
Quantization
ONNX
C#
embeddings
Production
Python
wav2vec 2.0
HuggingFace Transformers
Audio
whisper
rtp
sip
spectrograms
Speech
ml
ONNX Runtime

Don't miss a single job

Subscribe to our Telegram channel

Subscribe

Similar jobs

Software Engineer ML (Production / Speech & Audio)

$5,000

Software Engineer ML (Production / Speech & Audio) position with a starting salary from $3500 to $5000, discussed individually.

N
NDA

Software Engineer ML (Production / Speech & Audio)

$5,000

International IT product company seeks a Software Engineer ML with 2+ years of experience in production ML, focusing on Speech & Audio. Full-time, hybrid role starting in Tashkent, then fully remote. Salary $3500-$5000.

I
International product IT project (VoIP / Cloud Telephony)

ML Engineer (Production, ONNX, Remote)

$5,000

International product IT project is looking for an ML engineer to work on the AMD system. Starting from $3500 to $5000. Hybrid format with subsequent full remote work.

М
Международный продуктовый IT-проект