We are seeking an experienced AI Engineer to build a machine learning transcription server that leverages the Facebook Seamless model. This project involves setting up a server that will be hosted by us and integrated with our Next.js application to provide real-time transcription services with speaker identification. The goal is to stream audio in chunks to the server, where it will be processed to deliver instant transcription. Additionally, at the end of each session, the system should generate an AI-powered summary of the conversation.

Key Project Deliverables:
Setup and Deployment of Transcription Server: Configure a server that uses the Facebook Seamless model for speech recognition and transcription.
Integration with Next.js Application: Ensure the server can stream audio from our Next.js app and return real-time transcriptions.
Speaker Identification Feature: Implement functionality to accurately identify and differentiate speakers in the transcription.
Real-Time Transcription: Develop the system to process streaming audio in real time, providing immediate text output.
Session Summary Generation: Utilize AI to create concise summaries of transcribed sessions, highlighting key points and topics.

Skills and Experience Required:
Expertise in AI and Machine Learning: In-depth knowledge of AI models, especially in speech recognition and NLP.
Experience with Facebook Seamless Model: Familiarity with implementing and optimizing the Facebook Seamless model for transcription purposes.
Proficiency in Server Setup and Management: Ability to set up a robust server that can handle streaming data and integrate seamlessly with web applications.
Next.js and Web Streaming Technologies: Strong background in Next.js development and experience with audio streaming technologies.
Data Processing and Analysis: Skills in processing and analyzing audio data to produce accurate transcriptions and meaningful summaries.
Communication and Documentation: Excellent communication skills to collaborate with our team and provide documentation on the setup and usage of the server.

Additional Information:
The ideal candidate should have a portfolio demonstrating experience with similar projects, specifically in real-time audio processing and AI-driven applications.
Please provide a brief proposal outlining your approach to the project, any tools or technologies you plan to use, and examples of past work in transcription or speech recognition.

If you are interested in this project and have the necessary skills, please submit your application with a detailed proposal and portfolio. We look forward to working with an innovative AI engineer who can help bring this project to fruition.

Budget: $2,000

Posted On: February 17, 2024 08:21 UTC
Category: Full Stack Development
Skills: TypeScript, API Development, Python, Machine Learning, Node.js, Artificial Intelligence

Country: United States
