Web Application for Automated Text-to-Speech and Audio Blending – Upwork

Project Overview:
We are developing a web application that generates speech audio from text using GPT and ElevenLabs, and automatically blends this speech with background music. The application will allow users to upload text or audio material, select specific schemas, and customize voice emotionality. The final output will be a blended audio file where the speech is clearly heard over the background music.

Responsibilities:

Front-End Development:
Implement a user-friendly interface based on a provided template (React.js or Vue.js preferred).
Integrate file upload functionality for text and audio materials.
Develop customizable options for users, such as selecting schemas, adjusting voice emotionality, and specifying the number of output variants.
Back-End Integration:
Set up API calls to OpenAI’s GPT for text generation and ElevenLabs for speech synthesis.
Implement automatic music-speech blending, possibly using a service such as Auphonic or a tool such as FFmpeg.
Manage the processing pipeline to ensure smooth and efficient generation of output audio files.
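
As a rough illustration of the FFmpeg option for the blending step, the sketch below builds an FFmpeg command that lowers the music level and mixes it under the speech track. The function name, file names, and the default music volume of 0.2 are assumptions made for this example and are not part of the project brief. FFmpeg's amix filter rescales its inputs by default, so the levels would likely need tuning by ear.

```python
from typing import List


def build_blend_command(
    speech_path: str,
    music_path: str,
    output_path: str,
    music_volume: float = 0.2,  # assumed default; tune so speech stays clearly audible
) -> List[str]:
    """Return an FFmpeg argument list that mixes speech over quieter music.

    Hypothetical helper for illustration only; it does not run FFmpeg itself.
    """
    if not 0.0 <= music_volume <= 1.0:
        raise ValueError("music_volume must be between 0.0 and 1.0")

    # Input 0 is the speech, input 1 is the music.
    # The music is attenuated, then both streams are mixed; duration=first
    # ends the output when the speech (the first amix input) ends.
    filter_graph = (
        f"[1:a]volume={music_volume}[bg];"
        "[0:a][bg]amix=inputs=2:duration=first:dropout_transition=0[out]"
    )

    return [
        "ffmpeg",
        "-y",  # overwrite the output file if it already exists
        "-i", speech_path,
        "-i", music_path,
        "-filter_complex", filter_graph,
        "-map", "[out]",
        output_path,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works without FFmpeg installed.
    print(" ".join(build_blend_command("speech.mp3", "music.mp3", "blended.mp3")))
```

On a server with FFmpeg installed, the returned list could be passed to subprocess.run(cmd, check=True) from a background job so that the web request is not blocked while the audio is processed.
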
Deployment:
Deploy the web app on a cloud platform (e.g., Heroku, AWS, Vercel).
Ensure that the app is scalable and can handle multiple concurrent users.

Requirements:

Proven experience with front-end frameworks (React.js or Vue.js).
Familiarity with API integration, particularly with OpenAI, ElevenLabs, and audio processing tools.
Experience in building responsive and user-friendly web applications.
Ability to work with cloud platforms for deployment and scaling.
Strong communication skills and the ability to work collaboratively.

Deliverables:

A fully functional web application as described above.
Documentation on how the app works, including setup and deployment instructions.
Ongoing support for a specified period post-deployment for any bug fixes or minor adjustments.

Budget: $350

Posted On: August 16, 2024 20:21 UTC
Category: Full Stack Development
Skills: ChatGPT API Integration, React, ElevenLabs, Audio & Music Software

Country: France
