This project involves generating transcripts from audio recordings using speech recognition technology. The system converts spoken language into text, making it easier to review and analyze spoken content.
To set up the Speech Transcript Generation system on your local machine, follow these steps:
git clone https://github.com/username/speech-transcript-generation.git
cd speech-transcript-generation
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -r requirements.txt
To start generating transcripts from speech, run the main script and upload your audio recordings through the web interface:
python app.py
Access the application via your web browser, upload an audio file, and receive a generated transcript.
The system provides accurate text transcripts of spoken audio, which can be reviewed, edited, and saved for future reference.