transcript_extraction_dev

Extract a conversation in text from an audio file. This project attempts to label speakers and evaluates two different approaches for performance comparison.

Repository Structure

src/: Core Python scripts for transcription and annotation.
- transcribe.py: Approach 1 (Whisper + simple-diarizer + LLM post-processing).
- transcribe_v2.py: Approach 2 (WhisperX).
- annotate_interview.py: LLM-based speaker naming utility.
tests/: Test scripts (e.g., Langfuse connection test).
data/: (Gitignored) All audio files, intermediate data, and final transcripts.
- source/: Input audio files (.aac).
- approach_1/:
  - working/: Intermediate files and checkpoints for Approach 1.
  - output/: Final annotated results for Approach 1.
- approach_2/:
  - output/: Final results for Approach 2.
docker-compose.yml: Local Langfuse setup.

Setup

Install dependencies:
```
poetry install
```
Configuration: Copy .env.example (if provided) to .env and set your API keys:
- ANTHROPIC_API_KEY: For LLM processing.
- LF_SKEY, LF_PKEY, LF_HOST: For Langfuse tracing.
Ffmpeg: Ensure ffmpeg is installed on your system.

Usage

Place your audio files in data/source/ and run the scripts from the project root:

# Run Approach 1
python src/transcribe.py

# Run Approach 2
python src/transcribe_v2.py

Results will be generated in their respective subdirectories under data/.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

transcript_extraction_dev

Repository Structure

Setup

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

transcript_extraction_dev

Repository Structure

Setup

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages