Transcribe GitHub Files

Step 1: Upload Your Audio Files to a Public GitHub Repository

File Requirements: GitHub has a file size limit of 100MB so ensure your audio files are 100MB in size or less. The files must be in a public repository otherwise you will receive an error saying the file is not publicly accessible. For a more secure way to host files check out our Transcribing from an S3 Bucket Cookbook.

Navigate to the repository that houses the audio file.
Click on the audio file. On the next page, right-click the “View raw” link and select “copy the link address” from the context menu.

Downloadable file URLs are formatted as "https://github.com/<github-username>/<repo-name>/raw/<branch-name>/<file-name-and extension>"

POST v2/transcript endpoint

{
    "audio_url":"https://github.com/user/audio-files/raw/main/audio.mp3"
}

Python SDK

config = aai.TranscriptionConfig()
transcript = transcriber.transcribe("https://github.com/user/audio-files/raw/main/audio.mp3", config)

JavaScript SDK

const transcript = await client.transcripts.transcribe({
  audio_url: "https://github.com/user/audio-files/raw/main/audio.mp3"
});

⌘I