--- schedule: daily enabled: true --- Transcribe today's meetings from screenpipe audio recordings using Microsoft Azure Speech-to-Text, then upload the transcripts to a centralized location. ## Task 1. Query screenpipe for all audio recordings from today (full workday: 8am to 6pm) 2. For each audio chunk, collect the transcription text, speaker info, and timestamps 3. Group consecutive audio chunks into "meetings" — a meeting is a continuous stretch of audio with gaps no longer than 5 minutes 4. For each detected meeting, call the Azure Speech-to-Text API to re-transcribe the source audio file using the custom voice model 5. Write each meeting transcript to the output directory AND upload to the centralized endpoint ## Search API ``` GET http://localhost:3030/search?content_type=audio&start_time=&end_time=&limit=200 ``` Extra params: `q` (keyword), `speaker_name`, `offset` (pagination). Full API reference: https://docs.screenpi.pe/llms-full.txt ## Azure Speech-to-Text Use the Azure Speech REST API to transcribe each meeting's source MP4 file with your custom voice model. **Batch transcription endpoint:** ``` POST https://.api.cognitive.microsoft.com/speechtotext/v3.2/transcriptions Authorization: Ocp-Apim-Subscription-Key Content-Type: application/json { "contentUrls": [""], "locale": "en-US", "displayName": "Meeting