Using Live Transcribe

Create and manage live transcribe jobs in the Speech service.

The live transcribe feature provides real-time transcription: you send an audio stream and receive text results while the audio is still being spoken. Real-time transcription is required for many use cases across industries such as healthcare, call centers, and media. For example, physicians and nurses use medical dictation, which depends on real-time capabilities to improve work efficiency. With live transcribe, your application receives accurate transcription results within a few seconds. You can try live transcribe in the embedded text window, or refer to the API documentation for information about how to programmatically integrate with the OCI real-time transcription service.

Creating a Live Transcribe Job

Create and submit a Speech live transcribe job to transcribe a live audio stream to text.

  1. Open the navigation menu and click Analytics & AI. Under AI Services, click Speech.
  2. In the left-side navigation menu, click Live transcribe.
  3. Under List Scope, select the compartment that you want to work in.
  4. (Optional) In the Configure transcription section, customize the transcription by selecting values for one or more of the following options:
    • Choose domain: Select the domain of the speech model to be used.
    • Choose language: Select the language to transcribe in.
    • Punctuation: Configure punctuation in the generated transcriptions. Three options are available: None for no punctuation (the default value), Auto to insert punctuation automatically, and Spoken to insert punctuation marks when they are spoken aloud.
    • Partial silence threshold: Enter how long, in milliseconds, the service waits for additional speech, after it stops detecting speech activity, before ending the speech recognition.
    • Final silence threshold: Enter how long, in milliseconds, the service waits in silence after a word is spoken before ending the session.
    • Partial result stability: Select the amount of confidence required for the latest tokens before returning them as part of a new partial result.
    • Enable customizations: Select this checkbox to customize the session.
      • Select the customization to use. Change compartments if necessary.
      • Click Override entities. Only customizations containing multiple entity lists can be overridden.
  5. To start a session, click Start session, and begin to speak.
  6. To stop a session, stop speaking and then click Stop session.
  7. (Optional) To view the JSON file, click View JSON.
  8. (Optional) To reset the session, click Reset.
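
If you plan to move from the console to a programmatic integration, it can help to collect the options from step 4 in one validated structure before opening a session. The following is a minimal sketch only. The class name, field names, and the sample values "GENERIC" and "en-US" are hypothetical and chosen to mirror the console labels; they are not the Speech API's actual parameter names, so check the API documentation for those. The only facts taken from this page are the three punctuation choices, the None default for punctuation, and that both silence thresholds are expressed in milliseconds.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Optional

# The three punctuation choices listed in the console (None is the default).
PUNCTUATION_MODES = ("NONE", "AUTO", "SPOKEN")


@dataclass
class LiveTranscribeSettings:
    """Hypothetical container mirroring the 'Configure transcription' options."""

    domain: str                       # "Choose domain"
    language: str                     # "Choose language"
    punctuation: str = "NONE"         # "Punctuation"
    partial_silence_threshold_ms: Optional[int] = None  # milliseconds
    final_silence_threshold_ms: Optional[int] = None    # milliseconds
    partial_result_stability: Optional[str] = None      # left unvalidated here
    customization_ids: List[str] = field(default_factory=list)

    def validate(self) -> None:
        """Raise ValueError if an option is outside what this page describes."""
        if self.punctuation not in PUNCTUATION_MODES:
            raise ValueError(
                f"punctuation must be one of {PUNCTUATION_MODES}, "
                f"got {self.punctuation!r}"
            )
        for name in ("partial_silence_threshold_ms", "final_silence_threshold_ms"):
            value = getattr(self, name)
            if value is not None and (not isinstance(value, int) or value < 0):
                raise ValueError(f"{name} must be a non-negative integer")

    def to_json(self) -> str:
        """Validate, then serialize only the options that were actually set."""
        self.validate()
        data = {
            key: value
            for key, value in asdict(self).items()
            if value is not None and value != []
        }
        return json.dumps(data, sort_keys=True)


if __name__ == "__main__":
    settings = LiveTranscribeSettings(
        domain="GENERIC",      # placeholder value
        language="en-US",      # placeholder value
        punctuation="AUTO",
        partial_silence_threshold_ms=500,
    )
    print(settings.to_json())
```

Options left unset are omitted from the serialized output, so the service's own defaults would apply to them instead of values guessed on the client side.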