Silero Extension

Silero VAD

The Silero Voice Activity Detector Extension makes easy to detect speech in an audio signal. You can find more information at https://github.com/snakers4/silero-vad

Example

ONNX.SileroVAD

In this example, we are extending an ONNX.MLSink to create an ONNX.SileroVAD node which we can use to detect specch on an incoming signal. The speech prediction is presented as a percentage.

Switchboard Editor Preview
Switchboard JSON

{
  "title": "ONNX.SileroVAD example",
  "nodes": [
    {
      "id": "superpoweredPlayer1",
      "type": "Superpowered.AdvancedAudioPlayer",
      "uiConfig": {
        "tracks": [
          {
            "label": "Clean speech",
            "url": "https://switchboard-sdk-public.s3.amazonaws.com/assets/audio/speech/clean-voice.wav"
          },
          {
            "label": "Noisy speech",
            "url": "https://switchboard-sdk-public.s3.amazonaws.com/assets/audio/speech/Noisy-Speech.wav"
          },
          {
            "label": "Solo guitar",
            "url": "https://switchboard-sdk-public.s3.amazonaws.com/assets/audio/music/guitar-48k.wav"
          },
          {
            "label": "Music",
            "url": "https://switchboard-sdk-public.s3.amazonaws.com/assets/audio/music/elevator-music-stereo.mp3"
          }
        ]
      }
    },
    {
      "id": "onnxNode",
      "type": "ONNX.SileroVAD",
      "assets": {
        "model": "https://switchboard-sdk-public.s3.amazonaws.com/assets/models/onnx/silero_vad.with_runtime_opt.ort"
      }
    }
  ],
  "connections": [
    {
      "sourceNode": "superpoweredPlayer1",
      "sourceBusIndex": 0,
      "destinationNode": "onnxNode",
      "destinationBusIndex": 0
    },
    {
      "sourceNode": "superpoweredPlayer1",
      "sourceBusIndex": 0,
      "destinationNode": "outputNode",
      "destinationBusIndex": 0
    }
  ]
}

Node types

The Silero Voice Activity Detector Extension provides the following audio nodes for a Switchboard SDK audio graph:

Node	Description
Onnx.SileroVad	A sink node that detects speech.

Silero TTS

The Silero Text-to-Speech Extension makes easy to convert text to speech audio signal on the device. You can find more information at https://github.com/snakers4/silero-models.

This Silero Text-to-Speech Extension provides the following audio nodes for a Switchboard SDK audio graph:

Node	Description
SileroTTSNode	A source node that converts text into speech audio signal.

Silero STT

The Silero Text-to-Speech Extension makes it easy to convert speech audio signals to text on the device. You can find more information at https://github.com/snakers4/silero-models.

This Silero Speech-to-Text Extension provides the following audio nodes for a Switchboard SDK audio graph:

Node	Description
SileroSTTNode	A sink node that converts the speech audio signal into text.

Download SDK Extension

You can download this SDK extension from our Downloads Page.

Visit the page to access the latest version and start integrating it into your project!

Silero VAD​

Example​

ONNX.SileroVAD​

Node types​

Silero TTS​

Silero STT​

Download SDK Extension​