Metadata-Version: 2.2
Name: dora-vad
Version: 0.3.10
Summary: Dora Node for Text translating using Argostranslate
Author-email: Haixuan Xavier Tao <tao.xavier@outlook.com>
License: MIT
Requires-Python: >=3.8
Description-Content-Type: text/markdown
Requires-Dist: dora-rs>=0.3.9
Requires-Dist: numpy<2.0.0
Requires-Dist: silero-vad>=5.1

# Speech Activity Detection(VAD)

This is using Silero VAD.

It detects beginning and ending of voice activity within a stream of audio and returns the parts that contains activity.

There's a maximum amount of voice duration, to avoid having no input for too long.

## Input/Output Specification

- inputs:
  - audio: 8kHz or 16kHz sample rate.
- outputs:
  - audio: Same as input but truncated

## YAML Specification

```yaml
- id: dora-vad
  description: Voice activity detection. See; <a href='https://github.com/snakers4/silero-vad'>sidero</a>
  build: pip install dora-vad
  path: dora-vad
  inputs:
    audio: dora-microphone/audio
  outputs:
    - audio
```

## Reference documentation

- dora-sidero
  - github: https://github.com/dora-rs/dora/blob/main/node-hub/dora-vad
  - website: http://dora-rs.ai/docs/nodes/sidero
- Sidero
  - github https://github.com/snakers4/silero-vad

## Examples

- Speech to Text
  - github: https://github.com/dora-rs/dora/blob/main/examples/speech-to-text
  - website: https://dora-rs.ai/docs/examples/stt

## License

The code and model weights are released under the MIT License.
