askill
walkie-talkie

walkie-talkieSafety 90Repository

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

0 stars
1.2k downloads
Updated 2/14/2026

Package Files

Loading files...
SKILL.md

Walkie-Talkie Mode

This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.

Workflow

  1. Incoming Audio: When a user sends an audio/ogg/opus file:

    • Use tools/transcribe_voice.sh to get the text.
    • Process the text as a normal user prompt.
  2. Outgoing Response:

    • Instead of a text reply, generate speech using bin/sherpa-onnx-tts.
    • Send the resulting .ogg file back to the user as a voice note.

Triggers

  • User sends an audio message.
  • User says "activa modo walkie-talkie" or "hablemos por voz".

Constraints

  • Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
  • Maintain a fast response time (RTF < 0.5).
  • Always reply with BOTH text (for clarity) and audio.

Manual Execution (Internal)

To respond with voice manually:

bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"

Then send /tmp/reply.ogg via message tool with filePath.

Install

Download ZIP
Requires askill CLI v1.0+

AI Quality Score

62/100Analyzed 2/22/2026

Practical skill document for WhatsApp voice-to-voice automation with clear workflow structure and triggers. Lacks setup instructions and has mismatched tags. The transcribe→TTS pattern is valid but tightly coupled to specific local tools. Score reflects useful but incomplete documentation."

90
80
50
60
60

Metadata

Licenseunknown
Version-
Updated2/14/2026
Publisherdelorenj

Tags

github-actionsprompting