tonym128's comments

tonym128 · on May 21, 2024

Take a look at https://80.lv/ There's a lot of game dev stuff on there, I'm subscribed to their Telegram chat feed and it's one of my favourites

https://t.me/LevelEightyNews

tonym128 · on April 18, 2024

A Telegram Bot to convert speech to text from small videos and audio files.

This bot uses Docker, Telegram, Python and Whisper.cpp to stand up a small telegram bot on your own infrastructure which can take media and convert it to text.

The setup is meant to be as simple and quick as possible, so it's not meant for production use. It is intended for personal use only.

Tested on AMD64, provided build scripts for ARM64 and ARM32v7.

There's a YouTube video of a full build, source and setup run through - https://youtu.be/Vo4OVdDRlcU

This was originally a Raspberry Pi 4 project I wrote and was manually running on a Pi. You can see a video about that here - https://www.youtube.com/watch?v=MjLzgebcDHo

And now it's a Docker container, which can run on a Pi, Mac and Windows PC!

tonym128 · on Aug 24, 2023

I'm sorry these are videos, but your question made me think of these.

If you're looking for programming inspiration I love these videos

Jon Bentley - Three Beautiful Quicksorts https://www.youtube.com/watch?v=aMnn0Jq0J-E

Bret Victor - Inventing on Principle https://www.youtube.com/watch?v=PUv66718DII

pbyte13 · on Aug 24, 2023

OP might also be interested in Bentley's book, Programming Pearls:

https://www.amazon.com/Programming-Pearls-2nd-Jon-Bentley/dp...

tonym128 · on Aug 24, 2023

With the new generation of machine learning is there a way to make endless stories ? Could they be unique and fun and have a moral grounding ? Could I run it all myself on a Raspberry Pi, hidden in a corner somewhere ?

tonym128 · on Dec 19, 2022

I have a pet hate, it's voice notes in WhatsApp or Telegram. Quite often the voice notes remain unheard for hours, due to the call to action (the notification) not letting me see what I need to react to, or if I'm in meetings and cannot listen for a period of time.

There are paid for services which can transcode speech to text but none free I could find. With the release of Whisper this has become something I thought could be solved with some minimal coding.

While Whisper relies on GPU's, Whisper.cpp does not and can run on a CPU with 1Gb ram (about 500mb for the model) enter the Pi 4.

I wrote a telegram bot in Python using python-telegram-bot which calls whisper.cpp to transcode speech to text. Here's my bot which is open to all, but you could start your own, with a Pi 4 and an always up connection, you can leave it running for when you need it.

Due to the constraints on the Pi 4, it only runs the English model and may result in errors for other languages.

Check my bot out here https://web.telegram.org/k/#@shhhhhhhhhhhhhhhhh_bot Check out Whipser here https://openai.com/blog/whisper/

Check out Whipser.cpp here https://github.com/ggerganov/whisper.cpp