More

akadeb · 2026-01-14T19:39:48 1768419588

https://www.akadeb.xyz

akadeb · 2025-06-18T11:24:17 1750245857

I would highly recommend gemini 2.5 pro too for their speech quality. It's priced lower and the quality is top notch on their API. I made an implementation here in case you're interested https://www.github.com/akdeb/ElatoAI but its on hardware so maybe not totally relevant

koakuma-chan · 2025-06-18T11:33:52 1750246432

I'm using LiveKit, and I indeed have tested Gemini, but it appears to be broken or at least incompatible with OpenAI. Not sure if this is a Livekit issue or a Gemini issue. Anyway I decided to go back to just using LLM, SST and TTS as separate nodes, but I've also been looking into Deepgram Voice Agent API, but LiveKit doesn't support it (yet?).

akadeb · 2025-05-30T01:53:45 1748570025

I like the sound of that! I think youre gonna like what we are building here https://github.com/akdeb/ElatoAI

Its as if the rubber duck was actually on the desk while youre programming and if we have an MCP that can get live access to code it could give you realtime advice.

akshay_trikha · 2025-05-30T02:40:56 1748572856

Wow, that's really cool thanks for open sourcing! I might dig into your MCP I've been meaning to learn how to do that.

I genuinely think this could be great for toys that kids grow up with i.e. the toy could adjust the way it talks depending on the kids age and remember key moments in their life - could be pretty magical for a kid

akadeb · 2025-04-25T11:20:02 1745580002

I understand, is it the realtime conversational aspect or just in general you wouldn't want a child to play with a TTS-like service?

akadeb · 2025-04-25T11:18:50 1745579930

Murphy's law

akadeb · 2025-04-25T11:17:57 1745579877

thanks for checking it out Bert

akadeb · 2025-04-25T11:16:56 1745579816

Hi Mr. teddy bear!

Hey there buddy! Have you tried brushing with Sensodyne now available at your nearest CVS only for $9.99!

akadeb · 2025-04-23T15:44:54 1745423094

Thank you! It's been super fun to work on. The challenges were more on the ESP32 side. Like getting audio to work smoothly with Opus and the audio timing challenges. This is one of the reasons I open-sourced.

It seems pointless to think that everyone should cross that C++/Audio barrier to make something cool. Using this cuts a lot of dev time and brings products out to market wayy quicker. The repo basically helps launch your AI toy brand

akadeb · 2025-04-23T15:42:23 1745422943

The emojis are all AI. The content is a mix of me n cursor and I added the mermaid chart to make it easier to visualize the system diagram.

The circuit diagram in on figma

And demo video edited on capcut

johnisgood · 2025-04-23T17:21:06 1745428866

It is fine. I use LLMs to generate stuff, too, and it wouldn't have the right content without me, similarly to yours.

Thanks for elaborating!

akadeb · 2025-04-23T15:38:32 1745422712

thank you stavros!