Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Show HN: What Is Hacker News Working On? (eamag.me)
205 points by eamag 14 hours ago | hide | past | favorite | 44 comments
I tagged all comments from "What Are You Working On?" (like https://news.ycombinator.com/item?id=45561428) posts and built a simple SvelteKit website, hope it's helpful to find people with similar projects. I'm also thinking of adding some analysis of project types over time to see changes in tech




Nice work, thanks for sharing!

Here's my feedback..

1. Please, consider to always include the latest What Are You Working On posts. For instance, this latest one posted 16 hours ago is not included yet (https://news.ycombinator.com/item?id=45869146).

2. Also, it may be better to include ONLY top-level comments and not replies. For instance, the second row appearing on the list says "Thanks for the the kind words.." by @rubansk is a reply and not a root comment which may be of a little value here.

3. For the UI, when I sort by columns "AUTHOR" or "POSTED", the table would take a different size?! It would be nice to keep the size of the table fixed.

Just my two cents..


Personally, I find little value in most of the tags. To me, many of them just say "Coders gonna code."

It would be more interesting if you could tag the projects based on their subject matter: What problem are they working on, what subject area, what change are they hoping to bring to the world?


Author here, thanks for all feedback! Answering some questions:

- I wanted to publish it asap and get feedback, that's why it's a static website with some ui bugs, you can see a json with comments in a static folder so updates have to be with something like GitHub actions or I have to connect a db

- I included replies on purpose because I saw people saying "I'm also working on this, here's my project" and I wanted to index them

- tags are created with an llm and then there is another deduplication pass to combine them. You can imagine some pros and cons of this approach. I also didn't use embeddings with clustering because it didn't really work for another project of mine https://eamag.me/2024/Automated-Paper-Classification#icml-em...

- it flew under radar first time I posted, but shot to a front page after a second chance pool. If you want to contact me, collaborate, delete data or help to improve this website - there are contact options on my blog and an anonymous feedback form in the footer!


I noticed the tags could be improved by implementing some kind of standardization. You may be interested in this post on HN from a month ago: https://news.ycombinator.com/item?id=45571423. The comments also have some useful recommendations.

I just finished working on consulting project that involved tagging corrective and preventative actions (CAPAs) for a lab to help them organize some of their QA efforts. The use of LLMs for tagging free-form text is a common task and I thought it would be fun to experiment with different strategies for improving the consistency of tags. The article above presents a good approach because it's a streaming solution, but does come with drawbacks (more overhead to set up and treats older data differently than new data). Commenters recommend using a batch approach by collecting all the text up front and then using various strategies to cluster and generate tags, then use an LLM--giving it the predefined tags in its prompt. Once you have enough good tags you could train your own smaller model to generate tags. The batch methods have lower overhead, but take more time for tweaking and experimenting for your specific dataset.

For generating embeddings, I used Cohere's v4 embedder. I found that using HDBSCAN for clustering the embeddings of the tags was much more helpful than using K-means. I also learned that training a Pytorch MLP to predict multiple tags was superior in every aspect to training a tree-based model and gives very good precision, but just OK recall due to the difficulty of nailing all the tags right. I also compared gpt-5-mini and claude-haiku-4.5 for generating tags. Gpt-5-mini was much slower, but cheaper and better at generating good tags. Claude-haiku-4.5 was not far behind and was much faster due to the absence of thinking tokens, but much more expensive. The metric I used to compare the LMMs on their raw tagging ability was Scikit-learns's homogeneity_score.


This is great!

1. Does it include comments from ALL previous WAYWO posts?

2. Add navigation by page too. Currently I can press next, but it does not reflect in the URL, which means I can't jump to middle or at the end.

Looks like that paging is artificial and data is loaded once on first load. You could also add how many posts to show on single page.


Ideas:

1. Optionally allow all pages to be shown at once (no pagination).

2. Provide a "download all as JSON" button.

3. Allow the use of an LLM to filter through these comments.


Nice idea!

For quickly skimming the list, I think it would be useful to add a short, standardized summary for each project. These summaries should be easy to generate automatically with any LLM of your choice.

I have a similar tool for organizing the “Who Is Hiring” threads [1] that uses GPT to provide a quick overview. It’s still running on an old model version from 2023, yet it has been working surprisingly well for over two years now with basically zero maintenance.

[1] https://www.hacker-jobs.com/


As a kind of "AI first" take on this, I created a small Jupyter notebook that uses the Nthesis api which shapes, tags, etc. based on LLM instructions. It also has built in RAG which makes it possible to chat with the documents and has some interesting UMAP based visualizations.

https://nthesis.ai/public/hn-who-is-hiring

https://nthesis.ai/public/hn-working-on


> some analysis of project types over time to see changes in tech

Great idea! I'd love to see some stats on the high and low level tags.


I recently left (or rather, was semi-forced out from) reddit. Hackernews does not have crazy moderators or at the least not as many; there is more free-form discussion, which is great.

Still, a few things I like. I used old.reddit.com, but even that seems to be a bit better than the default UI on hacker news. I don't mind the UI, but I found old.reddit.com easier and faster to use (the new reddit UI is just garbage though). It would be nice if hackernews could add more features that are SIMPLE and also simple improves to the layout - again, just very small and careful changes; people dislike any change to their workflows, so hackernews should make these conservatively and only little; and perhaps with a limit per year or every 5 years or so.

Content-wise I have no huge issue, although I'd like more grouping, as some news are interesting, others not so much (to me). Reading just the title is often not enough; some good articles have horrible titles and vice versa.


I have a clone of the old school Reddit up and running. I’m not quite sure where to take it - on one hand I don’t want to open it to a wide audience because I don’t want the burden of moderation, on the other hand Reddit gets increasingly problematic on so many levels and I miss losing out on legitimately good online spaces for some of my hobbies & interests.

The challenge of moderation is a can of worms.

I've thought long and hard about how a new comer could get around the issues but never came up with anything rock solid. SMS / Email is maximum sign up resistance but could help. Also being able to browse before making an account is key for new users but then you're open to bots.

Anyone got some good idears?


I think having a handful of trusted moderators to have a higher level access to nuke posts, or see notices for posts with a high downvote:upvote ratio so they can nuke as an option...

Aside from that, the voting system X uses for community notes is pretty neat and something similar could work as well.

I've spent a bit of time on this as well... wanting to spin up a modern BBS and thinking maybe just a TUI over SSH might be enough to get something interesting without opening up to the heavy spam bots.


How can you be forced out of reddit? You could just make a new username surely?

I was forced out of Reddit by IP-number. I changed the MAC-number and that changed IP-number. Worked for a while, and then they banned half of the city. Which was funny and gained some notoriety at that time.

Just FYI, those "number" things you mention are called "addresses" - IP address, MAC address... I don't know if your use of "number" is an affectation or intentional, but in a crowd of IT geeks it makes you look unsophisticated, so to speak.

I think I fare rather well on sophistication level. I do know 5-7 human languages about as good as this one.

- List of available tags would be even cooler.

- When you select some tags, list can overflow horizontally and first column with tags is only partially visible. Happened to me when selecting "hardware" tag.

- When you select prev/next or tag, list could scroll to top

- It also often picks up irrelevant comments: https://news.ycombinator.com/item?id=45421459


What's the default sorting algorithm?

I just posted recently in the thread so would have expected to see my post somewhere on this site near the front page. Granted, I guess HN already has a new/hot algorithm, so perhaps you didn't want to reinvent the wheel and instead focus on search.


I was planning to do this for every single post ever made in this category with traffic tracking, linkedin profiles of people at work, how much funding have they raised or how much ARR / MRR for bootstrapped projects. how many are open source on github but you beat me to it lol

There's so much duplication, or overlap at least.

Despite being a software dev and more invested than the average person, I don't feel like there are that many websites or applications I use really. But there are the best part of a thousand webapps being developed there. I was mulling over starting a language learning platform, but there seem to be 133 being worked on just by people who've posted to HN to say they're working on one.

No wonder most of us seem to struggle with getting traction.


What is the system behind the distinction between “high-level” and “low-level” tags? For example, why is "artificial intelligence" high level, but "cloud computing" low level? Why is "tool" on both levels for the same entry?

Because AI is an end and cloud computing is a means

This is cool, I don't see my comment (https://news.ycombinator.com/item?id=45869146#45869388) indexed though

Really well done. I'm sure it must have been a joy to use Sveltekit.

How did you determine the tags?

with AI, I guess.

How about adding a column with a count of comments so we can sort and get a rough idea of the most popular posts?

Very cool - thanks!

On mobile I’d prefer to be able to just keep scrolling instead of having to tap ‘next page’, scroll to the top, then scroll down again


This is great!

Couple suggestions:

  - Randomize the home page results
  - Link to random site

good idea! somehow i think @martinraag (0) and @Joeboy (2) should talk

0: https://news.ycombinator.com/item?id=43157252

1: https://news.ycombinator.com/item?id=44704131


>browsing thousands of comments is a pain

s.replace("thousands of comments","HN")


actually, I find the scrolling & browsing here very convenient? And stressless? No adds, no dynamic bullshit loaded, and AgoliaSearch does a quite decent job (while making contrary impression, I agree)

And HN is a website than can be loaded on a smartphone even with superslow connectivitiy :-))


Thank you for this. I did a search filter on my username and have a good memory trace back. I really need it!

Very cool. Would be great to have upvotes for good ideas

Does it show itself :)

Now that would be cool. Imagine infinite recursion: you see this project featured on itself, and you click it and it shows itself there, and so on.

Really nice, and also the search is satisfying snappy!

Looks good, a fun project.

it's interesting.. we can find many ideas here

Will you provide any way for users to request their comments be removed from your site?

I have a new computing architecture hard/soft/comms from the ground up and a new AGI model. New algorithms for GC and defrag. New O(1) sorting algorithm.

really cool!

cool!



Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: