Hacker Newsnew | past | comments | ask | show | jobs | submit | 3idet's favoriteslogin
1.Navigating the World of Large Language Models (bentoml.com)
48 points by sherlockxu on March 22, 2024 | 4 comments
2.Show HN: Let's Build AI (letsbuild.ai)
173 points by aprxi on March 18, 2024 | 47 comments
3.Speech and Language Processing (3rd ed. draft) (stanford.edu)
214 points by yeesian on March 11, 2024 | 32 comments
4.Building an LLM from Scratch: Automatic Differentiation (2023) (bclarkson-code.github.io)
355 points by netwrt on Feb 15, 2024 | 17 comments
5.Show HN: Reor – An AI note-taking app that runs models locally (github.com/reorproject)
411 points by samlhuillier on Feb 14, 2024 | 102 comments
6.World model on million-length video and language with RingAttention (largeworldmodel.github.io)
196 points by GalaxyNova on Feb 14, 2024 | 60 comments
7.Understand how transformers work by demystifying the math behind them (osanseviero.github.io)
470 points by LaserPineapple on Jan 3, 2024 | 137 comments
8.Simulating fluids, fire, and smoke in real-time (andrewkchan.dev)
784 points by ibobev on Dec 19, 2023 | 169 comments
9. [flagged] DeciLM-7B: The Fastest and Most Accurate 7B-Parameter LLM to Date (deci.ai)
84 points by paulenelim on Dec 12, 2023 | 42 comments
10.Gemini AI (deepmind.google)
2135 points by dmotz on Dec 6, 2023 | 1602 comments
11.LLM Visualization (bbycroft.net)
1592 points by plibither8 on Dec 3, 2023 | 131 comments
12.Q-Transformer (qtransformer.github.io)
238 points by jonbaer on Nov 30, 2023 | 63 comments
13.MeshGPT: Generating triangle meshes with decoder-only transformers (nihalsid.github.io)
738 points by jackcook on Nov 28, 2023 | 157 comments
14.Simplifying Transformer Blocks (arxiv.org)
142 points by georgehill on Nov 28, 2023 | 49 comments
15.Prompting Frameworks for Large Language Models: A Survey (arxiv.org)
25 points by dmezzetti on Nov 26, 2023 | 4 comments
16.Experimental tree-based writing interface for GPT-3 (github.com/socketteer)
234 points by pyinstallwoes on Nov 23, 2023 | 34 comments
17.LM Studio – Discover, download, and run local LLMs (lmstudio.ai)
461 points by victormustar on Nov 22, 2023 | 148 comments
18.LLMs by Hallucination Rate (github.com/vectara)
107 points by vincent_s on Nov 16, 2023 | 132 comments
19.Infinite Context LLMs: Going Beyond RAG with Extended Minds (normalcomputing.ai)
145 points by telotortium on Nov 13, 2023 | 42 comments
20.What are Transformer Models and how do they work? [video] (youtube.com)
83 points by luis_likes_math on Nov 6, 2023 | 3 comments
21.01-AI/Yi: A series of large language models trained from scratch (github.com/01-ai)
143 points by simonpure on Nov 6, 2023 | 52 comments
22.Phind Model beats GPT-4 at coding, with GPT-3.5 speed and 16k context (phind.com)
891 points by rushingcreek on Oct 31, 2023 | 347 comments
23.M2 Ultra can run 128 streams of Llama 2 7B in parallel (github.com/ggerganov)
268 points by behnamoh on Oct 11, 2023 | 173 comments
24.LeoLM: German-Language LLM Research (laion.ai)
105 points by doubtfuluser on Sept 29, 2023 | 84 comments
25.I made a transformer to predict a simple sequence manually (vgel.me)
360 points by lukastyrychtr on Sept 22, 2023 | 94 comments
26.Self-supervised learning: The dark matter of intelligence (2021) (meta.com)
164 points by reqo on Sept 18, 2023 | 18 comments
27.OneDiffusion (github.com/bentoml)
39 points by aarnphm on Aug 23, 2023 | 7 comments
28.How Is LLaMa.cpp Possible? (finbarr.ca)
685 points by birriel on Aug 15, 2023 | 227 comments
29.Artificial General Intelligence – A gentle introduction (temple.edu)
282 points by lorepieri on Aug 11, 2023 | 193 comments
30.Llama from scratch, or how to implement a paper without crying (briankitano.com)
513 points by bkitano19 on Aug 9, 2023 | 52 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: