I am always very impressed when I see these demos and how much can be done with so little. If you are like me, you just jumped to YouTube[1] to see it in action.
While trying to make my significant other understand what was happening, I wanted to run it myself. I was amazed how simple that was!
- Install the assembler[2]
- Install dosbox[3]
- Get the source[4] and put it into c:\temp\demo\memories.asm
- Open a command prompt and enter:
cd c:\temp\demo
nasm.exe memories.asm -fbin -o memories.com
- Start dosbox and enter:
mount d c:\temp\demo
d:
dir
memories
- Press [ALT][ENTER] for fullscreen
The DOSBox config is not optimized, but it runs with sound using the default settings!
For me this is somehow much more impressive than simply watching the video.
PC 64k is the main size-constrained demo format, where people do seriously impressive things. 256b is the masochists' category, where doing anything at all is hard. 4k intros are in between.
It's interesting that the demo scene is very Windows/DOS focused, unlike other hacker scenes. Linux or Mac demos are basically not a thing. You're far more likely to see C64 or Amiga demos.
Well, I guess it makes sense after all, because in DOS you kinda have an "API": using the default interrupts you can select video modes, the video memory is mapped at a fixed offset, and so forth. In Linux, due to API fragmentation, it would be hard to agree on something that keeps working in the future, and even now more boilerplate setup code would likely be needed.
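For illustration, that whole DOS "API" for getting pixels on screen boils down to something like this in NASM (a minimal sketch of the usual mode 13h setup, not code from the demo):

    org 0x100               ; .COM program, loaded at CS:0x100
        mov ax, 0x0013      ; BIOS int 10h: set mode 13h (320x200, 256 colors)
        int 0x10
        push 0xA000         ; the framebuffer always sits at segment 0xA000
        pop es
        mov byte [es:0], 15 ; write one white pixel in the top-left corner
        xor ah, ah
        int 0x16            ; BIOS int 16h: wait for a keypress
        ret                 ; back to DOS

Assemble it with "nasm -fbin" exactly as in the steps above and you have a working graphics program in a handful of bytes.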
Agreed. Just as a heads up regarding "future safety", the guys at NVIDIA - for now - seem to keep the door open for Dos/Bios with very high possible resolutions https://www.pouet.net/prod.php?which=63522#c858522 and even without the need for going the VESA way. Nothing i would really rely on in a business case ;) but neat anyway. (Mode List for several current GPUs : https://www.pouet.net/topic.php?which=11672&page=1 )
(author here) All right, but at least the 1k category for Mac/Intel had some cool productions recently. https://www.pouet.net/prodlist.php?type%5B0%5D=1k&platform%5... Not my field exactly, but it seemed like the boilerplate was a bit shorter than for other platforms, so there was decent interest in it.
> In 320x200 mode, instead of constructing X and Y from the screen pointer DI with DIV, you can get a decent estimation by multiplying the screen pointer with 0xCCCD and reading X and Y from the 8-bit registers DH (+DL as 16-bit value) and DL (+AH as 16-bit value). The idea is to interpret DI as a kind of 16-bit float in the range [0,1], from start to end. Multiplying this number in [0,1] with 65536 / 320 = 204.8 results in the row before the comma and, again as a kind of float, the column after the comma. The representation 0xCCCD is the nearest rounding of 204.8 * 256 (= 52428.8 ≈ 52429 = 0xCCCD). As long as the 16-bit representations are used, there is no precision loss.
Taking the top byte is equivalent to dividing by 0x1000000. So that gives you Y. The next lower (third) byte is then (x * 0xcccd / 0x10000) == (x * 52429 / 65536) =~ (x * 256/320). And the lower two bytes are noise.
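In NASM the whole trick is just two instructions (a sketch, assuming DI holds the current mode 13h screen offset):

        mov ax, 0xCCCD
        mul di              ; DX:AX = DI * 0xCCCD
        ; now DH = Y (0..199) and DL ~ X * 256/320 (X rescaled to 0..255);
        ; AH holds further fractional bits, AL is mostly noise

Compare that to the "honest" version, which needs a DIV by 320 and costs more bytes and cycles.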
(author here)
you're right (about it being confusing), i wasn't expecting more than a few people to actually read this ;) at least i quickly repaired the float/fixed thing.
Trying, and not succeeding, to keep the language simple, as evidenced by the sibling comment by pjc50. Fixed-point numbers are at least as approachable as floating-point numbers, in my opinion.
(author here) I didn't know about this fast inverse trick, but i find it VERY funny that in "Memories" i use almost the same technique to create the "ocean" effect ;) http://www.sizecoding.org/wiki/Memories#Ocean_night_to_day_2 Maybe i was inverse square rooting all the time without knowing it ^^
No, go fire up DOSBox and watch it in realtime! Watching demos on YouTube takes away a lot from the spirit of what makes productions like this one special.
There's something beautiful about the fact that the video is FAR larger in size than the program that initially generated the output. Almost worth watching for that fact alone.
I’ve pondered before the idea of a video codec that works like RAR, where the video embeds an arbitrary user-specified virtual machine that can be used to decode the video frames. (How is this not just a program binary? Because it still would have the semantics of a video stream, with no random access to frame data, only tape-head-like access.)
Seems like this would be perfect for videos that are just e.g. gameplay of games made of tiles+sprites: the video could just store one copy of the assets, and the frames could just be tile maps + sprite position information.
It would also work well for “videos” that are really just a single static image. Or videos that are visualizations of the audio stream: the VM could actually take the audio frames as input and output the respective video frame.
Sure, but the various machinima/demo file-formats of games are just application state-data formats, not video formats per se. The difference comes in what can decode them.
An application state-data format can only be decoded by the original application, because necessary context—in this case, the game engine that translates user input to game-state and then to displayed frames, and also the library of visual assets the game uses to render those frames—is in the application, rather than in the video.
A video format is self-contained, and usually not domain-specific. Many encoders and many decoders can be written to target a video format, and the decoders should not have to ship with an asset library (let alone a game engine) in order to properly render specific videos.
A format like I'm talking about—one that doesn't know anything about application state, but does understand that it's compositing and placing a set of embedded assets each frame, rather than only knowing about pixels/gradels—seems like something generically useful to me. (Heck, we're close to support for such a format already, since many video players already understand the idea of compositing arbitrary stuff with placement instructions on the screen each frame, care of support for the https://en.wikipedia.org/wiki/SubStation_Alpha subtitle format. That format is exactly the kind of "vector video" I'm talking about, except the only primitives it can position and style are text elements. Add RGBA-textured rectangles as another primitive type to it, and you'd get a video format!)
And yes, I'm basically talking about the visual equivalent of a https://en.wikipedia.org/wiki/Module_file (embedded samples/synth patches + sequencing information); or, if you prefer another analogy, "what Flash movies are if you exclude the ability to execute ActionScript."
It always stings when I pull a website/app through all the optimizers and compression algorithms, and then the content people fuck it all up by adding 10MB of images :/.
If I had experienced this on my dad's humble little C64 back in the '80s, I think I would have passed out. That music is incredible, especially considering how concisely it is stored.
There is a JavaScript implementation of this parallax checkerboards effect with just 140 characters of code, including 3D animated perspective:
https://www.dwitter.net/top/all
On that page you can also find an implementation of Pouet's tunnel effect.
I immediately thought of the dwitter crowd when seeing the video; I think I recognized several patterns. It seems logical that, when chasing the smallest size, everyone ends up using the same classes of functions to generate maximal impact with minimal bytes.
Can anyone describe at a high level for a complete noob how this kind of thing works? Someone who is not going to be able to read a bunch of ASM and interpret it? I'm guessing that it is something along the lines of:
- the graphics "driver" reads values out of certain registers (AL and AH?) at a set interrupt (maybe every X clock cycles?) and writes one pixel to the screen of whatever color those registers had in them
- by writing values into those registers and aligning the number of operations the program does with the frequency of the interrupts, you can get animation?
Even achieving any sort of flow control so you can switch between the effects is mind-boggling to me.
It gets much simpler when you realise that in the original PC there's no "driver" in the way but bits of hardware are wired directly to various processor buses.
This is sixteen-bit assembly, so you have the famous 640kb of RAM available to the user and a 64k chunk of video memory beyond that (see "0xa000" in the program). The graphics hardware is continuously rendering frames out of there at 320x200, one pixel per byte, using the default system palette.
The rendering is rather like a pixel shader. There is a big for loop over all the pixels, and at each point it computes a pixel value. First it decides which frame number it is on (stored in BP register I think), then calls an "effect" for that pixel.
It then jumps three pixels. This gives that nice "dissolve" transition between effects.
The keyboard controller is wired directly to the bus, so you can read the keyboard with a single instruction.
A MIDI controller is wired directly to I/O port 0x330 (not standard equipment; back in the day this required a Roland card or a SoundBlaster 32?), so you can just write MIDI bytes to that.
There is a system timer interrupt configured for the music. The graphics appear to just run continuously; I can't see a link to the timer or vertical sync in the graphics code.
(author here)
The "three pixel jump" is just for the looks, and it smoothes the animation for more calculation heavy effects (f.e. raycast tunnel). The transition effect is not bound to this, it is rather using the "noise" (as you described it) from the coordinate calculation to offset the time (desribed in the writeup). The graphic output is linked to the timer via register BP, which is modified in the interrupt routine.
> - the graphics "driver" reads values out of certain registers (AL and AH?) at a set interrupt (maybe every X clock cycles?) and writes one pixel to the screen of whatever color those registers had in them
It's actually much simpler than that. After you set the right graphics mode (which for most simple DOS demos is usually mode 13h, 256 colors at 320x200), there's an area of memory that you can write to, and each byte shows up as a pixel.
The "flow control" is usually just that you run your effect n times in these simple demos. Which means it will run faster on a faster CPU, but you usually wouldn't bother implement any form of timing in 256 byte.
MS-DOS programming was overall a pain in the... byte, but what I miss most about it is the simplicity of graphics.
Wanna draw? Just write to memory. Setting a mode was a single interrupt call.
(Wanna play sound? Fumble with two levels of IRQ controllers and a DMA controller, then sob uncontrollably. Or use Allegro. Wanna do multithreading? What's that?)
That one is really amazing! I still don't understand how this didn't win the "Meteoriks" award (my "hypnoteye" did https://www.pouet.net/awards.php#2015tiny-intro )
Sadly, Baudsurfer has not been "around" for quite a while now ...
The best demo I've found, also 256 bytes, is Pyrit by Řrřola (Jan Kadlec, a Czech developer). It's frankly incredible, something I wouldn't have believed was possible:
How do you handle time with such small code size? I see a timer interrupt for the music, but what about the animation? Is it dependent on the speed of the underlying CPU?
It depends on the speed of the CPU - if you look at the archive at https://www.pouet.net/prod.php?which=85227, you'll find a DOSBox config specifically for this demo. If you run it in DOSBox you can fiddle with the emulation speed by pressing Ctrl-F11 and Ctrl-F12, and you'll notice the speed of the animation change.
Later: your question made me wonder what the performance of the virtual 'target CPU' is - the 'cycles' setting in the config is 20000, and there's a rough estimate of what these numbers translate to here
(author here)
"you'll notice the speed of the animation change"
that might be, but the demo is designed to run at equal speed on all systems (it hooks into the timer)
if you experience animation speed differences, that means your system cannot handle what DOSBox (on high cycles) demands. It should be noted that DOSBox is far slower than people expect it to be, and also that in actual competitions in the demoscene, real modern hardware is booted into FreeDOS, but has no sound. So if you want sound (with MIDI) in a competition, you have to stick to the rather slow DOSBox, and even optimize against an emulator, which can be really really weird. (https://www.pouet.net/topic.php?which=11881) I wouldn't claim the demo runs fine on a real 486, but a Pentium should do, as a variation of the raycast tunnel part indicates (https://www.youtube.com/watch?v=5_3CU6shKlY)
> How do you handle time with such small code size? I see a timer interrupt for the music, but what about the animation? Is it dependent on the speed of the underlying CPU?
Usually you don't "handle" it in very small demos of the 256b/64b kind; you just run your effect. And yes, this means the speed will depend on the CPU.
There is an OS-provided timer interrupt that you can hook into, making sure your interrupt handler is called every X time units. I think it was something like 18 times per second by default, but changeable.
I used this to slow down my computer so that old games were playable. Hooked into the interrupt, wasted cycles, and could enjoy the game. :)
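For the curious, hooking that timer from a .COM program looks something like this (an untested sketch of the common pattern, not the intro's actual code; note that speeding up the PIT like this also makes the DOS clock run fast, and a real program would save and restore the old vector):

        mov ax, 0x251C      ; DOS int 21h, AH=25h: set interrupt vector 1Ch
        mov dx, tick        ; DS:DX -> new handler (DS = CS in a .COM)
        int 0x21
        mov al, 0x36        ; PIT channel 0: lobyte/hibyte, square wave
        out 0x43, al
        mov ax, 0x8000      ; divisor 32768: 1193182 / 32768 ~ 36 ticks/s
        out 0x40, al        ; (the default ~18.2/s uses divisor 65536)
        mov al, ah
        out 0x40, al
    main:
        jmp main            ; the effect would run here, reading [frames]
    tick:
        inc word [cs:frames] ; called on every timer tick by the BIOS
        iret
    frames: dw 0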
(author here)
it doesn't.
it sets the timer to about 35 FPS and installs a callback routine that is called repeatedly as an interrupt. Smoothing is rather done implicitly by "triple diagonal interlacing".
I didn't know about sizecoding.org; it seems to be a very valuable and interesting resource explaining the "black art" of tiny demos, thanks! I haven't checked all the pages yet, but the "Memories" entry in particular is very well written and explained.
Since these are so small I don't see why we couldn't have a "demoscene launcher" with a "mailto:" style protocol handler and just let people click on base64 encoded links to start the demo.
Then I agree completely, and must have misunderstood what was proposed by a handler. Typically a handler will launch an external application such as mailto, ftp, magnet, etc.
If we want to run code in the browser, there is WASM.
So is the proposal that it would be beneficial to have a DOS-like OS or x86 emulator in WASM for running COM files?
Yes, that would be better and more sandboxed than dosbox running outside the browser.
(author here) I didn't know about that one. I use http://twt86.co/ (no music there either, sadly). On both websites the performance is rather bad, but that is something that time will solve for us :D
I tried something similar with a 4K C64 demo recently in my emulator, basically percent-encoding the program to run right into the URL instead of hosting it somewhere. It works, but only up to about 2.5 KBytes (good enough for 256 byte demos though).
Fun to watch this. I like seeing the odd demo pop up on HN once in a while, and that code/techniques breakdown is incredible. Back in the day we rarely had that kind of insight into the mastery that went into a demo unless we really got into a discussion with the creator about the code.
I've always thought the demo scene looked cool. Problem is, I don't really care about graphics and sound and am not an especially creative person. Are there competitions that are purely objective? As in, the judging criteria are quantitative?
You can always challenge yourself to create "something" in 32 bytes or 16 bytes. That is so small that sounds and graphics are rather abstract, and it comes down to: does it produce something non-random and noticeable? For example, here is a paint program in 16 bytes: https://www.pouet.net/prod.php?which=62025 (the objective would be: create a program that allows painting on a canvas with the mouse)
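To give a feel for what such a paint program involves before any size optimization, here is a rough, hypothetical sketch using the DOS mouse driver's int 33h services (not the 16-byte production above, which is far more clever):

    org 0x100
        mov ax, 0x0013      ; mode 13h (320x200, 256 colors)
        int 0x10
        push 0xA000
        pop es
        xor ax, ax
        int 0x33            ; int 33h, AX=0: reset/init the mouse driver
    paint:
        mov ax, 3           ; int 33h, AX=3: get position and buttons
        int 0x33            ; -> BX = buttons, CX = X (0..639), DX = Y
        test bl, 1          ; left button held?
        jz paint
        shr cx, 1           ; the driver usually reports X doubled in mode 13h
        mov ax, 320
        mul dx              ; AX = Y * 320
        add ax, cx          ; AX = Y * 320 + X
        mov di, ax
        mov byte [es:di], 15 ; plot a white pixel
        jmp paint

Getting from a screenful like that down to 16 bytes is where the black art starts.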
There have been ASM competitions with clear objectives in the past (http://www.hugi.scene.org/compo/), but these are long gone and seem to have been replaced by something like https://codegolf.stackexchange.com/ now.
The Advent of Code challenges are really interesting as well, you can often code golf those by a lot. I never get that far though :/.
But what I did last year was visualize one of the assignments. The assignment was about overlapping areas on a field; the naive solution (for me) was to create an x-by-y bitmap and just add up the overlaps, which could then easily be converted into a visible image. That helped me visualize the problem and my solution.
256 bytes is in the "let's try every combination" range, I think. So, write a program that tries all of them and determines if any do something interesting enough to forward to a human for review.
8^512: hello from the other side of the quantum dimension
8^8 sounds interesting. 16 million reboots of a real {PC,C64,ST,Amiga,Mac,Z80,...} sounds like a collectively highly entertaining kind of hilarious. The issues only begin when you start wondering if any of the programs wedges the hardware into "interesting" states that are preserved across reboots - or at least the what if of that dimension of entropy... then the problem space becomes 8^8^8...
I decided to compute 8^8^8. The result is apparently 15 million digits long. (`echo 8^8^8 | bc -ql | wc` -> `222814 222814 15596963`)
That's a weird "reference". Why 8 as the base? Nobody works with 3-bit bytes. 8^8 == 2^24 == 2^(8*3) == 256^3, i.e. the combination space of three bytes.
The smallest category on pouet, for reference, is 32b (or 256 bits), so 2^256 combinations to brute force. For comparison, 128-bit encryption is usually considered "safe" and infeasible to brute force.
You might be able to constrain the search space to only valid IA-32 instructions, but realistically I don't see it helping that much.
You could further constrain it to exclude a lot of instructions and instruction pairs that make no sense given the context, e.g. any pair where the second instruction makes the first one redundant, such as the second instruction clobbering the same register the first one modified. Or a "ret" in the first few instructions...
It'd probably not constrain the search space nearly enough though.
But even if it did and you'd somehow manage to even generate every combination, you'd still face the second problem of how to evaluate if they do something "interesting enough" to be worthwhile reviewing.
My original comment ran the numbers against the OP's "wouldn't that be 256^256?", but I got tripped up by the reply refuting that and saying it was 8^256 instead.
For as-yet unknown reasons my brain has always had a hard time mapping between the real world and the mathematical vacuum, so it was honestly less stressful to risk trusting that comment than try and [figure out how to] figure it out on my own. So I just substituted calculations for 8^n.
Hopefully I can figure out those mapping problems one day. I think neurological damage may be involved, or something - I had to resort to button-mashing on my calculator while trying to figure out how many vegetables I could buy for $X given that they were $Y/kg one day at the supermarket. I'm 29. </rant>
I think of it just slightly differently -- 256 bytes, 2048 bits -- so 2^2048 (same result as your 256^256).
To give people a sense of scale (for those who don't spend time with these numbers all the time; I do because of cryptography): 2^256 is on the order of the number of atoms in the entire universe: every star, moon, comet, black hole, galaxy, etc. across the entire known universe.
Now consider this: 2^512. Take every single atom in the universe, and imagine that each atom contains a universe of atoms. Congratulations, you're only at 2^512.
(author here)
it is not, as others explained.
But if you're interested in bruteforcing, you can try to find a short code for the 7-byte (yes, seven bytes) version of my program "m8trix" (https://www.pouet.net/prod.php?which=63126, in the comments), that should be a tad easier ;)
Yes, that would give some peace of mind, right? Unfortunately for us, that's not the case. The only platform-specific code is the 8 instructions on top of "Code of framework" on http://www.sizecoding.org/wiki/Memories
First to set the video mode, and then to set up a timer used to progress time.
A common complaint about tiny demos. Here the OS is only used for setting the graphics mode and setting up a timer. Plus all the boot code, of course. Not much really; with 512 bytes you could probably do it on the bare metal, if someone hasn't already.
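For reference, the bare-metal route is mostly a matter of packaging (a hypothetical skeleton, assuming the standard BIOS boot protocol; not an existing production):

    org 0x7C00              ; the BIOS loads the boot sector here and jumps in
    bits 16
        mov ax, 0x0013      ; the BIOS is still around, so mode 13h works as in DOS
        int 0x10
        push 0xA000
        pop es
    effect:
        ; ... demo goes here, writing pixels via ES ...
        jmp effect
    times 510 - ($ - $$) db 0   ; pad to 510 bytes
    dw 0xAA55               ; boot signature in the last two bytes

No DOS services, though: no int 21h, and you set up your own timer if you want one.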
There is even more debate in 4k. After all, most rely on graphics drivers that take hundreds of megabytes. But the thing to understand is that, in any case, the intro ships all the code that produces the sound and image. The OS is just an abstraction layer. The exception would be fonts and MIDI instruments, which can be stored in the hardware or OS.
But not all intros have text; "Memories" doesn't. And many intros do their own sound synthesis, though in a 256-byte PC intro you are usually limited to MIDI or that horrible buzzer.
(author here) Not quite, a 256-byte PC intro CAN have decent non-MIDI music, as showcased here: https://www.pouet.net/prod.php?which=79281 (won the "outstanding technical achievement" award). I did some intros in 32 bytes and 16 bytes using that "horrible buzzer"; looks like some ASCII effects and a Dutch gabber bassline is the maximum you can get in this category :D ( https://www.pouet.net/prod.php?which=76093)
That's why I said "usually", the moment someone says something is impossible, someone does it ;)
Anyways, great job. I was there during the compo, it was epic, with everyone double checking the executable size, even the old guys who have seen it all. You got my vote BTW.
#truckbreaker ;) yes, the overall reception was overwhelming, i didn't expect that. i completely agree with your post before, just wanted to point to "ikubun" =)
(author here)
interesting guess, but wrong, as others explained.
for maximum purity you can try to NOT call any DOS functions or interrupts. i gave that a try in the production "noint10h" ;)
https://www.pouet.net/prod.php?which=80769
Normally these demos are filled with all kinds of 'tricks' to make things smaller.
Things like self-modifying code, using bits of the BIOS or video ROM in ways they weren't intended by jumping into the middle of them, saving space by using code as data or vice versa, tiny unpackers which decompress the code at runtime, massive pregenerated buffers used for runtime lookups that generate data in one order but consume it in another, etc.
There's quite a few 'tricks' in the small bits of asm in the article.
Also, the tiny unpackers are generally used from 4096b and upwards. The size of the unpacker takes too much space and doesn't make up for the compression ratio at 256b.
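To make the "code as data" trick concrete, here is a hypothetical toy (not from any real intro) that displays its own machine-code bytes as pixel colors:

    org 0x100
        mov ax, 0x0013      ; mode 13h
        int 0x10
        push 0xA000
        pop es
        mov si, 0x100       ; DS = CS in a .COM, so SI points at our own code
        xor di, di
    copy:
        lodsb               ; AL = next byte of our own instructions
        stosb               ; ... reused as a color and written to the screen
        cmp di, 320*200     ; (past the end of the code it just reads
        jb copy             ;  whatever garbage follows in memory)
        xor ah, ah
        int 0x16            ; wait for a key, then exit
        ret

In real 256b intros the same bytes genuinely do double duty as both instructions and lookup data, which is much harder to pull off.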
(author here)
Well, he is not entirely wrong. There is "m8trix", an 8-byte program (later optimized to seven bytes), that:
- jumps into the middle of its own instructions
- uses 2 of those 7 bytes again as DATA
- uses the FLAGS register content as COLOR
See : http://www.sizecoding.org/wiki/M8trix_8b
But all that doesn't really cut the space down in something as "big" as 256 bytes, it's the approach and the algorithms that do =)
[1] https://www.youtube.com/watch?v=Imquk_3oFf4
[2] https://nasm.us/
[3] https://www.dosbox.com/
[4] http://www.sizecoding.org/wiki/Memories#Original_release_cod...