As a more general comment, the repo README provides examples that all use gpt2. It would be nice to see at least one example that invokes llama2, since that would reassure the reader that this library can use models that are more modern and interesting.
Inclined to disagree - gpt2 is far more likely to produce gibberish. So if you can force specific outputs on that, it's a good demo that higher-quality models will be even better.
Maybe... but then if I want to use something better, I have to figure out how by myself. I said "at least one example", not "please change all the examples to llama2." I agree with your general point. It would be nice if there were an example of how to use a better model.
Models often have different shapes and requirements, so is it really as simple as changing the string "gpt2" to "llama2-13B-Chat" and it will magically work? If so, that's great, and I wish that were made clear. Unfortunately, that hasn't always been my experience with other libraries.
Yes, any model that you can run on your computer. The library works by changing the way that tokens are sampled from the LLM, and OpenAI does not give you deep enough access into its pipeline to affect that.
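To make that last point concrete, here is a minimal sketch of the idea behind constrained sampling: mask the logits of disallowed tokens to negative infinity before the softmax, so the sample is guaranteed to come from the allowed set. This is an illustrative toy (the function name and the tiny logit vector are made up, not this library's API); it shows why you need access to the raw logits, which a hosted API like OpenAI's does not expose.

```python
import math
import random

def constrained_sample(logits, allowed_ids, rng=None):
    """Sample a token id, restricted to the allowed set.

    Disallowed tokens have their logits masked to -inf, so
    they receive zero probability after the softmax.
    """
    rng = rng or random.Random(0)
    masked = [l if i in allowed_ids else float("-inf")
              for i, l in enumerate(logits)]
    # Numerically stable softmax over the masked logits.
    m = max(masked)
    exps = [math.exp(l - m) for l in masked]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Draw from the resulting distribution.
    r = rng.random()
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r < cum:
            return i
    return len(probs) - 1

# Even though the raw logits overwhelmingly favor token 0,
# constraining to {2, 3} forces one of those to be chosen.
tok = constrained_sample([10.0, 1.0, 0.5, 0.2], {2, 3})
assert tok in {2, 3}
```

With a locally run model you can apply this mask at every decoding step; with the OpenAI API you only ever see the already-sampled text, so there is nothing to mask.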