Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

As long as it can do tool calling (which it seems to be doing OK with in the first ~30% of the context), the cut off date is less important. Maybe they didn't share it because it's less relevant today?


No, I wasn't referring to the cut-off date, I was referring to the fact that this is a fine-tune on top of an older Llama model. All the PR makes it sound like this is a foundational model (pretrained from scratch etc.).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: