
"generalize to its dataset" is a contradiction, especially as these models are trained in the one-epoch regime on datasets the scale of the whole internet. If you think being able to generalize in ways similar to the whole of the internet does not give you meaningful abilities to reason, I'm not sure what I can tell you.


> "generalize to its dataset" is a contradiction

Not "to" but "over": for example, the same code written in one language versus another.

> if you think being able to generalize in ways similar to the whole of the internet does not give you meaningful abilities to reason, I'm not sure what I can tell you

If, after reading the papers below, which show empirically that they can't reason, you still think they can, then I don't know what I can tell you.

https://arxiv.org/abs/2311.00871

https://arxiv.org/abs/2309.13638

https://arxiv.org/abs/2311.09247

https://arxiv.org/abs/2305.18654

https://arxiv.org/abs/2309.01809



