Follow jakub151
jakub151
GPT-6 Needs ARC Evals
We propose legislation mandating evaluations of SOTA language models to test for dangerous capabilities.
jakub151
The Start of Investigating a 1-Layer SoLU Model
We found a behavior where a toy SoLU model succeeds as well as a few edge cases where it fails.
jakub151