Function-level code generation with unit tests.
pip install human-eval && generate_samples MODEL && evaluate_functional_correctness samples.jsonl