We present the first survey on Evaluation of large language models! [arxiv] [code]