Large language architectures (LLMs) have achieved remarkable performances in various natural language processing tasks. Scientific text summarization is a particularly challenging task due to the jargony nature of scientific documents. Evaluating LLMs on this particular task requires thoroughly formulated benchmarks and evaluation criteria. more