Abstract: Recent large language models (LLMs), driven by the scaling law, have demonstrated remarkable performance in various machine learning tasks by significantly increasing model size. However, ...