[Technical] If LLMs are trained on human data, why do they use some words that we rarely do, such as "delve", "tantalizing", "allure", or "mesmerize"?