Large language models architectures have revolutionized the field of artificial intelligence, showcasing astonishing capabilities in natural language processing. These complex systems are built upon vast neural networks, composed of millions or even billions of parameters. By training on huge datasets of text and code, these models learn a deep und