Large language models (LLMs) such as ChatGPT, Claude Cowork and GitHub Copilot have revolutionised the way individuals and organizations interact with artificial intelligence for content generation, ...
Encoding is a process of transforming the data into different parameters to enhance its compatibility, usefulness, and to transmit it through various systems and applications. Therefore, the main ...
Abstract: In this paper, we introduce an Optimized Byte Pair Encoding (OBPE) tokenizer where the algorithm is optimized for the South African languages, including Sesotho, Setswana, Xhosa, Xitsonga, ...
The proposed work addresses the imperative of safeguarding sensitive information through a dual-security approach of encryption and steganography. Employing the Advanced Encryption Standard (AES) for ...
In this work, we propose a novel view that treats inducing temporal action abstractions as a sequence compression problem. To do so, we bring a subtle but critical component of LLM training pipelines ...
Abstract: Payload classification is a kind of deep packet inspection model that has been proved effective for many Internet applications such as, but not limited to, intrusion detection and network ...
Synthetic DNA is rapidly emerging as a durable, high-density information storage platform. A major challenge for DNA-based information encoding strategies is the high rate of errors that arise during ...