HTML Becomes Native Data Format for LLMs with AST-as-HTML Technology by LJ
agents
Source: Mastodon
A developer building a document-editing agent reports that large language models write HTML markup more fluently than custom JSON schemas, and proposes HTML as an authoring format for typed trees.
A recent blog post explores using HTML as a native data format for large language models (LLMs). While developing a document-editing agent, the author found that LLMs can be more fluent in HTML markup than in custom JSON schemas. The suggested pattern is to keep storing typed trees (such as an abstract syntax tree, or AST) as JSON, while letting humans and LLMs author them as HTML that is then converted into the stored form.
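The pattern can be illustrated with a minimal sketch in Python, assuming a hypothetical node schema of `type`, `attrs`, and `children`. The blog post's actual schema and tooling are not described here, so the names `TreeBuilder`, `html_to_tree`, and `html_to_json` are illustrative only. The sketch parses an HTML fragment with the standard library's `html.parser` and emits a JSON tree.

```python
import json
from html.parser import HTMLParser

# Elements that never have a closing tag; they must not become parents.
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class TreeBuilder(HTMLParser):
    """Builds a nested dict tree from an HTML fragment.

    The node shape {"type", "attrs", "children"} is an assumption made
    for this example, not the schema used in the original post.
    """

    def __init__(self):
        super().__init__()
        self.root = {"type": "root", "attrs": {}, "children": []}
        self.stack = [self.root]  # currently open elements, innermost last

    def handle_starttag(self, tag, attrs):
        node = {"type": tag, "attrs": dict(attrs), "children": []}
        self.stack[-1]["children"].append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest matching open element; ignore stray end tags.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i]["type"] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        # Surrounding whitespace is dropped, which is lossy for inline text.
        text = data.strip()
        if text:
            self.stack[-1]["children"].append({"type": "text", "value": text})


def html_to_tree(markup):
    """Parse an HTML fragment into the hypothetical typed tree."""
    builder = TreeBuilder()
    builder.feed(markup)
    builder.close()
    return builder.root


def html_to_json(markup):
    """Serialize the parsed tree as JSON for storage."""
    return json.dumps(html_to_tree(markup), indent=2)
```

Calling `html_to_json('<section title="Intro"><p>Hello <b>world</b></p></section>')` returns the stored JSON form of markup that a person or a model could have written by hand. A real system would likely also validate the tree against its schema before saving it.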
This matters because it could simplify how humans and LLMs work on the same structured documents. HTML is widely understood and widely used, so both people and models can read and write it without first learning a bespoke schema.
The idea is still at an early stage. It remains to be seen whether other developers adopt the approach and how well it generalizes beyond the author's document-editing agent; further experimentation is needed before drawing firm conclusions.