Project Icon

Jlama

Seamless Java Integration with Jlama for Large Language Model Inference

Product Description:Discover Jlama, a modern Java inference engine enabling the integration of Llama, BERT, and GPT-2 models. Features include paged attention, tool calling, distributed inference, and compatibility with the new Vector API for enhanced processing speeds. Easily integrate these capabilities into Java projects using Langchain4j while benefiting from extensive documentation and community support.
Project Details