OctoML Launches OctoAI Text Gen Solution
SEATTLE, Nov. 28, 2023 /PRNewswire/ -- OctoML announced today the launch of its OctoAI Text Gen Solution to empower application builders to run and scale applications on their choice of Llama 2 Chat, Code Llama Instruct and Mistral Instruct models—all on one unified API endpoint. The new release offers the fastest fleet of accelerated open source LLMs, including numerous configurations of Llama 2, Mistral, and the unique option to bring your own fine-tuned Llama 2 models. OctoAI's Text Gen Solution, together with the OctoAI Image Gen Solution, now offers a flexible "model-cocktail" alternative to monolithic multi-modal models, enabling developers to build highly composable multi-modal applications.
- With the OctoAI Text Gen Solution, developers can now easily run inference against multiple OSS model families, sizes, and variants, all through one scalable, production-grade API endpoint.
- Flexible "model-cocktail" approach to multi-modal needs: the Text Gen Solution complements OctoAI's recently launched Image Gen Solution and all the models available in the OctoAI compute service, empowering customers to easily build multi-modal applications using their preferred mix of OSS models, as demonstrated in the OctoStudio demo application walkthrough.
- OctoAI Text Gen customers can also bring their own fine-tuned Llama 2 variant or checkpoint and run it at low latency and massive scale.