Blog

GPT-in-a-Box 2.0 is Here With Four Ways to Get Started with GenAI

By Mike Barmonde and Arjoyita Roy

May 21, 2024 | min

Nutanix is thrilled to announce our GPT-in-a-Box 2.0 solution,  a secure, full-stack enterprise AI platform built on web-scale data services to deploy your large language models (LLMs), ML Operations (MLOps), and generative AI (GenAI) apps anywhere – from core to edge to cloud.

GPT-in-a-Box 2.0 will give customers access to a wider range of GenAI models and tools to simplify the top GenAI use cases for the enterprise with a validated AI ecosystem, including with NVIDIA® and Hugging Face™ software.

For you, GPT-in-a-Box 2.0 will solve the hardest question facing many enterprises making their move towards GenAI: Where do I start?

Here are four ways GPT-in-A Box 2.0 can help you get started and level up your enterprise GenAI deployments:

  1. Understand the simplest GenAI use cases and solutions you can deploy today
  2. Create, deploy and manage APIs and LLMs from NVIDIA NIM, Hugging Face, or LLMs you choose to quickly adapt to model trends and changes
  3. Use standard servers, GPUs, and containers for GenAI without deploying a special architecture
  4. Build on standardized data services from edge to cloud for GenAI
GPT-in-a-Box 2.0 Makes GenAI a Snap

1. Understand the simplest GenAI use cases and solutions you can deploy today 

To accelerate GenAI, you need solid, attainable use cases and solutions. GPT-in-a-Box 2.0 defines these through key industry examples with solutions you can start with right now, including new and powerful AI software. 

  • Finance Use Cases
    • Fraud detection
    • Risk assessment
    • Customer service
    • Algorithmic trading
  • Healthcare Use Cases
    • Enhance patient care
    • Streamline diagnostics
    • Develop personalized treatment plans
    • Improve operational efficiency
  • Public Sector Use Cases
    • Streamline administrative processes
    • Enhance decision-making
    • Optimize resource allocation
GenAI Use Cases and Solutions

Other industries will be within the wheelhouse of GPT-in-a-Box 2.0 to deliver AI solutions, and we’ll provide more guidance as those use cases evolve.

Now that we know our use case, how will we deploy it?

Once a use case has been decided on, GPT-in-a-Box 2.0 will enable four key private and secure solutions: Private GPT, GenAI for code, GenAI for content, and AI-assisted document understanding.

  • Private GPT
    A private GenAI chat bot that can help you control data security and privacy.

  • GenAI for code
    GenAI assisted code generation, boosting developer productivity.

  • GenAI for content
    GenAI capabilities for content creation to enhance productivity for marketing and sales teams.

  • AI-Assisted document understanding
    Extract, interpret, and process various types of documents while helping  safeguard intellectual property and sensitive data.

Deploy GenAI Use Cases

But wait, you may ask: What about AI model training?

Simply put, training of AI models at scale is often best suited for public cloud with elastic compute and GPUs where the companies building these models don't want to own the hardware and software CAPEX long-term.

Build and deploy with AI partners

GPT-in-a-Box 2.0 will further accelerate your enterprise GenAI strategy as a hybrid multicloud data platform with the introduction of the new AI partner program that includes new, incredible AI partners who help ensure your use cases can be delivered seamlessly. 

Here are two key partnerships from the AI partner program:

NVIDIA logo

For NVIDIA AI Enterprise customers, GPT-in-a-Box 2.0 will be able to easily deploy NVIDIA NIM, a set of optimized cloud-native microservices for GenAI.

Hugging Face logo

Together with GPT–in–a–Box 2.0, the Nutanix partnership with Hugging Face will fast-track model deployment to easily integrate LLMs and deliver a seamless workflow to search, download, and deploy validated AI LLMs with full support. 

 

2. Create, deploy and manage APIs LLMs from NVIDIA NIM, Hugging Face, or LLMs you choose to quickly adapt to model trends and changes

Accelerating GenAI for the enterprise demands a turnkey approach to download, deploy, and swap LLMs. These operations are called LLMOps (LLM operations, a branch of MLOps) and can also include API management. 

The easier a general IT technologist can manage GenAI business needs, the faster an enterprise can reap the benefits of continued GenAI adoption and differentiation .

GPT-in-a-Box 2.0 will meet this need with brand-new capabilities under development to make deploying GenAI a snap.

1. Create and Manage APIs and LLMs for GenAI

Customers will be able to create, manage, and connect APIs to LLMs for instant GenAI workload access. Platform engineers who are responsible for GenAI workflows won’t have to be LLMOps experts to connect or create APIs for fine-tuned models and apps.

Create and Manage APIs and LLMs for GenAI

2. Deploy NVIDIA NIM

For NVIDIA AI Enterprise customers, GPT-in-a-Box 2.0 will let you easily deploy NVIDIA NIM, a set of optimized cloud-native microservices designed to shorten time-to-market of generative AI models anywhere, from edge to cloud.

NVIDIA Model Catalog

3. Deploy Hugging Face models

Customers will be able to integrate validated LLMs for GenAI directly from Hugging Face to run on  GPT-in-a-Box 2.0.

Hugging Face Model Hub

4. Upload Your Own LLMs

GPT-in-a-Box 2.0 also will allow you to upload and deploy non-validated and unsupported models you choose.

Upload models

3. Use standard servers, GPUs, and containers for GenAI without deploying a special architecture

When a new technology impacts the enterprise, the concern is always: What is the lift to implement it? GenAI is no different. In fact, GenAI brings hardware back to the spotlight with critical compute requirements for high-powered servers with the latest GPUs and CPUs. 

GPT-in-a-Box 2.0 will leverage the Nutanix Cloud Platform under the hood for broad hardware compatibility and choice. 

1. GPUs for AI

Nutanix GPT-in-a-Box 2.0 will use the latest NVIDIA data center GPUs like the L40s and H100 for scalable GenAI needs from edge to private clouds. A GPU loads the AI LLM in its memory, then interfaces with GenAI apps for inference on content, code, document assisting, or fine-tuning as a computer operation.

NVIDIA logo

2. CPUs for AI

Nutanix GPT-in-a-Box 2.0 is developing Intel AMX for AHV and will enable Intel AMX for GenAI workloads. Intel AMX is a great choice for mixed workloads, where AI/ML processing is only one component of larger AI needs. Intel AMX delivers improved efficiency and reduced deployment costs, lowering the total cost of ownership.

Intel logo

3. Latest AI Platforms

GenAI for HCI makes integrating new hardware infrastructure simple, and Nutanix has partnered with most major hardware manufacturers to deliver the best, most dense platform for AI, including Dell, HPE, Lenovo, and Nutanix NX with  the NVIDIA MGX architecture.

4. Build on standardized data services from edge to cloud for GenAI

GPT-in-a-Box 2.0 will be part of the Nutanix Cloud Platform (NCP) whose foundation is based on secure, resilient, scalable, and flexible data from the edge to the public cloud for all your enterprise AI needs. This set of data services on Nutanix for GenAI is delivered via the Nutanix Unified Storage (NUS) solution for files and objects, and the Nutanix Data Services for Kubernetes® (NDK) solution for containers. Read more about these latest innovations from .NEXT, here.

Nutanix Unified Storage

Nutanix Unified Storage can be deployed as a dedicated all-NVMe storage cluster or as an HCI cluster at any location - edge, core, or cloud, with integrated compute, network and storage capabilities bringing GenAI apps closer to their data.

Data service requirements across AI/ML lifecycle

Nutanix Data Services for Kubernetes (NDK)

Nutanix Data Services for Kubernetes pairs with the Nutanix Kubernetes® Platform (NKP) solution which is under development, for management and automation, enabling a holistic approach to platform engineering and DevOps. The combination of Nutanix Kubernetes Platform plus Nutanix Data Services for Kubernetes create a turnkey solution for enterprise-grade private cloud environments that require a cloud-native design that easily extends to public clouds.

Nutanix Data Services for Kubernetes

But that’s not all.

Nutanix Multicloud Snapshot Technology

The Nutanix Multicloud Snapshot Technology feature allows data snapshots to be intelligently offloaded to be written directly to S3-compatible object storage, including hyperscaler offerings such as AWS S3 storage and Microsoft Azure Blob storage, and finally be restored anywhere GPT-in-a-Box 2.0 runs.

Nutanix Multicloud Snapshot Technology

What’s Your Next Move?

GPT-in-a-Box 2.0 will make getting started with enterprise GenAI a snap, allowing you  to deploy real use cases and solutions built on standard hardware without the need for a special architecture. GenAI models change fast, and with Nutanix, you can stay ahead of the curve with a secure, full-stack platform to run GenAI data and apps anywhere.

Nutanix AI Partners

So, what’s your next move?

Want to learn more? Check out Nutanix AI solutions for more information and updates.

About Nutanix

Nutanix is a global leader in cloud software, offering organizations a single platform for running apps and data across clouds. With Nutanix, organizations can reduce complexity and simplify operations, freeing them to focus on their business outcomes. Building on its legacy as the pioneer of HCI, Nutanix is trusted by companies worldwide to power hybrid multicloud environments consistently, simply, and cost-effectively. Learn more at www.nutanix.com or follow us on social media @nutanix.

© 2024 Nutanix, Inc. All rights reserved. Nutanix, the Nutanix logo, and all Nutanix product and service names mentioned herein are registered trademarks or unregistered trademarks of Nutanix, Inc. (“Nutanix”) in the United States and other countries. Kubernetes® is a registered trademark of the Linux Foundation. Other brand names or marks mentioned herein are for identification purposes only and may be the trademarks of their respective holder(s). This blog is for informational purposes only and nothing herein constitutes a warranty or other binding commitment by Nutanix. This blog contains express and implied forward-looking statements, including but not limited to statements regarding our plans and expectations relating to new product features and technology that are under development, the capabilities of such product features and technology and our plans to release product features and technology in the future. Such statements are not historical facts and are instead based on our current expectations, estimates and beliefs. The accuracy of such statements involves risks and uncertainties and depends upon future events, including those that may be beyond our control, and actual results may differ materially and adversely from those anticipated or implied by such statements. Any forward-looking statements included herein speak only as of the date hereof and, except as required by law, we assume no obligation to update or otherwise revise any of such forward-looking statements to reflect subsequent events or circumstances. Any future product or product feature information is intended to outline general product directions, and is not a commitment, promise or legal obligation for Nutanix to deliver any functionality.  This information should not be used when making a purchasing decision.