Author: R Jemaiel

Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2

TutoSartup excerpt from this article:
In this post, we showcase fine-tuning a Llama 2 model using a Parameter-Efficient Fine-Tuning (PEFT) method and deploy the fine-tuned model on AWS Inferentia2... We then use a large model inference container powered by Deep Java Library (DJLServing) as our model serving solution... Solution overview Efficient Fine-tuning Llama2 using QLoRa The Llama 2 family of large language models (LLMs) is a...

Build an end-to-end MLOps pipeline using Amazon SageMaker Pipelines, GitHub, and GitHub Actions

TutoSartup excerpt from this article:
The built-in project templates provided by Amazon SageMaker include integration with some of third-party tools, such as Jenkins for orchestration and GitHub for source control, and several utilize AWS native CI/CD tools such as AWS CodeCommit, AWS CodePipeline, and AWS CodeBuild... In this post, we show you a step-by-step implementation to achieve the following: Create a custom SageMaker ML...

Strengthening customer third-party due diligence with renewed AWS CyberGRX assessment

TutoSartup excerpt from this article:
Amazon Web Services (AWS) is pleased to announce the successful renewal of the AWS CyberGRX cyber risk assessment report... Many customers use third-party cyber risk management (TPCRM) services such as CyberGRX to better manage risks from their evolving third-party environments and to drive operational efficiencies... To help with such efforts, AWS has completed the CyberGRX assessment of ...

Microsoft and Oracle announce that Oracle Database@Azure is now generally available

TutoSartup excerpt from this article:
This blog is co-authored by Ravi Turlapati, Head of Strategic Products and Growth, Oracle Cloud Infrastructure...The Microsoft and Oracle partnership is focused on giving customers choice and removing the hurdles faced when migrating mission-critical workloads to the public cloud where they can access the rich set of technology needed to accelerate innovation and compete more effectively... In...

AWS for Games updates from re:Invent 2023

TutoSartup excerpt from this article:
During re:Invent 2023, the AWS for Games team showcased the latest ways our customers are using AWS game development tools and introduced several new purpose-built guidance and partner solutions that are now available in the AWS Games Solution Library... Recordings of AWS for Games customer presentations from re:Invent include: Customer Keynote – Riot Games: Riot Games’ Head of Global ...

Create a web UI to interact with LLMs using Amazon SageMaker JumpStart

TutoSartup excerpt from this article:
This post shows you how you can create a web UI, which we call Chat Studio, to start a conversation and interact with foundation models available in Amazon SageMaker JumpStart such as Llama 2, Stable Diffusion, and other models available on Amazon SageMaker... After you deploy this solution, users can get started quickly and experience the capabilities of multiple foundation models in conversatio...

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

TutoSartup excerpt from this article:
” A generative pre-trained transformer (GPT) uses causal autoregressive updates to make prediction... In this post, we’ll summarize training procedure of GPT NeoX on AWS Trainium, a purpose-built machine learning (ML) accelerator optimized for deep learning training...2 M tokens/$) trained such models with AWS Trainium without losing any model quality...9B) are trained on openly available Pile...