Open in app

Sign In

Write

Sign In

Skanda Vivek
Skanda Vivek

2.4K Followers

Home

Lists

About

Pinned

Deploying Open-Source LLMs As APIs

Open-source LLMs are all the rage, along with concerns about data privacy with closed-source LLM APIs. This tutorial goes through how to deploy your own open-source LLM API Using Hugging Face + AWS — While ChatGPT and GPT-4 have taken the world of AI by storm in the last half year, open-source models are catching up — slowly but surely. And there has been a lot of ground to cover, to reach OpenAI model performance. …

AI

7 min read

Deploying Open-Source LLMs As APIs
Deploying Open-Source LLMs As APIs
AI

7 min read


Pinned

Hello and Welcome!

Who I am, why I write, and why you might be interested — First of all — thank you so much for reading this post! While I’ve been writing on Medium for a couple of years now, I haven’t yet gotten to introducing myself on Medium. I’m a data scientist but my journey has not been typical. I started off getting my PhD…

Introduction

7 min read

About Me — Hello and Welcome!
About Me — Hello and Welcome!
Introduction

7 min read


Published in

Towards Data Science

·Pinned

Build Industry-Specific LLMs Using Retrieval Augmented Generation

Organizations are in a race to adopt Large Language Models. Let’s dive into how you can build industry-specific LLMs Through RAG — Companies stand to gain a lot of productivity improvements through LLMs like ChatGPT. But try asking ChatGPT “what is the current inflation in the U.S.” and it gives: I apologize for the confusion, but as an AI language model, I don’t have real-time data or browsing capabilities. My responses are…

Data Science

10 min read

Build Industry-Specific LLMs Using Retrieval Augmented Generation
Build Industry-Specific LLMs Using Retrieval Augmented Generation
Data Science

10 min read


Published in

Towards Data Science

·Pinned

Fine-Tune Transformer Models For Question Answering On Custom Data

A tutorial on fine-tuning the Hugging Face RoBERTa QA Model on custom data and obtaining significant performance boosts — Question Answering and Transformers BERT is a transformer model that took the world by storm in 2019. BERT was trained on unlabeled data by masking words and training the model to predict these masked words based on context. BERT was later fine-tuned on multiple tasks and achieved state of the art performance on many…

Data Science

5 min read

Fine-Tune Transformer Models For Question Answering On Custom Data
Fine-Tune Transformer Models For Question Answering On Custom Data
Data Science

5 min read


Published in

EMAlpha

·Sep 6

How Do You Evaluate Large Language Model Apps — When 99% is just not good enough?

LLMs are fundamentally changing the way practitioners evaluate performance . Let’s look at the recent progress towards evaluating LLMs in production. — Prior to the release of Large Language Models (LLMs) like ChatGPT that do extremely well out of the box on custom use-cases, model evaluations were fairly typical. You split your data into training/test/dev sets — trained your model on the training set, and evaluated performance on the test/dev set. I’m…

AI

9 min read

How Do You Evaluate Large Language Model Apps — When 99% is just not good enough?
How Do You Evaluate Large Language Model Apps — When 99% is just not good enough?
AI

9 min read


Published in

Towards AI

·Aug 9

Build A Custom AI Based ChatBot Using Langchain, Weviate, and Streamlit

A comprehensive guide to building a customized chatbot using Generative AI, a popular vector database, prompt chaining, and UI tools — As multiple organizations are racing to build customized LLMs, a common question I have been asked is — what are the tools out there to streamline this process? In this article, I show you how to build a fully functional application for engaging in conversations through a chatbot built on…

AI

9 min read

Build A Custom AI Based ChatBot Using Langchain, Weviate, and Streamlit
Build A Custom AI Based ChatBot Using Langchain, Weviate, and Streamlit
AI

9 min read


Published in

EMAlpha

·Aug 8

The Economics of Large Language Models

A deep dive into considerations for using and hosting large language models — There have been many new exciting developments in Generative AI over the last couple of months. ChatGPT was released in late 2022 and took the world of AI by storm. In response, industries started inquiring into large language models and how to incorporate them into their business. However, in sensitive…

Technology

7 min read

The Economics of Large Language Models
The Economics of Large Language Models
Technology

7 min read


Published in

Towards Data Science

·Aug 1

4 Crucial Factors for Evaluating Large Language Models in Industry Applications

Every use case is different — depending on customer needs, and industry-specific guidelines. Learn how to make the right LLM choices, using 4 key rubrics — Over the past few months, I’ve had the opportunity to chat with folks from the legal, healthcare, finance, tech, insurance industries on LLM adoption. And each of them comes with unique requirements and challenges. In healthcare, for example — privacy is king. In finance, getting the numbers right is paramount…

Data Science

8 min read

4 Crucial Factors For Evaluating Large Language Models In Industry Applications
4 Crucial Factors For Evaluating Large Language Models In Industry Applications
Data Science

8 min read


Jul 31

(How) Will Generative AI Change Education?

Generative AI has many exciting benefits for the education sector but also poses opportunities for misuse. Here’s how to adapt — emphasizing reasoning, step-by-step thinking, and ethical AI — Key Takeaways Educators must think about whether to try to control or adapt to the new technology to minimize cases of misuse. Suggested solutions such as watermarks and classifiers (that identify AI-generated content) are flawed, and there is no obvious solution to this challenge.

Data Science

7 min read

(How) Will Generative AI Change Education?
(How) Will Generative AI Change Education?
Data Science

7 min read


Jul 21

Deploy LLMs Using Azure ML

A tutorial on how to use the Microsoft Azure ML catalog for deploying LLM endpoints as APIs and a comparison with AWS — I recently wrote an article about deploying LLMs as APIs using AWS — when someone commented that they would like a similar article but using Azure ML instead. So I decided to write this article discussing how to deploy LLM endpoints on Azure ML. I also compare the price, ease…

AI

7 min read

Deploy LLMs Using Azure ML
Deploy LLMs Using Azure ML
AI

7 min read

Skanda Vivek

Skanda Vivek

2.4K Followers

Senior Data Scientist in NLP and advisor

Following
  • Benjamin Cain

    Benjamin Cain

  • Tim Andersen, Ph.D.

    Tim Andersen, Ph.D.

  • Dr Mehmet Yildiz

    Dr Mehmet Yildiz

  • A. S. Deller

    A. S. Deller

  • Sergey Faldin 🇺🇦

    Sergey Faldin 🇺🇦

See all (503)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech

Teams