This is a short explanation of the concept of "grounding" large language models. Grounding is important for reducing AI hallucinations.
I need a place to point to, so let's talk about grounding in generative AI. Plain old generative AI makes predictions about which concepts are likely to follow each other given a prompt, and then gives the prompter the most likely string of concepts for that prompt. A is followed by B, which is followed by C. Based on the training data, it's unlikely that the next in line will be E; rather, it's likely to be D. Thus you get "A, B, C, D" for a prompt like "what are the first 4 letters of the Latin alphabet." The model doesn't "know" the alphabet; it just knows that there's, say, a 98.7% chance that D is the letter following C, and only a 0.7% chance that it's E.

If the likelihoods of D and E following C are close to each other, the LLM might hallucinate and tell you the first four letters of the Latin alphabet are A, B, C, E. This can be caused by a bunch of things, but it's usually due to ambiguity in, or staleness of, the training data. You can ramp up the quality of your training data to solve this, but usually that's not feasible for one reason or another.

However, there's a quick hack (?) to decrease the chance of hallucinations: grounding. We could add another data source to our system, say an elementary school textbook that actually contains the whole Latin alphabet, and essentially verify the output of the LLM against that source. Now the output of the LLM is grounded.
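The alphabet example can be sketched in a few lines of Python. This is a toy model, not a real LLM: the probabilities and the "textbook" source are made up for illustration. The ungrounded model just picks the highest-probability next token; the grounded version checks that pick against an external source and corrects it when they disagree.

```python
# Toy sketch of grounding (all names and numbers are hypothetical).

# Imagine ambiguous training data left D and E nearly tied after "A, B, C",
# with E slightly ahead -- the setup for a hallucination.
next_token_probs = {"D": 0.49, "E": 0.51}

def ungrounded_next(probs):
    """Plain generative AI: return the most likely next token."""
    return max(probs, key=probs.get)

# The extra data source: an "elementary school textbook" that actually
# contains the whole Latin alphabet.
TEXTBOOK = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"

def grounded_next(context, probs):
    """Verify the model's pick against the textbook; prefer the source."""
    candidate = ungrounded_next(probs)
    truth = TEXTBOOK[len(context)]  # the letter the textbook says comes next
    return candidate if candidate == truth else truth

context = "ABC"
print(ungrounded_next(next_token_probs))          # hallucinates: E
print(grounded_next(context, next_token_probs))   # grounded answer: D
```

Real grounding pipelines are fancier (retrieval, citation, confidence thresholds), but the shape is the same: generate first, then check the generation against a trusted source before handing it to the user.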