Reliable, fully local RAG agents with LLaMA3
With the release of LLaMA3, there is growing interest in reliable local agents (for example, ones that can run on your laptop). Here we show how to build a reliable local agent from scratch using LangGraph and LLaMA3-8b. We combine ideas from three advanced RAG papers (Adaptive RAG, Corrective RAG, and Self-RAG) into a single control flow. It runs fully locally, using a local vector store from @nomic_ai & @trychroma, web search via @tavilyai, and LLaMA3-8b served through @ollama.
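For reference, the stack described above reduces to a few local components. A minimal sketch, assuming the langchain_community / langchain_nomic integration packages; the model tags and collection name are illustrative, not necessarily the repo's exact code:

```python
# Minimal local stack: LLaMA3-8b via Ollama, Chroma + Nomic embeddings,
# Tavily web search. Names here are assumptions, not the repo's exact code.
from langchain_community.chat_models import ChatOllama
from langchain_community.vectorstores import Chroma
from langchain_nomic.embeddings import NomicEmbeddings
from langchain_community.tools.tavily_search import TavilySearchResults

# JSON mode keeps router/grader outputs machine-parseable
llm = ChatOllama(model="llama3", format="json", temperature=0)

# Fully local vector store: Chroma with on-device Nomic embeddings
embeddings = NomicEmbeddings(model="nomic-embed-text-v1.5", inference_mode="local")
vectorstore = Chroma(collection_name="rag-chroma", embedding_function=embeddings)
retriever = vectorstore.as_retriever()

# Web-search fallback (Tavily needs TAVILY_API_KEY in the environment)
web_search_tool = TavilySearchResults(k=3)
```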
This article explains in detail how to build a reliable local agent using LangGraph and LLaMA3-8b, and why that matters. What makes the approach compelling is a control flow that incorporates recent research results such as self-correction and adaptive routing. A link to the code on GitHub is also provided, so interested readers can try it hands-on. A must-read for anyone following the cutting edge of AI.
This is insanely good! I had a similar idea, but as an AI newbie I never implemented it this well.
Don't waste our time with tutorials that don't serve any purpose! Go build an actual end-to-end conversational bot with your LangGraph that takes input, expands it into more queries, sends them to different nodes, asks follow-up questions with the LLM, and generates a result with a grader.
🚀🚀
Can't see the code well, can you make it bigger please?
There are no files in the GitHub repo you linked 😁
Code works flawlessly on a Mac M2, but fails at the vector store indexing step on a Windows PC (i9 processor, 64GB RAM, NVIDIA 4090) with the error: "12:21:14.611 [error] Disposing session as kernel process died ExitCode: 3221225477, Reason: Failed to load llamamodel-mainline-cuda-avxonly.dll: LoadLibraryExW failed with error 0x7e
Failed to load llamamodel-mainline-cuda.dll: LoadLibraryExW failed with error 0x7e"
That's really awesome and very useful! I literally implemented a similar flow today, using another LangGraph use case, but the fallback workflow at the end makes much more sense for increasing answer quality. Thanks, and brilliantly communicated.
That's okay, but can you make the model use a specific role?
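If "role" means a fixed persona, one way is a system message in the prompt template. A minimal sketch, assuming the ChatOllama model from the video's stack; the role wording is made up for illustration:

```python
# Sketch: pinning the local model to a role via a system message.
from langchain_community.chat_models import ChatOllama
from langchain_core.prompts import ChatPromptTemplate

llm = ChatOllama(model="llama3", temperature=0)

prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a concise documentation assistant. Answer only from the provided context."),
    ("human", "Context: {context}\n\nQuestion: {question}"),
])

chain = prompt | llm
print(chain.invoke({"context": "...", "question": "What does the router do?"}).content)
```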
Yes, very useful. Especially running 'reliably' on my local machine (in this case MS Windows with an NVIDIA GPU)!
Thank you, yet again!
How to integrate a knowledge graph to increase accuracy?
Great video. Advanced concepts but simple to understand.
Excellent video! Thank you. Would you know how to handle the case where the agent goes into an infinite loop, e.g. it gets stuck at the hallucination check? I can only think of keeping track of a threshold for the number of checks, and am wondering if there's a more elegant way to do that in LangChain.
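A retry counter carried in the graph state is the usual pattern. A minimal sketch, assuming a state shape loosely like the video's; the retries field and the passes_check helper are additions for illustration, not part of the tutorial:

```python
# Sketch: bounding the hallucination-check loop with a counter in state.
from typing import TypedDict

class GraphState(TypedDict):
    question: str
    generation: str
    retries: int  # added field, not in the original tutorial

def generate(state: GraphState) -> GraphState:
    # ... call the LLM here, then bump the counter so the loop is bounded
    return {**state, "retries": state.get("retries", 0) + 1}

def decide_after_hallucination_check(state: GraphState) -> str:
    if state["retries"] >= 3:      # give up after N attempts
        return "fallback"          # e.g. route to web search instead
    # passes_check is a hypothetical stand-in for the grader call
    return "useful" if passes_check(state) else "regenerate"
```

Separately, LangGraph applies a recursion_limit (default 25) to each invocation and raises GraphRecursionError beyond it, so even without a counter the loop cannot spin forever; the counter just lets you fail gracefully.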
succinct!
Thanks for the amazing video Lance! Very clear explanation, this is really helpful to my work too.
I really like the graphic for the workflow, what tools did you use for that?
I have a question: say I build an agentic RAG application where multiple LLM calls work together (router, grader, generator, hallucination checker, etc.). Is every single LLM call an agent, or is the whole application the agent? (I saw somewhere that agents break a task into multiple tasks.)
Also, is writing the chat prompt template for each LLM call in the application considered prompt engineering?
Amazing video. Now imagine if you could implement self-supervised learning so it checks itself and makes sure the output is delivered without errors and without misuse of resources…
Incredible. Great stuff brotha. Thank you.
Link to the code? Thanks for the video.
Nice, this is great content. I am gonna run it with phi-3. One question:
Can I use a ReAct agent and provide multiple control flows as tools?
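A compiled LangGraph graph is itself a runnable, so each control flow can be wrapped as a tool for a ReAct-style agent. A minimal sketch; `app` stands in for the compiled graph, and the state keys loosely mirror the video's:

```python
# Sketch: exposing a compiled control flow as an agent tool.
from langchain_core.tools import tool

@tool
def local_rag_flow(question: str) -> str:
    """Answer a question using the local RAG control flow."""
    result = app.invoke({"question": question})  # app = compiled LangGraph graph
    return result["generation"]

# Wrap each flow this way and pass the resulting list as the agent's tools.
```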
How important is the chunk size, and what is the best way to set it up?
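Chunk size mainly trades retrieval precision against context per hit, so it is worth validating against your own queries. A minimal sketch of a typical starting point; the numbers are illustrative defaults, not tuned values from the video:

```python
# Sketch: chunking documents before indexing. `docs` is assumed to be a
# list of already-loaded Documents.
from langchain_text_splitters import RecursiveCharacterTextSplitter

splitter = RecursiveCharacterTextSplitter(
    chunk_size=500,    # smaller chunks = more precise hits, less context each
    chunk_overlap=50,  # overlap preserves continuity across boundaries
)
chunks = splitter.split_documents(docs)
```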
A great challenge would be to accurately ascertain whether the model is capable of answering the question/topic itself or whether external tooling such as web browsing is required. I haven't been able to do this yet with llama3; I guess I haven't managed to find the correct routing prompt (a stage after the initial routing).
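One approach is to force the router's hand with a JSON-constrained prompt. A paraphrased sketch in the spirit of the Adaptive-RAG routing stage described above, not the exact prompt from the video:

```python
# Sketch: a JSON-constrained routing prompt for llama3.
from langchain_community.chat_models import ChatOllama
from langchain_core.output_parsers import JsonOutputParser
from langchain_core.prompts import PromptTemplate

llm = ChatOllama(model="llama3", format="json", temperature=0)

router_prompt = PromptTemplate(
    template="""You are an expert at routing a user question to a vectorstore
or web search. Use the vectorstore for topics covered by the indexed documents;
otherwise use web search. Return a JSON object with a single key "datasource"
set to "vectorstore" or "web_search". No preamble or explanation.

Question: {question}""",
    input_variables=["question"],
)

router = router_prompt | llm | JsonOutputParser()
print(router.invoke({"question": "Who won the most recent Super Bowl?"}))
# expected: {'datasource': 'web_search'}
```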