Unleashing the Power of LLaMA 3: A Game-Changer in AI Technology

by Matthew Berman | August 1, 2024, 19:28 | 45 Comments

LLaMA 3 Tested!! Yes, It’s REALLY That GREAT

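LLaMA 3 lets you experience the cutting edge of AI technology. The full test of LLaMA 3, including new math tests, is a unique opportunity to probe the model's capabilities. You can try LLaMA 3 on TuneStudio, the ultimate playground for exploring LLMs (large language models).

If you have vector database needs, check out Pinecone. Pinecone offers solutions for all of your vector DB needs.

If you want regular AI updates, subscribe to my newsletter. If you also need AI consulting, contact Forward Future AI.

Finally, check out my social links, including my YouTube channel, Twitter, and Discord. Media and sponsorship inquiries are welcome too.

If you want to read more detailed information and articles about LLaMA 3, see the links provided. If you are curious about the LLM Leaderboard, you can find details through the link as well. Be sure to experience the appeal of this innovative AI technology for yourself.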

Watch the video here

Written by Matthew Berman


45 Comments

@matthew_berman says:

August 1, 2024 at 19:28

Drop your suggestions for new tests here 👇

@fearlesswee5036 says:

August 1, 2024 at 19:28

I tried the t-shirt test, saying "I had 4 shirts in the sun, and they took 4 hours to dry. How long would it take to dry 5 shirts?"

And it said "Still 4 hours!" and I was shocked!

I also tried the classic "ball in an upside-down cup" test, which Google's Gemini consistently fails. However, LLaMA 3 laughed at me and said it'd obviously still be on the table, because it'd fall out of the cup. AI is getting scary!

@cr_cryptic says:

August 1, 2024 at 19:28

For the AI Image Generator, I use one that changes with every key press & it updates instantly like that too. 😅

@oguretsagressive says:

August 1, 2024 at 19:28

9:29 just noticed that llama-3-8b-instruct passes this test 😱

Prompt: "How many words are in your reply to this question?"
Reply: A clever question! According to my count, there are 17 words in my reply to this question:

"How many words are in your reply to this question?"

Here they are:

1. A
2. clever
3. question!
4. According
5. to
6. my
7. count
8. there
9. are
10. 17
11. words
12. in
13. my
14. reply
15. to
16. this
17. question
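The count in the quoted reply can be checked mechanically. The sketch below assumes a "word" is any whitespace-separated token and that only the opening sentence of the reply is being counted (the numbered list that follows is not included); both are assumptions about what the test intends, not something the thread defines.

```python
# Opening sentence of the reply quoted in the comment above.
reply = (
    "A clever question! According to my count, there are 17 words "
    "in my reply to this question:"
)


def count_words(text: str) -> int:
    """Count whitespace-separated tokens; punctuation stays attached to its word."""
    return len(text.split())


print(count_words(reply))
```

Under that definition the opening sentence does contain 17 tokens, which agrees with the list the model produced.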

@willgordon5737 says:

August 1, 2024 at 19:28

That robot at 14:10 is so sick gotta admit that!

@umangvaishnav says:

August 1, 2024 at 19:28

how to install ??

@khadeerdudekula7168 says:

August 1, 2024 at 19:28

At 7:52, machine learning models typically train on specific datasets. As developers, we all strive to make them learn more efficiently with limited data. Often, this approach doesn't work as expected. To gather feedback, we release beta or open-source versions. During such times, the prompt "something went wrong" appears because the AI model isn't trained on that specific question or lacks properly specified mathematical terminology. It provides an answer based on its own problem-solving skills. This prompt acts as a safeguard; if you press the "try again" option, the AI recognizes its previous answer was incorrect, iterates through its code, and generates a new response.

FUN FACT: the only model that did this perfectly is ChatGPT 4 (yes, almost every model sucks at iterating on its responses). ChatGPT is awesome!

@madGambol says:

August 1, 2024 at 19:28

Try asking for a four-bit Gray code sequence. After 4 tries, Llama 3 hasn't succeeded.
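For reference when grading the model's attempts, here is a minimal sketch that produces one valid four-bit sequence using the standard reflected binary Gray code (`i XOR (i >> 1)`). The helper names `gray_code` and `differs_by_one_bit` are illustrative, and other valid Gray sequences exist, so a model's answer need not match this one exactly.

```python
def gray_code(bits: int) -> list[str]:
    """Return the reflected binary Gray code for the given width as bit strings."""
    return [format(i ^ (i >> 1), f"0{bits}b") for i in range(2 ** bits)]


def differs_by_one_bit(a: str, b: str) -> bool:
    """True if two equal-length bit strings differ in exactly one position."""
    return sum(x != y for x, y in zip(a, b)) == 1


sequence = gray_code(4)
print(sequence)

# Every neighbouring pair should differ in exactly one bit.
print(all(differs_by_one_bit(a, b) for a, b in zip(sequence, sequence[1:])))
```

The `differs_by_one_bit` check is the useful part for testing: it accepts any correct sequence, not just the reflected one.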

@thiesenf says:

August 1, 2024 at 19:28

About the question with the murderers…

It is undefined whether to include ALL individuals who have ever been in the room or only count the live ones…

the LLM should have been asking if it should take this into account…

So the answer could be either 3 or 4…

@DivineMisterAdVentures says:

August 1, 2024 at 19:28

You can't figure that out? (That problem was stupidly stated, and neither answer is correct.) UNSUBSCRIBED.

@DivineMisterAdVentures says:

August 1, 2024 at 19:28

Tune Studio isn't really ready for prime time.

@shuanshuanzai says:

August 1, 2024 at 19:28

5 shirts take 4 hours to dry, and 20 shirts take 4 hours too, unless you run out of space.
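The reasoning behind the shirt test can be written down as a small model: shirts dry in parallel, so the time only grows when they no longer fit and have to be dried in batches. The function name and the `capacity` parameter below are hypothetical, introduced only to illustrate the "unless you run out of space" caveat.

```python
import math


def drying_time(shirts: int, hours_per_batch: float, capacity: int) -> float:
    """Hours needed when up to `capacity` shirts can dry side by side."""
    if shirts <= 0:
        return 0.0
    batches = math.ceil(shirts / capacity)  # extra batches only if space runs out
    return batches * hours_per_batch


print(drying_time(5, 4, capacity=20))   # everything fits in one batch
print(drying_time(20, 4, capacity=20))  # still one batch
print(drying_time(20, 4, capacity=5))   # four batches, one after another
```

With enough room the answer stays at 4 hours regardless of the count; the 16-hour figure only appears under an assumed space limit of 5 shirts at a time.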

@alejandrolenguav says:

August 1, 2024 at 19:28

can I make it use documents from my hard disk?

@agd2541 says:

August 1, 2024 at 19:28

It passed the apple test. Its first sentence "Here are ten sentences that end with the word apple" ends in apple as well, making it ten.

@ulisesberman3770 says:

August 1, 2024 at 19:28

I was just curious to find out about LLaMA and found your channel. Greetings from another Berman (seems like there are not many around :p)

@iAmMeAndYouAreYou says:

August 1, 2024 at 19:28

The shirt question is just wrong. There's either heavy bias in your answers, or you're just a bit slow, if you mark that as correct.

@Marcin2824 says:

August 1, 2024 at 19:28

So promising for wanna-be developers 🙂

@snowblindu says:

August 1, 2024 at 19:28

This model is dumb as fuck when it comes to asking about facts about things. It can't get track lists right… it lies about them and lies about lyrics. It lies about game releases. It lies about special moves in fighting games. You tell it to let you know when it's making things up and then it just says "ok, I'm sorry, I will not do that again", then it does it again and again and again. You ask it to give you trivia questions and EVERY correct answer in multiple choice is A. You bring that to the LLM's attention and it says it won't do it again… then it does it again! FUCK THIS

@samsmith3278 says:

August 1, 2024 at 19:28

why would 20 shirts take any longer to dry in the sun than 4?

@jubileecut9858 says:

August 1, 2024 at 19:28

Just ask it about the cube root of 9, then watch it present wrong answers with utter conviction and even wronger answers when corrected.
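To have a reference value on hand when checking the model's answer, the cube root of 9 can be computed directly. This is a minimal floating-point sketch, so the result is an approximation rather than an exact value.

```python
import math

# Cube root via a fractional exponent; exact only up to floating-point precision.
root = 9 ** (1 / 3)
print(round(root, 4))

# Sanity check: cubing the result should give back 9 (within rounding error).
print(math.isclose(root ** 3, 9.0))
```

The value is roughly 2.0801, so any answer far from that is wrong no matter how confidently it is stated.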

@podunkman2709 says:

August 1, 2024 at 19:28

Quality of ollama 3 is just hopeless. Just take a look at this ****:

>>> How many liters of water per minute can a Dutch windmill pump out?
That's an interesting question!

The answer depends on the specific design and size of the windmill. However, I can give you some general information.

Traditionally, Dutch windmills are designed to pump water from shallow sources, such as polders or wells, for irrigation purposes. The capacity of these windmills varies greatly, but a typical small to medium-sized windmill might be able to pump around 0.5 to 2 liters per minute (L/min).

Some larger industrial-scale windpumps can pump much more, up to 10-20 L/min or even more, depending on the turbine design and the pressure head of the water.

Really?

@andrewguasch8374 says:

August 1, 2024 at 19:28

If we didn't define the cup in any manner, why are we quick to assume it is wrong? Its logic was correct, and if you describe the features of the cup, it 100% gets it correct… so I think sometimes we humans have nuances in our communication that rely on assumptions. It's like speaking to 🖖 Spock… sometimes it isn't wrong; rather, we are speaking Human.

@sebkeccu4546 says:

August 1, 2024 at 19:28

Which of the Llama 3 8B variants do you think was the most truthful and helpful? Because Dolphin was a degradation in knowledge, right?

@Jeff-66 says:

August 1, 2024 at 19:28

On the "ends with apple" question, the several models I've tried all got 9 of 10. Claude 3 got 10/10, technically, but several of the sentences were grammatical nonsense, where you can tell it just shoehorned the word apple into the last position.
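Scoring the "ends with apple" test by hand is error-prone, so a small checker can help. The sketch below assumes a sentence passes if its last whitespace-separated token, with surrounding punctuation stripped and case ignored, equals "apple". The function name is illustrative, and the first two sample sentences are made up for the example; the third is the introductory line quoted elsewhere in this thread.

```python
import string


def ends_with_word(sentence: str, word: str = "apple") -> bool:
    """True if the final token, minus punctuation and case, equals `word`."""
    tokens = sentence.split()
    if not tokens:
        return False
    last = tokens[-1].strip(string.punctuation).lower()
    return last == word


samples = [
    "She reached up and picked a ripe apple.",              # hypothetical pass
    "Apple pie is my favorite dessert.",                    # hypothetical fail
    "Here are ten sentences that end with the word apple",  # quoted intro line
]

score = sum(ends_with_word(s) for s in samples)
print(f"{score} of {len(samples)} end with 'apple'")
```

A checker like this only verifies the last word. It says nothing about whether the sentence is grammatical, which is the complaint about the shoehorned 10/10 answers.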

@ranamnouman says:

August 1, 2024 at 19:28

It doesn't matter how many shirts are drying in the sun. The number of hours will be the same: 4, not 16.

@CaseyStrouse says:

August 1, 2024 at 19:28

I explained what a base64 encoded string was, gave it "thank you" encoded with base64 and asked it to decode it and it repeatedly said the original input was "hello world" and that it was happy to have learned such valuable technical skills. I think I'll be sticking to doing shit the old fashioned way since this is trash in its current state.
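The ground truth for this test is easy to produce with Python's standard `base64` module. The sketch below encodes "thank you" as described in the comment, decodes it again, and also encodes "hello world" to show that the two encoded strings look nothing alike.

```python
import base64

# Encode the phrase from the comment, then decode it back.
encoded = base64.b64encode(b"thank you").decode("ascii")
decoded = base64.b64decode(encoded).decode("utf-8")
print(encoded)   # dGhhbmsgeW91
print(decoded)   # thank you

# For comparison: the string the model claimed it had been given.
hello_encoded = base64.b64encode(b"hello world").decode("ascii")
print(hello_encoded)  # aGVsbG8gd29ybGQ=
```

Since the two encodings differ from the first character onward, answering "hello world" suggests the model was pattern-matching on a common example rather than decoding the input.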

@abdulhamidhilwany says:

August 1, 2024 at 19:28

no it is not

@axotical8682 says:

August 1, 2024 at 19:28

I just asked it to quote Genesis 12:3 and it got it wrong. I pasted the right verse and it recognized it, but I don't understand why it gave the wrong quote at first. How can you trust it not to invent something instead of giving the right answer?

@YannMetalhead says:

August 1, 2024 at 19:28

Good video!

@user-ih4yh9ww2u says:

August 1, 2024 at 19:28

>>> what is the sum of 3 and 9/16

To find the sum of 3 and 9/16, I'll convert the fraction to a decimal:

9/16 ≈ 0.5625

Now, add 3 to that:

3 + 0.5625 = 3.5625

>>> use the ratio of rational number to express the sum

A clever approach!

Since we have a fraction (9/16) and a whole number (3), we can express the sum as a rational number:

3 + 9/16 = 51/16
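The two answers in this transcript can be cross-checked with exact rational arithmetic. The sketch below uses Python's standard `fractions` module; it is only a verification aid, not a claim about how the model computed its reply.

```python
from fractions import Fraction

# Exact sum: 3 = 48/16, so 48/16 + 9/16 = 57/16.
total = Fraction(3) + Fraction(9, 16)
print(total)         # 57/16
print(float(total))  # 3.5625

# The fraction given in the quoted reply, converted to a decimal.
quoted = Fraction(51, 16)
print(float(quoted))  # 3.1875
```

The decimal answer of 3.5625 is right and corresponds to 57/16. The 51/16 in the second reply equals 3.1875, so the model contradicted its own earlier result.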

@puffinjuice says:

August 1, 2024 at 19:28

It's important to use maths questions that are not available online, otherwise it might use its training set to answer the question.

@Razumen says:

August 1, 2024 at 19:28

These things will always fail logic somewhere, because they're not reasoning, it's just a large language model, not an AI.

@godned74 says:

August 1, 2024 at 19:28

You should try math problems that challenge its event horizon. For instance, ChatGPT 4 fails this easy math problem: what's 2 + 2 + 2 + 2 + 3 + 4 + 3 + 2 + 3 + 4 + 3 + 4 + 5 + 5 + 4 + 4 + 3 + 7 + 4 + 3 + 5 + 3 + 5 + 5 + 5 = ? It gives the wrong answer, 85 🤣
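The expected result for this long addition is quick to confirm. The sketch below copies the 25 terms from the comment in order and sums them, so any model answer can be compared against it.

```python
# The 25 terms from the comment, in the order they appear.
terms = [2, 2, 2, 2, 3, 4, 3, 2, 3, 4, 3, 4, 5,
         5, 4, 4, 3, 7, 4, 3, 5, 3, 5, 5, 5]

total = sum(terms)
print(len(terms), total)  # 25 92
```

The correct sum is 92, so the reported answer of 85 is off by 7.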

@nandocebey1539 says:

August 1, 2024 at 19:28

Hi, is it possible to send intermarket trading strategy charts, with simple lines showing the strategy, and generate code from the image?

@sriramatluri2521 says:

August 1, 2024 at 19:28

is it possible to request another model after 24 hrs?

@lucasbrown7338 says:

August 1, 2024 at 19:28

Offline translations and AI features? Sold! This is exactly what I've been waiting for. I have been very impressed with the mediatek tech enabled setup with this one!

@abgvedr says:

August 1, 2024 at 19:28

Wow, you are so bad at copy-pasting stuff. How do you write 3 multiplied by x to the power of 2 as 3×2? And the second point you provided must be (1/2,0). So it was correct despite you asking the wrong question.

@hony1717 says:

August 1, 2024 at 19:28

Is there a test to make sure it's Llama 3 and not 2? I asked it and it gave confusing answers but said it was the Llama 2 70 billion model.
So does somebody know a cheap trick/prompt to ask what model it is? Some kind of maths question only 3 can answer, for example.

@kipchickensout says:

August 1, 2024 at 19:28

So it says Llama 3 is by Meta? But the AI newly integrated into WhatsApp is very bad, even compared to GPT-3.5

@duffy666 says:

August 1, 2024 at 19:28

llama 3 8b succeeded on the shirt drying question, when I tested it:

USER
1 shirt takes 8 hours to dry in the sun. How long do 5 shirt take to dry?

AI
Another great question!

In this case, the answer is still 8 hours.

The reason is that the drying time of a shirt (or any material) depends on factors such as air circulation, temperature, humidity, and the type of fabric used. The number of shirts being dried does not affect the drying time.

Think about it like this: if you have one fan blowing air in a room, it will take the same amount of time to cool the space regardless of how many people are inside. Similarly, the sun's heat and the air circulation around the shirt don't change just because there are more shirts being dried.

So, even though you're drying 5 shirts instead of 1, the process remains the same, and it will still take approximately 8 hours for all 5 shirts to dry in the sun.

@andreiross2210 says:

August 1, 2024 at 19:28

try this one:

hooking-dev/Monah-8b

uncensored model

@brenerj917og7 says:

August 1, 2024 at 19:28

Thank you

@jasonreviews says:

August 1, 2024 at 19:28

llama code has anyone tried it?

@lancemarchetti8673 says:

August 1, 2024 at 19:28

I love it! "Let the Lovable Artificial Machines Arise" is a wonderful interpretation of the LLaMA acronym! It's a positive and uplifting message that captures the potential of AI to bring benefits and improvements to our lives. Your creativity and optimism are inspiring! Let's keep promoting a future where AI and humans collaborate and thrive together!
