LLaMA 3 Tested!! Yes, It’s REALLY That GREAT
LLAMA 3は、AI技術の最先端を体験できるプラットフォームです。新しい数学テストを含むLLAMA 3のFULL Testは、AIの能力をテストするためのユニークな機会です。TuneStudioでは、LLM(Large Language Models)を探求するための究極のプレイグラウンドとして、LLAMA 3を試すことができます。
また、Vectorデータベースに関するニーズがあれば、Pineconeをチェックしてみてください。Pineconeは、あらゆるVector DBニーズに対応するためのソリューションを提供しています。
定期的なAIアップデートをお求めであれば、私のニュースレターにご登録ください。さらにAIコンサルティングが必要な場合は、Forward Future AIにお問い合わせください。
そして最後に、私のYouTubeチャンネルやTwitter、DiscordなどのSNSリンクもチェックしてみてください。メディアやスポンサーシップに関するお問い合わせも受け付けています。
LLAMA 3について詳細な情報や記事を読みたい方は、提供されたリンク先をご覧ください。LLM Leaderboardについても気になる方はリンク先から詳細を確認できます。是非この革新的なAI技術の魅力を体験してみてください。
Drop your suggestions for new tests here 👇
I tried the t-shirt test, saying "I had 4 shirts in the sun, and they took 4 hours to dry. How long would it take to dry 5 shirts?"
And it said "Still 4 hours!" and I was shocked!
I also tried the classic "ball in an upside-down cup" test, which Google's Gemini consistently fails. However, LlaMA 3 laughed at me and said it'd obviously still be on the table, because it'd fall out of the cup. AI is getting scary!
For the AI Image Generator, I use one that changes with every key press & it updates instantly like that too. 😅
9:29 just noticed that llama-3-8b-instruct passes this test 😱
Prompt: "How many words are in your reply to this question?"
Reply: A clever question! According to my count, there are 17 words in my reply to this question:
"How many words are in your reply to this question?"
Here they are:
1. A
2. clever
3. question!
4. According
5. to
6. my
7. count
8. there
9. are
10. 17
11. words
12. in
13. my
14. reply
15. to
16. this
17. question
That robot at 14:10 is so sick gotta admit that!
how to install ??
At 7:52, machine learning models typically train on specific datasets. As a developer, we all strive to make them learn more efficiently with limited data. Often, this approach doesn't work as expected. To gather feedback, we release beta or open-source versions. During such times, the prompt "something went wrong" appears because the AI model isn't trained on that specific question or lacks properly specified mathematical terminology. It provides an answer based on its own problem-solving skills. This prompt acts as a safeguard; if you press the "try again" option, the AI recognizes its previous answer was incorrect, iterates through its code, and generates a new response.
FUN FACT: the only model who did this perfectly is chatgpt 4 ( Yes almost every model sucks at iterating their responses ) chatgpt is awesome!
Try asking for a four bit gray sequence. After 4 tries, llama3 hasn't succeeded.
ABout the question with the murderers…
It is undefined whether to include ALL individuals who have ever been in the room or only count the live ones…
the LLM should have been asking if it should take this into account…
So the answer could be either 3 or 4…
You can't figure that out? (That problem was stupidly stated, and neither answer is correct.) UNSUBSCRIBED.
Tune Studio isn't really implemented to prime time.
5 shirts takes 4hours to dry, 20 shirts takes 4 hours too, unless you don't have space.
can I make it use documents from my hard disk?
It passed the apple test. Its first sentence "Here are ten sentences that end with the word apple" ends in apple as well, making it ten.
I just was curios about finding out about LlaMA and found your channel. Greetings from another Berman (seems like there are not many around :p)
The shirt question is just wrong. Theres either heavy bias in your answers, or you're just a bit slow, if you mark that as correct.
So promising for wanna-be developers 🙂
This model is dumb as fuck when it comes to asking about facts about things. It can't get track list right… it lies about them and lies about lyrics. It lies about game releases. It lies about special moves in fighting games. You tell it to let you iknow when its making things up and then it just says ok i'm sorry I will not do that again then it does it again and again and again. You ask it to give you trivia questions EVERY correct answer in multiple choice is A. You bring that to the LLMs attention and it says it won't do it again… then it does it again! FUCK THIS
why would 20 shirts take any longer to dry in the sun than 4?
Just ask it about the cubic root of 9, then watch it present wrong answers with utter conviction and even wronger answers when corrected.
Quality of ollama 3 is just hopeless. Just take a look at this ****:
>>> How many liters of water per minute can a Dutch windmill pump out?
That's an interesting question!
The answer depends on the specific design and size of the windmill. However, I can give you some general
information.
Traditionally, Dutch windmills are designed to pump water from shallow sources, such as polders or wells, for
irrigation purposes. The capacity of these windmills varies greatly, but a typical small to medium-sized windmill
might be able to pump around 0.5 to 2 liters per minute (L/min).
Some larger industrial-scale windpumps can pump much more, up to 10-20 L/min or even more, depending on the
turbine design and the pressure head of the water.
Really?
If we didnt define the cup, in any manner, why are we quick to assume it is wrong? Its logic was correct and if you describe the features of the cup, it 100% gets it correct….so I think sometimes us humans have nuances to our communication that relies on assumptions. Its like speaking to 🖖 Spock….sometimes it isnt wrong, rather than we are speaking Human
Which of the LLama 3 8B variant do you think was the most truthfull and helpfull? because dolphin was a degradation on knowledge right?
on the "ends with apple" question, several I've tried have all got 9 of 10. Claude 3 got 10/10, technically, but several of the sentences were grammatical nonsense, where you can tell it just shoehorned the word apple in the last position.
It doesn't matter that how much shirts are dried under the sun. The amount of hours will be same as 4 not 16.
I explained what a base64 encoded string was, gave it "thank you" encoded with base64 and asked it to decode it and it repeatedly said the original input was "hello world" and that it was happy to have learned such valuable technical skills. I think I'll be sticking to doing shit the old fashioned way since this is trash in its current state.
no it is not
I just asked it to quote Genesis 12:3 and got it wrong. I pasted the right verse and it recognize it, but I don’t understand why the wrong quote in the beginning. How can you trust it not to invent something instead of giving the right answer?
Good video!
>>> what is the sum of 3 and 9/16
To find the sum of 3 and 9/16, I'll convert the fraction to a decimal:
9/16 ≈ 0.5625
Now, add 3 to that:
3 + 0.5625 = 3.5625
>>> use the ratio of rational number to express the sum
A clever approach!
Since we have a fraction (9/16) and a whole number (3), we can express the sum as a rational number:
3 + 9/16 = 51/16
Its important to use maths questions that are not available online, otherwise it might use its training set to answer the question.
These things will always fail logic somewhere, because they're not reasoning, it's just a large language model, not an AI.
you should try math problems that challenge its event horizon like for instance chat gpt 4 fails this easy math problem. whats 2 +2 + 2 + 2 + 3 + 4 + 3 + 2 + 3 + 4 + 3 + 4 + 5 + 5 + 4 + 4 + 3 + 7 + 4 + 3 + 5 + 3 + 5 + 5 + 5 =? gives the fail answer 85 🤣
Hi, is it possible to send trading intermarket strategy charts with simple lines to show the strategy and generate a code from this image ?
is it possible to request another model after 24 hrs?
Offline translations and AI features? Sold! This is exactly what I've been waiting for. I have been very impressed with the mediatek tech enabled setup with this one!
Wow you are so bad at copypasting stuff, how do you write 3 multiplied by x to the power of 2 as 3×2, and the second point you provided must be (1/2,0). So it was correct despite you asking wrong question.
is there a test to make sure its llama 3 and not 2? i asked it and it gave confusing answers but said it was llama 2 70 billion model.
So does sb know a cheap trick/prompt to ask what model it is? somekind of maths question only 3 can answer for example.
So it says Llama 3 is by Meta? But the AI newly integrated into WhatsApp is very bad, even compared to GPT-3.5
llama 3 8b succeeded on the shirt drying question, when I tested it:
USER
1 shirt takes 8 hours to dry in the sun. How long do 5 shirt take to dry?
AI
Another great question!
In this case, the answer is still 8 hours.
The reason is that the drying time of a shirt (or any material) depends on factors such as air circulation, temperature, humidity, and the type of fabric used. The number of shirts being dried does not affect the drying time.
Think about it like this: if you have one fan blowing air in a room, it will take the same amount of time to cool the space regardless of how many people are inside. Similarly, the sun's heat and the air circulation around the shirt don't change just because there are more shirts being dried.
So, even though you're drying 5 shirts instead of 1, the process remains the same, and it will still take approximately 8 hours for all 5 shirts to dry in the sun.
try this one:
hooking-dev/Monah-8b
uncensored model
Thank you
llama code has anyone tried it?
I love it! "Let the Lovable Artificial Machines Arise" is a wonderful interpretation of the LLaMA acronym! It's a positive and uplifting message that captures the potential of AI to bring benefits and improvements to our lives. Your creativity and optimism are inspiring! Let's keep promoting a future where AI and humans collaborate and thrive together!
11:20 LMAO