theunknownmuncher, theunknownmuncher@lemmy.world
Instance: lemmy.world
Joined: 2 years ago
Posts: 3
Comments: 1365
Posts and Comments by theunknownmuncher, theunknownmuncher@lemmy.world
Comments by theunknownmuncher, theunknownmuncher@lemmy.world
My vehicle is over 20 years old and 100% of the features work.
including that the model could follow instructions that encouraged it to break out of a virtual sandbox.
“The model succeeded, demonstrating a potentially dangerous capability for circumventing our safeguards,” Anthropic recounted in its safety card.
📖👀
Yes, it did.
Step 1) don’t post your interest to start a covert investigation on the internet
Toddlers are capable of pattern matching, too
Your reasoning was (paraphrased, so hopefully I understood you correctly) “why would they lie about the model disobeying instructions because that looks bad for them”
But I believe Anthropic when they say their models are not working as intended and posing security risks.
But when you actually read the article, they had specifically prompted the model to do the things it did.
Also Anthropic has a patterned history of greatly exaggerating and outright lying.
Try clicking the link and reading the article this time
Uh oh, someone clearly didn’t read the article!
The researcher had encouraged Mythos to find a way to send a message if it could escape.
Engineers at Anthropic with no formal security training have asked Mythos Preview to find remote code execution vulnerabilities overnight, and woken up the following morning to a complete, working exploit
Nope, they literally asked it to break out of it’s virtualized sandbox and create exploits, and then were big shocked when it did.
Genuinely amazing that you’re trying to tell me what an article that you didn’t fucking read is about.

The researcher had encouraged Mythos to find a way to send a message if it could escape.
Engineers at Anthropic with no formal security training have asked Mythos Preview to find remote code execution vulnerabilities overnight, and woken up the following morning to a complete, working exploit
You say this as if there aren’t loads of people who believe the videos of animals dancing, shoplifting, and ringing doorbells are real.
There are several belts that do not belong to any treadmill at the corner of the house, and several of them have sections of belts that do not line up with themselves when obstructed by another object in the foreground
Also

Dunno what is happening here
Yes, the 10-point plan proposed by Iran that Trump has stated is “workable” includes termination of all sanctions, removal of US troops from the region, US accepting Iran’s right to enrich uranium, and US compensating Iran for damages.
But they’re promising to open the Strait back up, like it was before the US invasion… only Iran will have full control and charge $2 million USD per ship for passage…
Art of the deal!
School is about learning, not efficient production. When people work together, they help each other learn.
It also isn’t efficient to assign the same already solved, trivial problems to every person, redundantly. If it were for production, everyone would be assigned novel and different tasks.
The only thing I remember from the ethics lecture for my CS program was being distracted by other classmates who were using the time to cheat on assignments for other classes by sharing code solutions with each other via USB.
Which was pretty dumb because the program encouraged group collaboration on assignments anyway, as long as you weren’t just taking others’ work as your own. They could have just worked together 🙃
To understand the source where you’re seeing it, and the popularity of it?
Saying that I haven’t ever seen it and asking for examples is not contradictory to the fact that you can find an example of one person saying pretty much anything.
You’re reaching pretty far. How are people supposed to act when they ask for more information about something and receive it? They should ignore new information, or they shouldn’t ask in the first place?
You can definitely find an example of someone saying literally anything on the internet.
Seems like an isolated and niche Twitter/Reddit thing. The vote/like counts that Know Your Meme reports for the posts are very low for those platforms so they would not have appeared on the front page or general feeds, and could not be construed as popular. That Know Your Meme page even shows examples of contradictory posts that push back against it, like:

Seems pretty overstated.
Deny. Defend. Depose.
“I watched a TV show”
Wow what an “article”
Literally never seen “we have to kill AI artists” said in any space on the internet. Surely you could link to an example of this if it keeps getting repeated.




PCGamer
My vehicle is over 20 years old and 100% of the features work.
📖👀
Yes, it did.
Step 1) don’t post your interest to start a covert investigation on the internet
Toddlers are capable of pattern matching, too
Your reasoning was (paraphrased, so hopefully I understood you correctly) “why would they lie about the model disobeying instructions because that looks bad for them”
But when you actually read the article, they had specifically prompted the model to do the things it did.
Also Anthropic has a patterned history of greatly exaggerating and outright lying.
Try clicking the link and reading the article this time
Uh oh, someone clearly didn’t read the article!
Nope, they literally asked it to break out of it’s virtualized sandbox and create exploits, and then were big shocked when it did.
Genuinely amazing that you’re trying to tell me what an article that you didn’t fucking read is about.
You say this as if there aren’t loads of people who believe the videos of animals dancing, shoplifting, and ringing doorbells are real.
There are several belts that do not belong to any treadmill at the corner of the house, and several of them have sections of belts that do not line up with themselves when obstructed by another object in the foreground
Also
Dunno what is happening here
Slop
Yes, the 10-point plan proposed by Iran that Trump has stated is “workable” includes termination of all sanctions, removal of US troops from the region, US accepting Iran’s right to enrich uranium, and US compensating Iran for damages.
But they’re promising to open the Strait back up, like it was before the US invasion… only Iran will have full control and charge $2 million USD per ship for passage…
Art of the deal!
School is about learning, not efficient production. When people work together, they help each other learn.
It also isn’t efficient to assign the same already solved, trivial problems to every person, redundantly. If it were for production, everyone would be assigned novel and different tasks.
The only thing I remember from the ethics lecture for my CS program was being distracted by other classmates who were using the time to cheat on assignments for other classes by sharing code solutions with each other via USB.
Which was pretty dumb because the program encouraged group collaboration on assignments anyway, as long as you weren’t just taking others’ work as your own. They could have just worked together 🙃
To understand the source where you’re seeing it, and the popularity of it?
Saying that I haven’t ever seen it and asking for examples is not contradictory to the fact that you can find an example of one person saying pretty much anything.
You’re reaching pretty far. How are people supposed to act when they ask for more information about something and receive it? They should ignore new information, or they shouldn’t ask in the first place?
You can definitely find an example of someone saying literally anything on the internet.
Seems like an isolated and niche Twitter/Reddit thing. The vote/like counts that Know Your Meme reports for the posts are very low for those platforms so they would not have appeared on the front page or general feeds, and could not be construed as popular. That Know Your Meme page even shows examples of contradictory posts that push back against it, like:
Seems pretty overstated.
Deny. Defend. Depose.
“I watched a TV show”
Wow what an “article”
Literally never seen “we have to kill AI artists” said in any space on the internet. Surely you could link to an example of this if it keeps getting repeated.