Researchers found AI is hopeless at most Upwork task, it gets the news wrong half the time — and humans crush AI on world model tests. AI Eye. AI agents cant complete 97% of tasks on Upwork to even a basic standard. Researchers at Scale AI and the Center for AI Safety got six different AI models to attempt 240 Upwork projects across categories, including writing, design and data analysis and then compared the results to the real freelancer. The overwhelming majority of the time, the AI models were unable to complete the tasks successfully, with the best AI model, Manus, completing just 2.5% of tasks and earning $1,810 out of $143,991 on offer. Claude Sonnet and Grok 4 managed to finish 2.1% of the tasks. While AI agents are good at simple and defined tasks like “generate a logo,” the research found they are bad at multi-step workflows, taking any initiative or using judgment. So they wont be causing mass unemployment for a while yet. This backs up research from August at MIT, which found that 95% of organizat...
Researchers found AI is hopeless at most Upwork task, it gets the news wrong half the time — and humans crush AI on world model tests. AI Eye. AI agents cant complete 97% of tasks on Upwork to even a basic standard. Researchers at Scale AI and the Center for AI Safety got six different AI models to attempt 240 Upwork projects across categories, including writing, design and data analysis and then compared the results to the real freelancer. The overwhelming majority of the time, the AI models were unable to complete the tasks successfully, with the best AI model, Manus, completing just 2.5% of tasks and earning $1,810 out of $143,991 on offer. Claude Sonnet and Grok 4 managed to finish 2.1% of the tasks. While AI agents are good at simple and defined tasks like “generate a logo,” the research found they are bad at multi-step workflows, taking any initiative or using judgment. So they wont be causing mass unemployment for a while yet. This backs up research from August at MIT, which found that 95% of organizat...
Researchers found AI is hopeless at most Upwork task, it gets the news wrong half the time — and humans crush AI on world model tests. AI Eye. AI agents cant complete 97% of tasks on Upwork to even a basic standard. Researchers at Scale AI and the Center for AI Safety got six different AI models to attempt 240 Upwork projects across categories, including writing, design and data analysis and then compared the results to the real freelancer. The overwhelming majority of the time, the AI models were unable to complete the tasks successfully, with the best AI model, Manus, completing just 2.5% of tasks and earning $1,810 out of $143,991 on offer. Claude Sonnet and Grok 4 managed to finish 2.1% of the tasks. While AI agents are good at simple and defined tasks like “generate a logo,” the research found they are bad at multi-step workflows, taking any initiative or using judgment. So they wont be causing mass unemployment for a while yet. This backs up research from August at MIT, which found that 95% of organizat...
Researchers found AI is hopeless at most Upwork task, it gets the news wrong half the time — and humans crush AI on world model tests. AI Eye. AI agents cant complete 97% of tasks on Upwork to even a basic standard. Researchers at Scale AI and the Center for AI Safety got six different AI models to attempt 240 Upwork projects across categories, including writing, design and data analysis and then compared the results to the real freelancer. The overwhelming majority of the time, the AI models were unable to complete the tasks successfully, with the best AI model, Manus, completing just 2.5% of tasks and earning $1,810 out of $143,991 on offer. Claude Sonnet and Grok 4 managed to finish 2.1% of the tasks. While AI agents are good at simple and defined tasks like “generate a logo,” the research found they are bad at multi-step workflows, taking any initiative or using judgment. So they wont be causing mass unemployment for a while yet. This backs up research from August at MIT, which found that 95% of organizat...
Researchers found AI is hopeless at most Upwork task, it gets the news wrong half the time — and humans crush AI on world model tests. AI Eye. AI agents cant complete 97% of tasks on Upwork to even a basic standard. Researchers at Scale AI and the Center for AI Safety got six different AI models to attempt 240 Upwork projects across categories, including writing, design and data analysis and then compared the results to the real freelancer. The overwhelming majority of the time, the AI models were unable to complete the tasks successfully, with the best AI model, Manus, completing just 2.5% of tasks and earning $1,810 out of $143,991 on offer. Claude Sonnet and Grok 4 managed to finish 2.1% of the tasks. While AI agents are good at simple and defined tasks like “generate a logo,” the research found they are bad at multi-step workflows, taking any initiative or using judgment. So they wont be causing mass unemployment for a while yet. This backs up research from August at MIT, which found that 95% of organizat...
RedStone expands beyond price oracles with Credora, integrating risk analytics across DeFi protocols Morpho and Spark. Modular oracle network RedStone launched Credora, a decentralized finance (DeFi)-native risk ratings platform aiming to bring transparency and credit analytics to lending protocols. RedStone said on Thursday that it had expanded beyond price feeds into the broader domain of credit, collateral and risk intelligence through its Credora acquisition in September. At launch, Credora by RedStone integrates with DeFi lending markets Morpho and Sparks to offer dynamic risk scores and default-probability analytics, accessible through an API. Read more
Bitcoin.com and Concordium have teamed up to introduce age-verified stablecoin payments to 75 million wallets, blending privacy with new compliance standards. Crypto media and wallet platform Bitcoin.com has partnered with Concordium, a privacy-focused layer-1 blockchain, to enable age-verified stablecoin payments across more than 75 million wallets on Bitcoin.com’s network. Announced on Thursday, the integration allows wallet users to verify specific identity attributes, such as age or jurisdiction, without revealing personal details. Verification occurs off-chain through independent third-party providers, and no personal data is stored on the blockchain. Each transaction utilizes zero-knowledge proof technology to verify compliance requirements while maintaining user privacy. Read more
The TRUMP memecoin has confirmed a falling wedge breakout, with price charts indicating a potential 70% rally toward $13 by the end of the year. Key takeaways: TRUMP confirms a falling wedge breakout, targeting a 70% rally toward $13 by 2026. Issuer’s Republic.com deal talks and $200 million buyback plan strengthen bullish fundamentals. Read more
The United Nations is preparing to launch a blockchain academy for governments and a UN-led blockchain advisory group to assist countries in adopting the technology. The United Nations Development Programme (UNDP) plans to launch two initiatives aimed at helping countries adopt blockchain technology. Robert Pasicko, the leader of UNDP’s financial technology team, AltFinLab, told Cointelegraph at the UN City offices in Copenhagen, Denmark, that the organization plans to launch a blockchain education program for government officials alongside a blockchain advisory body. The initiative builds on the UNDP’s existing blockchain academy for UN staff, now aimed at helping governments implement blockchain in real-world applications. Read more