Vue lecture

141° - Alan Wake sur PC (Dématérialisé)

1,24€ - Xbox Store

Alan Wake sur PC (Dématérialisé)

Plongez dans l’univers sombre et captivant d’Alan Wake, un thriller psychologique mêlant action et suspense. Incarne...
  •  

Allemagne - Côte d’Ivoire : les Allemands s’imposent dans le temps additionnel et valident leur ticket pour les seizièmes de finale

Après sa démonstration face à Curaçao, la Mannschaft a attendu le temps additionnel pour prendre le meilleur sur les Eléphants, samedi, à Toronto. L’Allemagne se qualifie ainsi pour la phase à élimination directe pour la première fois depuis 2014.

© COLE BURSTON / AFP

  •  

Fête de la musique : 4800 policiers et gendarmes mobilisés à Paris

En outre, la préfecture a décidé pour «éviter tout risque de chute dans la Seine, d’interdire les rassemblements, cortèges, défilés non déclarés sur les quais bas de dimanche 15h00 à lundi 08h00».

© SEBASTIEN DUPUY / AFP

Conformément aux décisions prise dans la matinée par la cellule interministérielle de crise, la consommation d’alcool dans la rue et les espaces publics est interdite du dimanche 07h00 au lundi 07h00.
  •  

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test

This week OpenAI announced a 750-task test to to measure "whether AI systems can support realistic life science research tasks, not just answer biology questions." But while OpenAI's top-performing GPT-Rosalind model led the rankings, Slashdot reader BrianFagioli notes that "it achieved a pass rate of just 36.1 percent, failing nearly two-thirds of benchmark tasks." Nerds.xyz points out that means "the best-performing model failed nearly two-thirds of the benchmark's tasks." The benchmark also revealed a familiar weakness. AI systems generally perform better when everything is presented as text. Once they are forced to work with supporting documents, figures, or complex datasets, performance drops noticeably. GPT-Rosalind's pass rate fell from 45.1 percent on text-only tasks to 28.1 percent on tasks involving artifacts or URLs. To be fair, the benchmark is not intended to suggest AI is useless in research. Quite the opposite. OpenAI found that models are becoming increasingly capable of scientific communication, evidence synthesis, and translating research findings into practical explanations. Those are valuable skills, particularly for researchers drowning in information. But LifeSciBench serves as a useful reminder that today's AI systems are still far from autonomous scientists. They can help. They can assist. They can sometimes provide surprisingly useful insights. What they cannot reliably do, however, is replace the expertise, judgment, and skepticism that real scientific research requires.

Read more of this story at Slashdot.

  •  
❌