fbpx
01 / 05
The Hassle-Free Future of Trash Pickup and Recycling

Wall Street Journal | Science & Technology

The Hassle-Free Future of Trash Pickup and Recycling

“Urban planners, the refuse industry and cities across the country are reimagining how we manage and dispose of our waste. The New York City Department of Sanitation and the Massachusetts Institute of Technology are among those leveraging artificial intelligence, robotics and electric power to tackle a growing garbage crisis fueled by cheap products and throwaway culture.”

From Wall Street Journal.

DeepMind | Science & Technology

Gemini Robotics-ER 1.6: Powering Real-World Robotics Tasks

“For robots to be truly helpful in our daily lives and industries, they must do more than follow instructions, they must reason about the physical world. From navigating a complex facility to interpreting the needle on a pressure gauge, a robot’s “embodied reasoning” is what allows it to bridge the gap between digital intelligence and physical action.

Today, we’re introducing Gemini Robotics-ER 1.6, a significant upgrade to our reasoning-first model that enables robots to understand their environments with unprecedented precision. By enhancing spatial reasoning and multi-view understanding, we are bringing a new level of autonomy to the next generation of physical agents.

This model specializes in reasoning capabilities critical for robotics, including visual and spatial understanding, task planning and success detection. It acts as the high-level reasoning model for a robot, capable of executing tasks by natively calling tools like Google Search to find information, vision-language-action models (VLAs) or any other third-party user-defined functions.

Gemini Robotics-ER 1.6 shows significant improvement over both Gemini Robotics-ER 1.5 and Gemini 3.0 Flash, specifically enhancing spatial and physical reasoning capabilities such as pointing, counting, and success detection. We are also unlocking a new capability: instrument reading, enabling robots to read complex gauges and sight glasses — a use case we discovered through close collaboration with our partner, Boston Dynamics.”

From DeepMind.

arXiv | Communications

LLMs Show Promising Social Media Fact-Checking Capabilities

“Large language models show promising capabilities for contextual fact-checking on social media: they can verify contested claims through deep research, synthesize evidence from multiple sources, and draft explanations at scale. However, prior work evaluates LLM fact-checking only in controlled settings using benchmarks or crowdworker judgments, leaving open how these systems perform in authentic platform environments. We present the first field evaluation of LLM-based fact-checking deployed on a live social media platform, testing performance directly through X Community Notes’ AI writer feature over a three-month period. Our LLM writer, a multi-step pipeline that handles multimodal content (text, images, and videos), conducts web and platform-native search, and writes contextual notes, was deployed to write 1,614 notes on 1,597 tweets and compared against 1,332 human-written notes on the same tweets using 108,169 ratings from 42,521 raters. Direct comparison of note-level platform outcomes is complicated by differences in submission timing and rating exposure between LLM and human notes; we therefore pursue two complementary strategies: a rating-level analysis modeling individual rater evaluations, and a note-level analysis that equalizes rater exposure across note types. Rating-level analysis shows that LLM notes receive more positive ratings than human notes across raters with different political viewpoints, suggesting the potential for LLM-written notes to achieve the cross-partisan consensus. Note-level analysis confirms this advantage: among raters who evaluated all notes on the same post, LLM notes achieve significantly higher helpfulness scores. Our findings demonstrate that LLMs can contribute high-quality, broadly helpful fact-checking at scale, while highlighting that real-world evaluation requires careful attention to platform dynamics absent from controlled settings.”

From arXiv.

Reuters | Motor Vehicles

Uber, Pony.ai and Verne Launch Robotaxi Service in Croatia

“Uber Technologies has partnered with Pony.ai and autonomous vehicle startup Verne to roll out the first commercial ​robotaxi service in Europe, with operations set to ‌start in the Croatian capital Zagreb.

Under the deal, Chinese robotaxi firm Pony.ai will supply the autonomous driving technology, while Croatian ​startup Verne will serve as the fleet owner ​and manage day-to-day operations, the companies said on ⁠Thursday.”

From Reuters.

Cato Institute | Communications

Starlink Connects Millions of People in Argentina 

“When Javier Milei became president of Argentina in December 2023, one his first measures as part of a package of wide-ranging deregulations was to open up the economy to satellite internet. (I wrote about that and his broader deregulatory push here.)

At a meeting I attended last month with a small group of economists, Argentina’s Minister of Deregulation, Federico Sturzenegger, presented the graph above. It shows how satellite internet use exploded once the government lifted its ban, which had, until then, benefited a politically powerful local internet provider.”

From Cato Institute.