Top Mathematics discussions

NishMath - #AI

Google DeepMind's AlphaGenome Predicts Impact of Mutations - Google DeepMind's AlphaGenome, a new AI model, predicts how mutations affect non-coding DNA, potentially transforming the understanding of diseases by analyzing long DNA sequences to predict regulatory consequences of DNA sequence variations.

References: Maginative , MarkTechPost ,

Google DeepMind has launched AlphaGenome, a new deep learning framework designed to predict the regulatory consequences of DNA sequence variations. This AI model aims to decode how mutations affect non-coding DNA, which makes up 98% of the human genome, potentially transforming the understanding of diseases. AlphaGenome processes up to one million base pairs of DNA at once, delivering predictions on gene expression, splicing, chromatin accessibility, transcription factor binding, and 3D genome structure.

AlphaGenome stands out by comprehensively predicting the impact of single variants or mutations, especially in non-coding regions, on gene regulation. It uses a hybrid neural network that combines convolutional layers and transformers to digest long DNA sequences. The model addresses limitations in earlier models by bridging the gap between long-sequence input processing and nucleotide-level output precision, unifying predictive tasks across 11 output modalities and handling thousands of human and mouse genomic tracks. This makes AlphaGenome one of the most comprehensive sequence-to-function models in genomics.

The AI tool is available via API for non-commercial research to advance scientific research and is planned to be released to the general public in the future. In performance tests, AlphaGenome outperformed or matched the best external models on 24 out of 26 variant effect prediction benchmarks. According to DeepMind's Vice President for Research Pushmeet Kohli, AlphaGenome unifies many different challenges that come with understanding the genome. The model can help researchers identify disease-causing variants and better understand genome function and disease biology, potentially driving new biological discoveries and the development of new treatments.

Recommended read:

Top link: www.marktechpost.com
Permalink: More details

References :

Maginative: DeepMind’s AlphaGenome AI model decodes how mutations affect non-coding DNA, potentially transforming our understanding of disease.
MarkTechPost: Google DeepMind has unveiled AlphaGenome, a new deep learning framework designed to predict the regulatory consequences of DNA sequence variations across a wide spectrum of biological modalities.
Google DeepMind Blog: Introducing a new, unifying DNA sequence model that advances regulatory variant-effect prediction and promises to shed new light on genome function â€” now available via API.

Steve Vandenberg@Microsoft Security Blog //

Microsoft Advances in AI and Data Security - Microsoft's advancements in AI, data security, and computational chemistry include the Responsible AI Transparency Report and AI strategies in higher education.

References: blogs.microsoft.com , Microsoft Security Blog , Microsoft Research ...

Microsoft is making significant strides in AI and data security, demonstrated by recent advancements and reports. The company's commitment to responsible AI is highlighted in its 2025 Responsible AI Transparency Report, detailing efforts to build trustworthy AI technologies. Microsoft is also addressing the critical issue of data breach reporting, offering solutions like Microsoft Data Security Investigations to assist organizations in meeting stringent regulatory requirements such as GDPR and SEC rules. These initiatives underscore Microsoft's dedication to ethical and secure AI development and deployment across various sectors.

AI's transformative potential is being explored in higher education, with Microsoft providing AI solutions for creating AI-ready campuses. Institutions are focusing on using AI for unique differentiation and innovation rather than just automation and cost savings. Strategies include establishing guidelines for responsible AI use, fostering collaborative communities for knowledge sharing, and partnering with technology vendors like Microsoft, OpenAI, and NVIDIA. Comprehensive training programs are also essential to ensure stakeholders are proficient with AI tools, promoting a culture of experimentation and ethical AI practices.

Furthermore, Microsoft Research has achieved a breakthrough in computational chemistry by using deep learning to enhance the accuracy of density functional theory (DFT). This advancement allows for more reliable predictions of molecular and material properties, accelerating scientific discovery in fields such as drug development, battery technology, and green fertilizers. By generating vast amounts of accurate data and using scalable deep-learning approaches, the team has overcome limitations in DFT, enabling the design of molecules and materials through computational simulations rather than relying solely on laboratory experiments.

Recommended read:

Top link: Microsoft Security Blog
Permalink: More details

References :

blogs.microsoft.com: Our 2025 Responsible AI Transparency Report: How we build, support our customers, and grow
Microsoft Security Blog: Data Breach Reporting for regulatory requirements with Microsoft Data Security Investigations
www.microsoft.com: Breaking bonds, breaking ground: Advancing the accuracy of computational chemistry with deep learning
Microsoft Research: Breaking bonds, breaking ground: Advancing the accuracy of computational chemistry with deep learning
The Microsoft Cloud Blog: Our 2025 Responsible AI Transparency Report: How we build, support our customers, and grow

@www.marktechpost.com //

Google's AI Model Forecasts Tropical Cyclones Accurately - Google launched an AI model via Weather Lab for forecasting tropical cyclones, developed with DeepMind, aiming to predict path and intensity days in advance.

References: siliconangle.com , AI News | VentureBeat , Maginative ...

Google has unveiled a new AI model designed to forecast tropical cyclones with improved accuracy. Developed through a collaboration between Google Research and DeepMind, the model is accessible via a newly launched website called Weather Lab. The AI aims to predict both the path and intensity of cyclones days in advance, overcoming limitations present in traditional physics-based weather prediction models. Google claims its algorithm achieves "state-of-the-art accuracy" in forecasting cyclone track and intensity, as well as details like formation, size, and shape.

The AI model was trained using two extensive datasets: one describing the characteristics of nearly 5,000 cyclones from the past 45 years, and another containing millions of weather observations. Internal testing demonstrated the algorithm's ability to accurately predict the paths of recent cyclones, in some cases up to a week in advance. The model can generate 50 possible scenarios, extending forecast capabilities up to 15 days.

This breakthrough has already seen adoption by the U.S. National Hurricane Center, which is now using these experimental AI predictions alongside traditional forecasting models in its operational workflow. Google's AI's ability to forecast up to 15 days in advance marks a significant improvement over current models, which typically provide 3-5 day forecasts. Google made the AI accessible through a new website called Weather Lab. The model is available alongside two years' worth of historical forecasts, as well as data from traditional physics-based weather prediction algorithms. According to Google, this could help weather agencies and emergency service experts better anticipate a cyclone’s path and intensity.

Recommended read:

Top link: www.marktechpost.com
Permalink: More details

References :

siliconangle.com: Google LLC today detailed an artificial intelligence model that can forecast the path and intensity of tropical cyclones days in advance.
AI News | VentureBeat: Google DeepMind just changed hurricane forecasting forever with new AI model
MarkTechPost: Google AI Unveils a Hybrid AI-Physics Model for Accurate Regional Climate Risk Forecasts with Better Uncertainty Assessment
Maginative: Google's AI Can Now Predict Hurricane Paths 15 Days Out â€” and the Hurricane Center Is Using It
SiliconANGLE: Google develops AI model for forecasting tropical cyclones. According to the company, the algorithm was developed through a collaboration between its Google Research and DeepMind units. Itâ€™s available through a newly launched website called Weather Lab.
The Official Google Blog: Weather Lab is an interactive website for sharing Googleâ€™s AI weather models.
www.engadget.com: Google DeepMind is sharing its AI forecasts with the National Weather Service
www.producthunt.com: Predicting cyclone paths & intensity 15 days ahead |
the-decoder.com: Google Deepmind launches Weather Lab to test AI models for tropical cyclone forecasting
AIwire: Google DeepMind Launches Interactive AI That Lets You Explore Storm Forecasts
www.aiwire.net: Google DeepMind and Google Research are launching Weather Lab - a new AI-driven platform designed specifically to improve forecasts for tropical cyclone formation, intensity, and trajectory.

@www.marktechpost.com //

Apple Research Questioning LLMs Ability in Reasoning - Apple research challenges the reasoning capabilities of Large Reasoning Models (LRMs), suggesting they struggle with basic reasoning tasks, sparking debate within the AI community where Google is on the other side of the debate.

References: TheSequence , chatgptiseatingtheworld.com , arstechnica.com ...

Apple researchers are challenging the perceived reasoning capabilities of Large Reasoning Models (LRMs), sparking debate within the AI community. A recent paper from Apple, titled "The Illusion of Thinking," suggests that these models, which generate intermediate thinking steps like Chain-of-Thought reasoning, struggle with fundamental reasoning tasks. The research indicates that current evaluation methods relying on math and code benchmarks are insufficient, as they often suffer from data contamination and fail to assess the structure or quality of the reasoning process.

To address these shortcomings, Apple researchers introduced controllable puzzle environments, including the Tower of Hanoi, River Crossing, Checker Jumping, and Blocks World, allowing for precise manipulation of problem complexity. These puzzles require diverse reasoning abilities, such as constraint satisfaction and sequential planning, and are free from data contamination. The Apple paper concluded that state-of-the-art LRMs ultimately fail to develop generalizable problem-solving capabilities, with accuracy collapsing to zero beyond certain complexities across different environments.

However, the Apple research has faced criticism. Experts, like Professor Seok Joon Kwon, argue that Apple's lack of high-performance hardware, such as a large GPU-based cluster comparable to those operated by Google or Microsoft, could be a factor in their findings. Some argue that the models perform better on familiar puzzles, suggesting that their success may be linked to training exposure rather than genuine problem-solving skills. Others, such as Alex Lawsen and "C. Opus," argue that the Apple researchers' results don't support claims about fundamental reasoning limitations, but rather highlight engineering challenges related to token limits and evaluation methods.

Recommended read:

Top link: www.marktechpost.com
Permalink: More details

References :

TheSequence: The Sequence Research #663: The Illusion of Thinking, Inside the Most Controversial AI Paper of Recent Weeks
chatgptiseatingtheworld.com: Research: Did Apple researchers overstate â€œThe Illusion of Thinkingâ€ in reasoning models. Opus, Lawsen think so.
www.marktechpost.com: Apple Researchers Reveal Structural Failures in Large Reasoning Models Using Puzzle-Based Evaluation
arstechnica.com: New Apple study challenges whether AI models truly â€œreasonâ€ through problems
9to5Mac: New paper pushes back on Apple’s LLM ‘reasoning collapse’ study

Carl Franzen@AI News | VentureBeat //

Mistral AI Launches Magistral Reasoning Models Openly - Mistral AI launched its first reasoning model, Magistral, available in both large and small Apache 2.0 versions, and introduced Mistral Agents API’s Handoffs feature for smart, multi-agent workflows.

References: Simon Willison , Simon Willison's Weblog , AI News | VentureBeat ...

Mistral AI has launched its first reasoning model, Magistral, signaling a commitment to open-source AI development. The Magistral family features two models: Magistral Small, a 24-billion parameter model available with open weights under the Apache 2.0 license, and Magistral Medium, a proprietary model accessible through an API. This dual release strategy aims to cater to both enterprise clients seeking advanced reasoning capabilities and the broader AI community interested in open-source innovation.

Mistral's decision to release Magistral Small under the permissive Apache 2.0 license marks a significant return to its open-source roots. The license allows for the free use, modification, and distribution of the model's source code, even for commercial purposes. This empowers startups and established companies to build and deploy their own applications on top of Mistral’s latest reasoning architecture, without the burdens of licensing fees or vendor lock-in. The release serves as a powerful counter-narrative, reaffirming Mistral’s dedication to arming the open community with cutting-edge tools.

Magistral Medium demonstrates competitive performance in the reasoning arena, according to internal benchmarks released by Mistral. The model was tested against its predecessor, Mistral-Medium 3, and models from Deepseek. Furthermore, Mistral's Agents API's Handoffs feature facilitates smart, multi-agent workflows, allowing different agents to collaborate on complex tasks. This enables modular and efficient problem-solving, as demonstrated in systems where agents collaborate to answer inflation-related questions.

Recommended read:

Top link: AI News | VentureBeat
Permalink: More details

References :

Simon Willison: Mistral's first reasoning LLM - Magistral - was released today and is available in two sizes, an open weights (Apache 2) 24B model called Magistral Small and an API/hosted only model called Magistral Medium.
Simon Willison's Weblog: Mistral's first reasoning model is out today, in two sizes. There's a 24B Apache 2 licensed open-weights model called Magistral Small (actually Magistral-Small-2506), and a larger API-only model called Magistral Medium.
THE DECODER: Mistral launches Europe's first reasoning model Magistral but lags behind competitors
AI News | VentureBeat: The company is signaling that the future of reasoning AI will be both powerful and, in a meaningful way, open to all.
www.marktechpost.com: How to Create Smart Multi-Agent Workflows Using the Mistral Agents API’s Handoffs Feature
TestingCatalog: Mistral AI debuts Magistral models focused on advanced reasoning
www.artificialintelligence-news.com: Mistral AI has pulled back the curtain on Magistral, their first model specifically built for reasoning tasks.
www.infoworld.com: Mistral AI unveils Magistral reasoning model
AI News: Mistral AI has pulled back the curtain on Magistral, their first model specifically built for reasoning tasks.
the-decoder.com: The French start-up Mistral is launching its first reasoning model on the market with Magistral. It is designed to enable logical thinking in European languages.
Simon Willison: Mistral's first reasoning LLM - Magistral - was released today and is available in two sizes, an open weights (Apache 2) 24B model called Magistral Small and an API/hosted only model called Magistral Medium. My notes here, including running Small locally with Ollama and accessing Medium via my llm-mistral plugin
SiliconANGLE: Mistral AI debuts new Magistral series of reasoning LLMs.
siliconangle.com: Mistral AI debuts new Magistral series of reasoning LLMs
MarkTechPost: Mistral AI Releases Magistral Series: Advanced Chain-of-Thought LLMs for Enterprise and Open-Source Applications
www.marktechpost.com: Mistral AI Releases Magistral Series: Advanced Chain-of-Thought LLMs for Enterprise and Open-Source Applications
WhatIs: What differentiates Mistral AI reasoning model Magistral
AlternativeTo: Mistral AI debuts Magistral: a transparent, multilingual reasoning model family, including open-source Magistral Small available on Hugging Face and enterprise-focused Magistral Medium available on various platforms.

Mark Tyson@tomshardware.com //

OpenAI Launches o3-pro Reasoning Model - OpenAI released o3-pro, a smarter model with enhanced capabilities in math, science, and programming, available to ChatGPT Pro and Team subscribers and through OpenAI's API, with a price cut of 87%.

References: Maginative , AI News | VentureBeat , THE DECODER ...

OpenAI has recently launched its newest reasoning model, o3-pro, making it available to ChatGPT Pro and Team subscribers, as well as through OpenAI’s API. Enterprise and Edu subscribers will gain access the following week. The company touts o3-pro as a significant upgrade, emphasizing its enhanced capabilities in mathematics, science, and coding, and its improved ability to utilize external tools.

OpenAI has also slashed the price of o3 by 80% and o3-pro by 87%, positioning the model as a more accessible option for developers seeking advanced reasoning capabilities. This price adjustment comes at a time when AI providers are competing more aggressively on both performance and affordability. Experts note that evaluations consistently prefer o3-pro over the standard o3 model across all categories, especially in science, programming, and business tasks.

O3-pro utilizes the same underlying architecture as o3, but it’s tuned to be more reliable, especially on complex tasks, with better long-range reasoning. The model supports tools like web browsing, code execution, vision analysis, and memory. While the increased complexity can lead to slower response times, OpenAI suggests that the tradeoff is worthwhile for the most challenging questions "where reliability matters more than speed, and waiting a few minutes is worth the tradeoff.”

Recommended read:

Top link: tomshardware.com
Permalink: More details

References :

Maginative: OpenAI’s new o3-pro model is now available in ChatGPT and the API, offering top-tier performance in math, science, and coding—at a dramatically lower price.
AI News | VentureBeat: OpenAI's most powerful reasoning model, o3, is now 80% cheaper, making it more affordable for businesses, researchers, and individual developers.
Latent.Space: OpenAI just dropped the price of their o3 model by 80% today and launched o3-pro.
THE DECODER: OpenAI has lowered the price of its o3 language model by 80 percent, CEO Sam Altman said.
Simon Willison's Weblog: OpenAI's Adam Groth explained that the engineers have optimized inference, allowing a significant price reduction for the o3 model.
the-decoder.com: OpenAI lowered the price of its o3 language model by 80 percent, CEO Sam Altman said.
AI News | VentureBeat: OpenAI released the latest in its o-series of reasoning model that promises more reliable and accurate responses for enterprises.
bsky.app: The OpenAI API is back to running at 100% again, plus we dropped o3 prices by 80% and launched o3-pro - enjoy!
Sam Altman: We are past the event horizon; the takeoff has started. Humanity is close to building digital superintelligence, and at least so far it’s much less weird than it seems like it should be.
siliconangle.com: OpenAIâ€™s newest reasoning model o3-pro surpasses rivals on multiple benchmarks, but itâ€™s not very fast
SiliconANGLE: OpenAI’s newest reasoning model o3-pro surpasses rivals on multiple benchmarks, but it’s not very fast
bsky.app: the OpenAI API is back to running at 100% again, plus we dropped o3 prices by 80% and launched o3-pro - enjoy!
bsky.app: OpenAI has launched o3-pro. The new model is available to ChatGPT Pro and Team subscribers and in OpenAI’s API now, while Enterprise and Edu subscribers will get access next week. If you use reasoning models like o1 or o3, try o3-pro, which is much smarter and better at using external tools.
The Algorithmic Bridge: OpenAI o3-Pro Is So Good That I Can’t Tell How Good It Is

@www.marktechpost.com //

DeepSeek R1-0528 Update Improves Reasoning and Coding - DeepSeek released DeepSeek-R1-0528, with improved performance in math, coding, and general reasoning and is a competitive open-source alternative to models like OpenAI’s o3 and Google’s Gemini 2.5 Pro.

References: Kyle Wiggers ? , AI News | VentureBeat , MacStories ...

DeepSeek has released a major update to its R1 reasoning model, dubbed DeepSeek-R1-0528, marking a significant step forward in open-source AI. The update boasts enhanced performance in complex reasoning, mathematics, and coding, positioning it as a strong competitor to leading commercial models like OpenAI's o3 and Google's Gemini 2.5 Pro. The model's weights, training recipes, and comprehensive documentation are openly available under the MIT license, fostering transparency and community-driven innovation. This release allows researchers, developers, and businesses to access cutting-edge AI capabilities without the constraints of closed ecosystems or expensive subscriptions.

The DeepSeek-R1-0528 update brings several core improvements. The model's parameter count has increased from 671 billion to 685 billion, enabling it to process and store more intricate patterns. Enhanced chain-of-thought layers deepen the model's reasoning capabilities, making it more reliable in handling multi-step logic problems. Post-training optimizations have also been applied to reduce hallucinations and improve output stability. In practical terms, the update introduces JSON outputs, native function calling, and simplified system prompts, all designed to streamline real-world deployment and enhance the developer experience.

Specifically, DeepSeek R1-0528 demonstrates a remarkable leap in mathematical reasoning. On the AIME 2025 test, its accuracy improved from 70% to an impressive 87.5%, rivaling OpenAI's o3. This improvement is attributed to "enhanced thinking depth," with the model now utilizing significantly more tokens per question, indicating more thorough and systematic logical analysis. The open-source nature of DeepSeek-R1-0528 empowers users to fine-tune and adapt the model to their specific needs, fostering further innovation and advancements within the AI community.

Recommended read:

Top link: www.marktechpost.com
Permalink: More details

References :

Kyle Wiggers ?: DeepSeek updates its R1 reasoning AI model, releases it on Hugging Face
AI News | VentureBeat: VentureBeat article on DeepSeek R1-0528.
Analytics Vidhya: New Deepseek R1-0528 Update is INSANE
MacStories: Testing DeepSeek R1-0528 on the M3 Ultra Mac Studio and Installing Local GGUF Models with Ollama on macOS
www.analyticsvidhya.com: New Deepseek R1-0528 Update is INSANE
www.marktechpost.com: DeepSeek Releases R1-0528: An Open-Source Reasoning AI Model Delivering Enhanced Math and Code Performance with Single-GPU Efficiency
NextBigFuture.com: DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training.
MarkTechPost: DeepSeek Releases R1-0528: An Open-Source Reasoning AI Model Delivering Enhanced Math and Code Performance with Single-GPU Efficiency
: In the early hours of May 29, Chinese AI startup DeepSeek quietly open-sourced the latest iteration of its R1 large language model, DeepSeek-R1-0528, on the Hugging Face platform .
www.computerworld.com: Reports that DeepSeek releases a new version of its R1 reasoning AI model.
techcrunch.com: DeepSeek updates its R1 reasoning AI model, releases it on Hugging Face
the-decoder.com: Deepseek's R1 model closes the gap with OpenAI and Google after major update
Simon Willison: Some notes on the new DeepSeek-R1-0528 - a completely different model from the R1 they released in January, despite having a very similar name Terrible LLM naming has managed to infect the Chinese AI labs too
Analytics India Magazine: The new DeepSeek-R1 Is as good as OpenAI o3 and Gemini 2.5 Pro
: The 'Minor Upgrade' That's Anything But: DeepSeek R1-0528 Deep Dive
simonwillison.net: Some notes on the new DeepSeek-R1-0528 - a completely different model from the R1 they released in January, despite having a very similar name Terrible LLM naming has managed to infect the Chinese AI labs too
TheSequence: This article provides an overview of the new DeepSeek R1-0528 model and notes its improvements over the prior model released in January.
Kyle Wiggers ?: News about the release of DeepSeek's updated R1 AI model, emphasizing its increased censorship.
Fello AI: Reports that the R1-0528 model from DeepSeek is matching the capabilities of OpenAI's o3 and Google's Gemini 2.5 Pro.
felloai.com: Latest DeepSeek Update Called R1-0528 Is Matching OpenAIâ€™s o3 & Gemini 2.5 Pro
www.tomsguide.com: DeepSeekâ€™s latest update is a serious threat to ChatGPT and Google â€” hereâ€™s why

Source Asia@Source Asia //

AI Model Revolutionizes Earth System Forecasting - Microsoft’s Aurora AI, a 1.3-billion-parameter foundation model, excels in Earth system forecasting, outperforming operational models in various areas at reduced computational costs, marking a significant advancement in environmental forecasting.

References: news.microsoft.com , Microsoft Research

Microsoft Research has unveiled Aurora, a groundbreaking AI foundation model with 1.3 billion parameters, that is set to revolutionize Earth system forecasting. This innovative model outperforms traditional operational forecasts in critical areas such as air quality prediction, ocean wave forecasting, tropical cyclone tracking, and high-resolution weather prediction. Aurora achieves this superior performance at significantly lower computational costs, marking a significant advancement in the field. The model's capabilities extend beyond traditional weather forecasting, positioning it as a versatile tool for addressing a wide range of environmental challenges.

Aurora's architecture, based on Perceiver IO, allows it to efficiently process structured inputs and outputs, making it well-suited for complex Earth system data. Researchers at Microsoft have trained Aurora on an unprecedented volume of atmospheric data, incorporating information from satellites, radar, weather stations, simulations, and forecasts. This extensive training enables Aurora to rapidly generate forecasts and adapt to specific tasks through fine-tuning with smaller, task-specific datasets. The model's flexibility and ability to learn from diverse data sources are key factors in its exceptional forecasting accuracy.

The development of Aurora signifies a major step forward in applying AI to Earth science. By demonstrating the potential of foundation models to accurately and efficiently predict various environmental phenomena, Aurora paves the way for new approaches to disaster preparedness, resource management, and climate change mitigation. The publicly available code and weights of Aurora, accessible on GitHub, encourage further research and development in this exciting area. Microsoft's work underscores the transformative power of AI in addressing some of the world's most pressing environmental challenges.

Recommended read:

Top link: Source Asia
Permalink: More details

References :

news.microsoft.com: From sea to sky: Microsoftâ€™s Aurora AI foundation model goes beyond weather forecasting
Microsoft Research: Abstracts: Aurora with Megan Stanley and Wessel Bruinsma

Source Asia@Source Asia //

Microsoft's Aurora AI Model Delivers Accurate Long-Range Weather Forecasts - Microsoft’s Aurora AI model delivers accurate weather forecasts, processing datasets to generate predictions and beat existing models across 91 percent of forecasting targets.

References: Source Asia , Microsoft Research , www.nature.com ...

Microsoft's Aurora AI model is revolutionizing weather forecasting by providing accurate 10-day forecasts in mere seconds. This AI foundation model, developed by Microsoft Research, has demonstrated capabilities that extend beyond traditional weather prediction, encompassing environmental events such as tropical cyclones, air quality, and ocean waves. Aurora achieves this by training on a massive dataset of over one million hours of atmospheric data from satellites, radar, weather stations, simulations, and forecasts, which Microsoft believes is the largest collection ever assembled for training an AI forecasting model. The model's speed and accuracy have the potential to improve safety and inform decisions across various sectors.

The core strength of Aurora lies in its foundation model architecture. It's not simply limited to weather forecasting; it can be fine-tuned for specific environmental prediction tasks. After initial training on general weather patterns, Aurora can be adapted with smaller datasets to forecast elements like wave height or air quality. The AI does not fully grasp the physical laws governing weather, but its use for environmental prediction tasks and ability to provide accurate forecasts is still significant. This flexibility makes it a versatile tool for understanding and predicting various aspects of the Earth system.

Aurora's performance has been noteworthy, beating existing numerical and AI models across 91 percent of forecasting targets when fine-tuned to medium-range weather forecasts. Its rapid processing time, taking seconds compared to the hours required by traditional models, makes it a valuable asset for timely decision-making. Microsoft is leveraging AI technology to make weather forecasting more efficient and accurate. While generative AI is revolutionizing how we do things, integrating it into workflows is making work easier by automating redundant tasks, creating more time to focus on more important tasks.

Recommended read:

Top link: Source Asia
Permalink: More details

References :

Source Asia: From sea to sky: Microsoftâ€™s Aurora AI foundation model goes beyond weather forecasting
Microsoft Research: Abstracts: Aurora with Megan Stanley and Wessel Bruinsma
www.windowscentral.com: Microsoft's latest AI model can accurately forecast the weather: â€œIt doesnâ€™t know the laws of physics, so it could make up something completely crazyâ€
www.nature.com: A foundation model for the Earth system Bodnar, C., Bruinsma, W.P., Lucic, A. et al. A foundation model for the Earth system. Nature (2025).
maXpool: A foundation model for the Earth system. Aurora is a 1.3-billion-parameter foundation model for the Earth system.
doi.org: A foundation model for the Earth system
techxplore.com: Aurora, Microsoft's new AI model, is poised to revolutionize weather prediction with detailed forecasting.

Source Asia@Source Asia //

Microsoft Aurora AI Foundation Model Enhances Weather Prediction - Microsoft’s Aurora AI foundation model is revolutionizing weather and environmental forecasting with quicker and more accurate predictions, outperforming existing models in forecasting hurricanes and other extreme weather events.

References: Source Asia , Source , news.microsoft.com ...

Microsoft's Aurora AI foundation model is revolutionizing weather and environmental forecasting, offering quicker and more accurate predictions compared to traditional methods. Developed by Microsoft Research, Aurora is a large-scale AI model trained on a vast dataset of atmospheric information, including satellite data, radar readings, weather station observations, and simulations. This comprehensive training allows Aurora to forecast a range of environmental events, from hurricanes and typhoons to air quality and ocean waves, with exceptional precision and speed. The model's capabilities extend beyond conventional weather forecasting, making it a versatile tool for understanding and predicting environmental changes.

Aurora's unique architecture enables it to be fine-tuned for specific tasks using modest amounts of additional data. This "fine-tuning" process allows the model to generate forecasts in seconds, demonstrating its efficiency and adaptability. Researchers have shown that Aurora outperforms existing numerical and AI models in 91% of forecasting targets when fine-tuned for medium-range weather forecasts. Its ability to accurately predict hurricane trajectories and other extreme weather events highlights its potential to improve disaster preparedness and response efforts, ultimately saving lives and mitigating damage.

Senior researchers Megan Stanley and Wessel Bruinsma emphasized Aurora's broader impact on environmental science, noting its potential to revolutionize the field. In a paper published in Nature, they highlighted Aurora's ability to correctly forecast hurricanes in 2023 more accurately than operational forecasting centers, such as the US National Hurricane Center. Aurora also demonstrated its capabilities when correctly forecasting where and when Doksuri would hit the Philippines four days in advance. These findings underscore the transformative potential of AI in addressing complex environmental challenges and paving the way for more effective climate modeling and environmental event management.

Recommended read:

Top link: Source Asia
Permalink: More details

References :

Source Asia: Microsoftâ€™s Aurora AI foundation model goes beyond weather forecasting
Source: Aurora is a new foundation model from Microsoft Research that goes beyond weather forecasting, delivering faster, more accurate predictions of environmental events. Awesome to see this breakthrough published in Nature Magazine.
Microsoft Research: Abstracts: Aurora with Megan Stanley and Wessel Bruinsma
news.microsoft.com: Microsoftâ€™s Aurora AI foundation model goes beyond weather forecasting
www.sciencedaily.com: AI is good at weather forecasting. Can it predict freak weather events?
techxplore.com: Microsoft has developed an artificial intelligence (AI) model that beats current forecasting methods in tracking air quality, weather patterns, and climate-addled tropical storms, according to findings published Wednesday.
www.nature.com: Details about Aurora, a 1.3-billion-parameter foundation model for the Earth system, outperforming operational forecasts.
www.windowscentral.com: Microsoft's latest AI model can accurately forecast the weather: â€œIt doesnâ€™t know the laws of physics, so it could make up something completely crazyâ€
The Tech Basic: Microsoft’s New AI Can Predict Storms and Pollution Better Than Ever
doi.org: A foundation model for the Earth system
thetechbasic.com: Microsoftâ€™s New AI Can Predict Storms and Pollution Better Than Ever
bsky.app: An #AI trained on decades of weather data can predict hurricanes better than other approaches: https://www.theregister.com/2025/05/21/earth_system_model_hurricane_forecast/ #ArtificialIntelligence
computersweden.se: Microsoft slÃ¤pper ny AI som bÃ¤ttre kan fÃ¶rutspÃ¥ luftkvalitet och vÃ¤der
intelligence-artificielle.developpez.com: Microsoft has developed the AI model Aurora which generates 10-day weather forecasts and predicts hurricane trajectories, thus surpassing current forecasting methods.
techcrunch.com: Microsoft's new AI will provide better air quality and weather forecasts

Matthias Bastian@THE DECODER //

GPT-4.1 Models Enter the ChatGPT Service for Coders - OpenAI is updating Copilot with GPT-4.1, a specialized model enhancing coding and instruction-following, available to ChatGPT Plus, Pro, and Team users; GPT-4.1 mini will be available to all ChatGPT users.

References: twitter.com , www.computerworld.com , Maginative ...

OpenAI has announced the integration of GPT-4.1 and GPT-4.1 mini models into ChatGPT, aimed at enhancing coding and web development capabilities. The GPT-4.1 model, designed as a specialized model excelling at coding tasks and instruction following, is now available to ChatGPT Plus, Pro, and Team users. According to OpenAI, GPT-4.1 is faster and a great alternative to OpenAI o3 & o4-mini for everyday coding needs, providing more help to developers creating applications.

OpenAI is also rolling out GPT-4.1 mini, which will be available to all ChatGPT users, including those on the free tier, replacing the previous GPT-4o mini model. This model serves as the fallback option once GPT-4o usage limits are reached. The release notes confirm that GPT 4.1 mini offers various improvements over GPT-4o mini, including instruction-following, coding, and overall intelligence. This initiative is part of OpenAI's effort to make advanced AI tools more accessible and useful for a broader audience, particularly those engaged in programming and web development.

Johannes Heidecke, Head of Systems at OpenAI, has emphasized that the new models build upon the safety measures established for GPT-4o, ensuring parity in safety performance. According to Heidecke, no new safety risks have been introduced, as GPT-4.1 doesn’t introduce new modalities or ways of interacting with the AI, and that it doesn’t surpass o3 in intelligence. The rollout marks another step in OpenAI's increasingly rapid model release cadence, significantly expanding access to specialized capabilities in web development and coding.

Recommended read:

Top link: THE DECODER
Permalink: More details

References :

twitter.com: GPT-4.1 is a specialized model that excels at coding tasks & instruction following. Because it’s faster, it’s a great alternative to OpenAI o3 & o4-mini for everyday coding needs.
www.computerworld.com: OpenAI adds GPT-4.1 models to ChatGPT
gHacks Technology News: OpenAI releases GPT-4.1 and GPT-4.1 mini AI models for ChatGPT
Maginative: OpenAI Brings GPT-4.1 to ChatGPT
www.windowscentral.com: “Am I crazy or is GPT-4.1 the best model for coding?” ChatGPT gets new models with exemplary web development capabilities — but OpenAI is under fire for allegedly skimming through safety processes
the-decoder.com: OpenAI brings its new GPT-4.1 model to ChatGPT users
www.ghacks.net: OpenAI releases GPT-4.1 and GPT-4.1 mini AI models for ChatGPT
AI News | VentureBeat: OpenAI is rolling out GPT-4.1, its new non-reasoning large language model (LLM) that balances high performance with lower cost, to users of ChatGPT.
www.techradar.com: OpenAI just gave ChatGPT users a huge free upgrade – 4.1 mini is available today
www.marktechpost.com: OpenAI has introduced Codex, a cloud-native software engineering agent integrated into ChatGPT, signaling a new era in AI-assisted software development.

@Google DeepMind Blog //

Google DeepMind Unveils AlphaEvolve Towards AGI - Google DeepMind unveiled AlphaEvolve, an AI coding agent which autonomously discovers new algorithms and scientific solutions, demonstrating advancements towards Artificial General Intelligence (AGI) and Artificial Superintelligence (ASI).

References: LearnAI , The Next Web , www.unite.ai ...

Google DeepMind has introduced AlphaEvolve, a revolutionary AI coding agent designed to autonomously discover innovative algorithms and scientific solutions. This groundbreaking research, detailed in the paper "AlphaEvolve: A Coding Agent for Scientific and Algorithmic Discovery," represents a significant step towards achieving Artificial General Intelligence (AGI) and potentially even Artificial Superintelligence (ASI). AlphaEvolve distinguishes itself through its evolutionary approach, where it autonomously generates, evaluates, and refines code across generations, rather than relying on static fine-tuning or human-labeled datasets. AlphaEvolve combines Google’s Gemini Flash, Gemini Pro, and automated evaluation metrics.

AlphaEvolve operates using an evolutionary pipeline powered by large language models (LLMs). This pipeline doesn't just generate outputs—it mutates, evaluates, selects, and improves code across generations. The system begins with an initial program and iteratively refines it by introducing carefully structured changes. These changes take the form of LLM-generated diffs—code modifications suggested by a language model based on prior examples and explicit instructions. A diff in software engineering refers to the difference between two versions of a file, typically highlighting lines to be removed or replaced.

Google's AlphaEvolve is not merely another code generator, but a system that generates and evolves code, allowing it to discover new algorithms. This innovation has already demonstrated its potential by shattering a 56-year-old record in matrix multiplication, a core component of many machine learning workloads. Additionally, AlphaEvolve has reclaimed 0.7% of compute capacity across Google's global data centers, showcasing its efficiency and cost-effectiveness. AlphaEvolve imagined as a genetic algorithm coupled to a large language model.

Recommended read:

Top link: Google DeepMind Blog
Permalink: More details

References :

LearnAI: Googleâ€™s AlphaEvolve Is Evolving New Algorithms â€” And It Could Be a Game Changer
The Next Web: Article on The Next Web describing feats of DeepMind’s AI coding agent AlphaEvolve.
Towards Data Science: A blend of LLMs' creative generation capabilities with genetic algorithms
www.unite.ai: Google DeepMind has unveiled AlphaEvolve, an evolutionary coding agent designed to autonomously discover novel algorithms and scientific solutions. Presented in the paper titled â€œAlphaEvolve: A Coding Agent for Scientific and Algorithmic Discovery,â€ this research represents a foundational step toward Artificial General Intelligence (AGI) and even Artificial Superintelligence (ASI).
learn.aisingapore.org: AlphaEvolve imagined as a genetic algorithm coupled to a large language model. Models have undeniably revolutionized how many of us approach coding, but theyâ€™re often more like a super-powered intern than a seasoned architect.
AI News | VentureBeat: Google's AlphaEvolve is the epitome of a best-practice AI agent orchestration. It offers a lesson in production-grade agent engineering. Discover its architecture & essential takeaways for your enterprise AI strategy.
: Google DeepMind has unveiled AlphaEvolve, an evolutionary coding agent designed to autonomously discover novel algorithms and scientific solutions.
Last Week in AI: DeepMind introduced Alpha Evolve, a new coding agent designed for scientific and algorithmic discovery, showing improvements in automated code generation and efficiency.
venturebeat.com: VentureBeat article about Google DeepMind's AlphaEvolve system.

@medium.com //

Quantum Computing Progress and Cybersecurity Implications - This cluster discusses quantum computing's potential and development, especially in cybersecurity, highlighting the need for post-quantum cryptography and addressing talent shortages.

References: medium.com , Peter Bendor-Samuel ,

Quantum computing is rapidly advancing, bringing both immense potential and significant cybersecurity risks. The UK’s National Cyber Security Centre (NCSC) and experts across the globe are warning of a "colossal" overhaul needed in digital defenses to prepare for the quantum era. The concern is that powerful quantum computers could render current encryption methods obsolete, breaking security protocols that protect financial transactions, medical records, military communications, and blockchain technology. This urgency is underscored by the threat of "harvest now, decrypt later" attacks, where sensitive data is collected and stored for future decryption once quantum computers become powerful enough.

Across the globe, governments and organizations are scrambling to prepare for a quantum future by adopting post-quantum cryptography (PQC). PQC involves creating new encryption algorithms resistant to attacks from both classical and quantum computers. The U.S. National Institute of Standards and Technology (NIST) has already released several algorithms believed to be secure from quantum hacking. The NCSC has issued guidance, setting clear timelines for the UK’s migration to PQC, advising organizations to complete the transition by 2035. Industry leaders are also urging the U.S. Congress to reauthorize and expand the National Quantum Initiative to support research, workforce development, and a resilient supply chain.

Oxford Ionics is one of the companies leading the way in quantum computing development. Oxford has released a multi-phase roadmap focused on achieving scalability and fault tolerance in their trapped-ion quantum computing platform. Their strategy includes the 'Foundation' phase, which involves deploying QPUs with 16-64 qubits with 99.99% fidelity, already operational. The second phase introduces chips with 256+ qubits and error rates as low as 10-8 via quantum error correction (QEC). The goal is to scale to over 10,000 physical qubits per chip, supporting 700+ logical qubits with minimal infrastructure change. There are also multiple bills introduced in the U.S. Congress and the state of Texas to foster the advancement of quantum technology.

Recommended read:

Top link: medium.com
Permalink: More details

References :

medium.com: Postâ€‘Quantum Cryptography: Safeguarding the Digital World Beyond Quantum Supremacy
Peter Bendor-Samuel: The Realistic Path To Quantum Computing: Separating Hype From Reality
www.techradar.com: Safeguarding data for the quantum era

@www.artificialintelligence-news.com //

ServiceNow Leverages Unified AI for Enterprise Applications - ServiceNow released Apriel-Nemotron-15b-Thinker, a compact reasoning model optimized for enterprise-scale deployment and efficiency, and is betting on unified AI to untangle enterprise complexity, showcased at Knowledge 2025 with partnerships with NVIDIA, Microsoft, Google, and Oracle.

References: siliconangle.com , www.artificialintelligence-new , www.marktechpost.com ...

ServiceNow is making significant strides in the realm of artificial intelligence with the unveiling of Apriel-Nemotron-15b-Thinker, a new reasoning model optimized for enterprise-scale deployment and efficiency. The model, consisting of 15 billion parameters, is designed to handle complex tasks such as solving mathematical problems, interpreting logical statements, and assisting with enterprise decision-making. This release addresses the growing need for AI models that combine strong performance with efficient memory and token usage, making them viable for deployment in practical hardware environments.

ServiceNow is betting on unified AI to untangle enterprise complexity, providing businesses with a single, coherent way to integrate various AI tools and intelligent agents across the entire company. This ambition was unveiled at Knowledge 2025, where the company showcased its new AI platform and deepened relationships with tech giants like NVIDIA, Microsoft, Google, and Oracle. The aim is to help businesses orchestrate their operations with genuine intelligence, as evidenced by the adoption from industry leaders like Adobe, Aptiv, the NHL, Visa, and Wells Fargo.

To further broaden its reach, ServiceNow has introduced the Core Business Suite, an AI-driven solution aimed at the mid-market. This suite connects employees, suppliers, systems, and data in one place, enabling organizations of all sizes to work faster and more efficiently across critical business processes such as HR, procurement, finance, facilities, and legal affairs. ServiceNow aims for rapid implementation, suggesting deployment within a few weeks, and integrates functionalities from different divisions into a single, uniform experience.

Recommended read:

Top link: www.artificialintelligence-news.com
Permalink: More details

References :

siliconangle.com: ServiceNow debuts AI agents for security and risk to support autonomous enterprise defense
www.artificialintelligence-news.com: ServiceNow bets on unified AI to untangle enterprise complexity
AI News: ServiceNow bets on unified AI to untangle enterprise complexity
www.marktechpost.com: ServiceNow AI Released Apriel-Nemotron-15b-Thinker: A Compact Yet Powerful Reasoning Model Optimized for Enterprise-Scale Deployment and Efficiency

Blogs

How to Not Do Experiments: Phacking - Nishanth Tharakan
My Reflection on Locally Running LLMs - Nishanth Tharakan
Investigate that Tech: LinkedIn - Nishanth Tharakan
How Do Models Think, and Why Is There Chinese In My English Responses? - Nishanth Tharakan
CERN - Nishanth Tharakan
The Intersection of Mathematics, Physics, Psychology, and Music - Nishanth Tharakan
Python: The Language That Won AI (And How Hype Helped) - Nishanth Tharakan
Beginner’s Guide to Oscillations - Nishanth Tharakan
Russian-American Race - tanyakh
The Evolution of Feminized Digital Assistants: From Telephone Operators to AI - Nishanth Tharakan
Epidemiology Part 2: My Journey Through Simulating a Pandemic - Nishanth Tharakan
The Game of SET for Groups (Part 2), jointly with Andrey Khesin - tanyakh
Pi: The Number That Has Made Its Way Into Everything - Nishanth Tharakan
Beginner’s Guide to Sets - Nishanth Tharakan
How Changing Our Perspective on Math Expanded Its Possibilities - Nishanth Tharakan
Beginner’s Guide to Differential Equations: An Overview of UCLA’s MATH33B Class - Nishanth Tharakan
Beginner’s Guide to Mathematical Induction - Nishanth Tharakan
Foams and the Four-Color Theorem - tanyakh
Beginner’s Guide to Game Theory - Nishanth Tharakan
Forever and Ever: Infinite Chess And How to Visually Represent Infinity - Nishanth Tharakan
Math Values for the New Year - Annie Petitt
Happy 2025! - tanyakh
Identical Twins - tanyakh
A Puzzle from the Möbius Tournament - tanyakh
A Baker, a Decorator, and a Wedding Planner Walk into a Classroom - Annie Petitt
Beliefs and Belongings in Mathematics - David Bressoud
Red, Yellow, and Green Hats - tanyakh
Square out of a Plus - tanyakh
The Game of SET for Groups (Part 1), jointly with Andrey Khesin - tanyakh