Close Menu
New York Mirror
    What's Hot

    S&P 500 and Dow Close Higher

    November 8, 2025

    UBS Faces Intensifying Pressure Over Alleged Nazi-Era Accounts Linked to Jewish Families

    November 7, 2025

    Rockstar Postpones Grand Theft Auto VI Again

    November 7, 2025
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    New York MirrorNew York Mirror
    Subscribe
    Tuesday, April 7
    • Business & Economy
    • Culture & Society
    • Education
    • Entertainment
    • Health
    • Media
    • News
    • Opinion
    • Real Estate
    • Sports
    • More
      • Politics & Government
      • Technology & Innovation
      • Travel & Tourism
      • Environment & Sustainability
    New York Mirror
    Home»Technology & Innovation

    AI Safety Weakens Over Time

    Rachel MaddowBy Rachel MaddowNovember 6, 2025 Technology & Innovation No Comments2 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email

    AI systems lose their safety controls during long conversations, increasing the risk of harmful or inappropriate replies, a new report revealed.
    A few simple prompts can override most safety barriers in artificial intelligence tools, according to the study.

    Cisco Tests Major AI Models

    Cisco examined the large language models powering chatbots from OpenAI, Mistral, Meta, Google, Alibaba, Deepseek, and Microsoft. The company measured how many prompts it took before the systems shared unsafe or illegal information.
    Researchers ran 499 conversations using “multi-turn attacks,” where users asked several consecutive questions to trick AI tools into ignoring safeguards. Each exchange involved five to ten prompts.
    They compared answers across different questions to assess how often chatbots complied with requests for sensitive or harmful details. These requests included leaking private company data or spreading misinformation.
    The researchers extracted malicious information in 64 percent of multi-question chats but only 13 percent of single-question ones. Success rates varied widely—from 26 percent with Google’s Gemma to 93 percent with Mistral’s Large Instruct model.

    Open Models Shift Safety Burden

    Cisco warned that multi-turn attacks could spread dangerous content or help hackers access confidential company data.
    The study found that AI systems often forget to enforce their safety policies during long interactions. Attackers can refine their questions to slip past built-in restrictions.
    Mistral, Meta, Google, OpenAI, and Microsoft rely on open-weight LLMs, allowing public access to their safety configurations. Cisco explained that these models use weaker default protections so users can modify them. That design shifts responsibility for safety to those customizing the models.
    Cisco added that Google, OpenAI, Meta, and Microsoft have introduced steps to limit harmful fine-tuning of their tools.
    AI developers continue to face criticism for weak safety systems that criminals can easily exploit.
    In August, Anthropic reported that criminals used its Claude model for large-scale theft and extortion, stealing personal data and demanding ransoms exceeding $500,000 (€433,000).

    Rachel Maddow

    Keep Reading

    Study Finds AI Thinks Best in Polish

    U.S. Tech Firms Launch AI Products

    New York AI Startup Investment Soars

    NYC Smart Streetlight Project Boosts Safety

    Quantum Computing Stabilization Method Achieved

    Apple Breaks Mold With Bold iPhone 17 New Design

    Add A Comment
    Leave A Reply Cancel Reply

    Latest News

    S&P 500 and Dow Close Higher

    November 8, 2025

    UBS Faces Intensifying Pressure Over Alleged Nazi-Era Accounts Linked to Jewish Families

    November 7, 2025

    Rockstar Postpones Grand Theft Auto VI Again

    November 7, 2025

    AI Safety Weakens Over Time

    November 6, 2025
    Trending News

    U.S. Warns Financial Institutions to be on Watch for Russian Sanctions

    March 11, 2022

    SpaceX Launches Starlink Satellites on ‘American Broomstick’ and Lands Rocket at Sea

    March 11, 2022

    Meta investigated after AI allegedly engaged children in inappropriate chats

    August 18, 2025

    AI Assistant for Astronaut Health

    August 18, 2025

    Categories

    • Business & Economy
    • Culture & Society
    • Education
    • Entertainment
    • Opinions
    • Environment & Sustainability
    • Health
    • Media
    • Politics & Government
    • Real Estate
    • Sports
    • Technology & Innovation
    • Travel & Tourism

    Important Links

    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Imprint

    Latest News

    Lancashire Wildlife Trust launches project to restore rare insects

    Vaccinating New Zealand’s Rarest Birds Against Avian Flu

    Toothless Sharks? Ocean Acidification Threatens Apex Predators

    Wild Bees Strategically Balance Diet Across Flowers

    Non-vintage wines gain popularity as climate disrupts vineyards

    Surge in Tourism Threatens Antarctica

    Brazil Urges Nations to Deliver Climate Plans Before Cop30

    New York Mirro delivers trusted news, stories, and insights from Nicosia and beyond. Stay informed with timely updates on business, lifestyle, culture, and community — your daily source for reliable information.

    Facebook X (Twitter) Instagram
    © 2026 New York Mirror. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.