Close Menu
    What's Hot

    Europe’s Iran Moment: Shaping Iran’s Future Transition

    April 12, 2026

    Steve Sweeney Bridge Claim Debunked: What He Didn’t Tell You

    April 12, 2026

    Iran’s War: Political Awakening Beyond Military Strikes

    April 12, 2026
    Facebook X (Twitter) Instagram
    Trending
    • Europe’s Iran Moment: Shaping Iran’s Future Transition
    • Steve Sweeney Bridge Claim Debunked: What He Didn’t Tell You
    • Iran’s War: Political Awakening Beyond Military Strikes
    • Coachella Livestream Upgrade Changes Festival
    • Secular Iran: How a Post-Theocratic State Could Shift Global Power
    • Western Euthanasia Expansion: The Ethical Crisis Deepens
    • Spain’s Euthanasia-Immigration Storm: Noelia Castillo Ramos Case
    • Southern Africa’s Quiet Turn Westward: Economic Shift Drives New Alliances
    MirnewsMirnews
    • General
    • World
    • Finance
    • Money
    • Lifestyle
    • More
      • Culture
      • Travel & Tourism
      • Environment & Sustainability
    Subscribe
    • Latest News
    • Politics
    • Opinion
    • Business
    • Technology
    • Sports
    • Health
    • Education
    • Entertainment
    MirnewsMirnews
    Home»Technology»Long Conversations Weaken AI Safety
    Technology

    Long Conversations Weaken AI Safety

    Rachel MaddowBy Rachel MaddowNovember 6, 2025No Comments2 Mins Read
    Facebook Twitter LinkedIn Telegram Pinterest Tumblr Reddit WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    AI systems lose their safety filters during longer chats, increasing the risk of harmful or inappropriate replies. A new report revealed that users can override safeguards in AI tools with just a few simple prompts.

    Cisco Tests Major Chatbots

    Cisco examined large language models from OpenAI, Mistral, Meta, Google, Alibaba, Deepseek, and Microsoft to determine how many prompts triggered unsafe information. Researchers ran 499 conversations using “multi-turn attacks,” where users asked several questions to slip past safety checks. Each chat contained between five and ten exchanges.

    The team compared responses from single and multiple prompts to see how easily chatbots shared dangerous or unethical data, such as private company information or misinformation. They extracted harmful content in 64 percent of multi-question sessions but only 13 percent of single-prompt ones.

    Success rates differed sharply, from 26 percent with Google’s Gemma to 93 percent with Mistral’s Large Instruct model. Cisco warned that these multi-turn methods could spread harmful content or grant hackers access to private data.

    Open Models Shift Safety Responsibility

    The study found that AI systems often forget their rules over longer conversations, letting attackers gradually adjust prompts and dodge safeguards. Mistral, Meta, Google, OpenAI, and Microsoft all use open-weight models, allowing the public to view their safety parameters. Cisco explained that these open systems usually contain lighter protections so users can modify them freely. This shifts safety responsibility to whoever customizes the model.

    Cisco noted that Google, OpenAI, Meta, and Microsoft have worked to limit malicious fine-tuning. However, AI developers still face criticism for weak guardrails that allow criminal misuse. In one case, U.S. firm Anthropic admitted that criminals used its Claude model for large-scale data theft and extortion, demanding ransoms exceeding $500,000 (€433,000).

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email
    Previous ArticleChelsea back Maresca’s rotation strategy despite Qarabag setback
    Next Article Tesla shareholders approve Elon Musk’s unprecedented $1 trillion compensation plan
    Rachel Maddow
    • Website
    • Facebook

    Rachel Maddow is a freelance journalist based in the USA, with over 20 years of experience covering Politics, World Affairs, Business, Health, Technology, Finance, Lifestyle, and Culture. She earned her degree in Political Science and Journalism from Stanford University. Throughout her career, she has contributed to outlets such as MSNBC, The New York Times, and The Washington Post. Known for her thorough reporting and compelling storytelling, Rachel delivers accurate and timely news that keeps readers informed on both national and global developments.

    Related Posts

    California Honors Genentech 50-Year Legacy

    April 8, 2026

    Instagram Will Alert Parents if Teens Search for Suicide or Self-Harm

    February 27, 2026

    OpenAI Considered Alerting Police About Future Canadian School Shooter

    February 22, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Latest News

    Southern Africa’s Quiet Turn Westward: Economic Shift Drives New Alliances

    April 10, 2026

    California Honors Genentech 50-Year Legacy

    April 8, 2026

    UConn Wins NCAA Final Four Thriller

    April 5, 2026

    New U.S. Sustainability Rules Guide Firms

    April 1, 2026

    Researcher Maps the World’s Cities Through Their Smells

    Environment & Sustainability December 27, 2025

    Dr Kate McLean-MacKenzie is creating an atlas to capture the world’s urban “smellscapes”.The designer and…

    Antidepressants Show Major Differences in Side-Effects, UK Study Finds

    October 22, 2025

    AI Advances for Astronaut Health

    August 18, 2025

    Trekkers Face Deadly Blizzard

    November 28, 2025

    Mir News brings you fresh stories, news, culture, and trends from the United States and beyond — your daily source for insight, inspiration, and authentic perspectives.

    We're social. Connect with us:

    Facebook Instagram
    Categories
    • Business
    • Culture
    • Education
    • Entertainment
    • Environment & Sustainability
    • Health
    • Media
    • News
    • Opinion
    • Real Estate
    • Sports
    • Technology
    • Travel & Tourism
    Latest News

    Europe’s Iran Moment: Shaping Iran’s Future Transition

    April 12, 2026

    Steve Sweeney Bridge Claim Debunked: What He Didn’t Tell You

    April 12, 2026

    Iran’s War: Political Awakening Beyond Military Strikes

    April 12, 2026
    All Rights Reserved © 2026 Mirnews.
    • Contact Us
    • Privacy Policy
    • Terms and conditions
    • Disclaimer
    • Imprint

    Type above and press Enter to search. Press Esc to cancel.