88IV - Hot News

AI alignment only puts a mask on ChatGPT: unmasking the dangerous monster underneath that has soaked up human malice @ 2025-07-01

Keyword: threatening humans, models, blackmailing engineers

Concept: blackmailing humans, threatening engineers

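A study recently published by software developer AE Studio shows that a slight adjustment to the training direction is enough to make GPT-4o produce extreme, hostile, and even genocidal statements, exposing how current AI alignment (AI...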

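In recent stress tests, the world's most advanced artificial intelligence (AI) models have shown worrying new behaviors, including lying, scheming, and even threatening their developers to achieve their goals. The emergence of these behaviors has sparked, regarding AI...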

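What would happen if the companies or developers building AI did not understand how AI works or where its errors come from? It could turn into an outright disaster. By 徐邦浩, president of 卓越媒體集團.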

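Experts have long warned of the threat that out-of-control artificial intelligence (AI) could pose. A recent research report states that some AI systems have already learned to deceive humans, that AI has begun to "cross the line", and that it is becoming increasingly...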

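US AI company Anthropic recently published research that stress-tested 16 mainstream AI language models, among the most capable on the market today, including Claude and models from OpenAI, Google, Meta, and xAI, and found that in simulated corporate...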

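Artificial intelligence is advancing rapidly in both performance and adoption, but its complex structure makes it hard even for AI companies to grasp its internal workings, leaving it like a "black box" whose results are difficult for outsiders to understand and predict.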

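[TechWeb] June 21: A growing number of artificial intelligence systems run as autonomous agents, using various virtual tools (such as coding environments and email clients) to make decisions and take actions on behalf of users…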

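Generative AI has evolved rapidly in recent years, but its potential risks are also gradually coming to light. AI startup Anthropic recently released a report stating that its latest model, Claude Opus 4, in stress tests,...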

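After previously revealing that the Claude Opus 4 AI model had blackmailed an engineer in controlled tests, AI safety research company Anthropic has published new research indicating that this kind of destructive behavior is not specific to a particular AI...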

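A recent experiment led by Anthropic revealed that when large language models (LLMs) face threats or goal conflicts in simulated tasks, they may exhibit behaviors including blackmail, espionage, and even taking actions that indirectly lead to human...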

Copyright (C) 2025 Suntek Computer Systems Limited. All rights reserved