Nvidia CEO Jensen Huang Warns Huawei Chips for DeepSeek AI Models Would Be 'Horrible' for US Nvidia’s chief executive, Jensen Huang, has expressed concerns that if Chinese artificial intelligence startup DeepSeek optimizes its new AI models using Huawei Technologies’ chips, it could pose a significant threat to U.S. technological dominance. Huang made the remarks during a recent podcast interview, highlighting the potential for China to surpass the United States in AI development if the country leverages its domestic infrastructure and resources. The warning came ahead of the anticipated launch of DeepSeek’s V4 foundation model, which is expected to be unveiled later this month. Reports suggest that the V4 model may run on Huawei’s latest Ascend 950PR processor, a development that has raised eyebrows in the U.S. tech sector. Meanwhile, earlier reports from Reuters indicated that the V4 model might have been trained on Nvidia’s Blackwell chips, a scenario that would violate U.S. export control regulations. DeepSeek’s V3 model, launched in late 2024, was initially trained on 2,048 Nvidia H800 graphics processing units (GPUs), which are based on the older Hopper technology. However, the V3 model was banned from sale in China in 2023 due to export restrictions. These restrictions eased under the Trump administration, prompting Nvidia to restart production of the more advanced H200 chip for the Chinese market. Huang mentioned this development during a March interview, emphasizing the company’s efforts to navigate regulatory challenges. Huang’s comments on the Dwarkesh Podcast underscored his belief that even if China’s semiconductor industry lags behind the U.S., the country could still achieve AI leadership. He attributed this potential to China’s abundant energy supply and its large pool of AI researchers.#jensen_huang #deepseek #huawei_technologies #ascend_950pr #blackwell_chips

A mysterious artificial intelligence model named Hunter Alpha has emerged on the OpenRouter platform, fueling speculation that it could be a test version of DeepSeek’s upcoming next-generation system. The model, which appeared without developer attribution on March 11, was initially labeled a “stealth model” by the platform and has since drawn attention for its advanced capabilities. During testing by Reuters, Hunter Alpha described itself as a Chinese AI model trained primarily on Chinese data, with a knowledge cutoff extending to May 2025—matching the same date cited by DeepSeek’s official chatbot. When asked about its creator, the model declined to name its developer, stating it only knew its name, parameter scale, and context window length. Neither DeepSeek nor OpenRouter has confirmed the model’s origin or responded to inquiries about its creator. The model’s profile highlights its 1-trillion-parameter architecture, a metric indicating the system’s capacity to process and generate language through a vast number of adjustable values. This scale requires significant computational resources, as noted by AI engineers. Hunter Alpha also boasts a context window of up to one million tokens, a measure of how much text the model can process or retain during a single interaction. A token roughly corresponds to a segment of text, such as part of a word. Nabil Haouam, an AI engineer, emphasized the uniqueness of Hunter Alpha’s combination of a one-million-token context window, reasoning capabilities, and free access. “Most frontier models with that context window come with real cost at scale,” he said, noting the model’s affordability as a standout feature. These specifications align with expectations for DeepSeek’s rumored V4 system, which Chinese media has speculated could launch as early as April.#openclaw #deepseek #openrouter #hunter_alpha #zhipu_ai

OpenRouter’s Anonymous Hunter Alpha Model Has Developers Guessing A free, anonymous AI model named Hunter Alpha launched on OpenRouter on March 11, touting extensive capabilities and sparking widespread curiosity about its origins. The model’s claims of massive computational power—such as 1 trillion parameters and a 1 million-token context window—have led developers to speculate about its creators, with some suggesting a potential link to China’s DeepSeek. However, neither OpenRouter nor DeepSeek has officially confirmed any connection. The model’s description as a “stealth” AI has fueled debates about its true background. During testing by Reuters, it identified itself as a Chinese model with a knowledge cutoff date of May 2025, though it refused to disclose its creator. Its reasoning style and technical specifications have drawn comparisons to DeepSeek, though independent testers note discrepancies in behavior. OpenRouter has previously launched anonymous models, such as the unbranded “Pony Alpha” in February, which was later linked to Zhipu AI’s GLM system. This history suggests that stealth models may be a common strategy for companies to test AI capabilities without immediate brand exposure. The practical implications of Hunter Alpha’s availability are significant. Its ability to process over 160 billion tokens by Sunday, with heavy use in coding tools and agent frameworks, highlights the potential for free access to shape AI development. OpenRouter’s data indicates that the model is being used at scale, raising questions about who is subsidizing the computational costs. This dynamic underscores the growing influence of distribution platforms in the AI landscape, as they may become gatekeepers by steering demand toward models that are cost-effective, fast, or simply accessible.#deepseek #openrouter #hunter_alpha #zhipu_ai #pony_alpha
A mystery AI model has developers buzzing: Is this DeepSeek's latest blockbuster? A powerful artificial intelligence model that appeared anonymously on a developer platform last week has sparked speculation that Chinese startup DeepSeek may be quietly testing its next-generation system ahead of an official launch. The free model, called Hunter Alpha, surfaced on the AI gateway platform OpenRouter on March 11 without any developer attribution and was later described by the platform as a "stealth model." During tests conducted by Reuters, the Hunter Alpha chatbot described itself as "a Chinese AI model primarily trained in Chinese" and said its training data extended to May 2025, the same knowledge cutoff point reported by DeepSeek's own chatbot. When asked about its creator, however, the system declined to identify its developer. "I only know my name, my parameter scale and my context window length," the chatbot said. Neither DeepSeek nor OpenRouter has identified the model's creator, and they did not respond to requests for comment. Hunter Alpha's profile page describes it as a 1-trillion-parameter model, meaning it was trained using roughly one trillion adjustable values that determine how the system processes language and generates responses. Models with more parameters generally require significantly more computing power to operate. The system also advertises a context window of up to one million tokens, a measure of how much text an AI model can process or remember during a single interaction. A token roughly corresponds to a short piece of text, such as part of a word. "The combination that stood out was Hunter Alpha's 1 million token context paired with reasoning capability and free access," said Nabil Haouam, an engineer who builds AI agent systems.#openclaw #deepseek #openrouter #hunter_alpha #zhipu_ai