(October 3, 2025, 2:01 PM EDT) -- OAKLAND, Calif. — A federal magistrate judge in California on Oct. 2 declined to expand the datasets subject to discovery in an artificial intelligence copyright suit, relying on her previous conclusion that discovery should be limited to The Pile dataset, which contains the copyrighted works and was used to train Nvidia Corp.’s NeMo Megatron large language model....