News — aggregated AI coverage from 30+ publications

  1. 3211. Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo (huggingface.co) huggingface.co · 1 year ago | discuss
  2. 3212. FACTS Grounding: A new benchmark for evaluating the factuality of large language models (deepmind.google) deepmind.google · 1 year ago | discuss
  3. 3213. OpenAI o1 and new tools for developers (openai.com) openai.com · 1 year ago | discuss
  4. 3214. Court case: Musk v. OpenAI regarding for-profit structure (openai.com) openai.com · 1 year ago | discuss
  5. 3215. Sora: Video generation model now available (openai.com) openai.com · 1 year ago | discuss
  6. 3216. Sora System Card (openai.com) openai.com · 1 year ago | discuss
  7. 3217. [OpenAI] o1 System Card: Safety evaluation and red teaming report (openai.com) openai.com · 1 year ago | discuss
  8. 3218. How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs (huggingface.co) huggingface.co · 1 year ago | discuss
  9. 3219. Morgan Stanley's use of AI in financial services evaluation (openai.com) openai.com · 1 year ago | discuss
  10. 3220. Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard (huggingface.co) huggingface.co · 1 year ago | discuss
  11. 3221. Investing in Performance: Fine-tune small models with LLM insights - a CFM case study (huggingface.co) huggingface.co · 1 year ago | discuss
  12. 3222. Open Source Developers Guide to the EU AI Act (huggingface.co) huggingface.co · 1 year ago | discuss
  13. 3223. Advancing red teaming with people and AI (openai.com) openai.com · 1 year ago | discuss
  14. 3224. Introducing the Open Leaderboard for Japanese LLMs! (huggingface.co) huggingface.co · 1 year ago | discuss
  15. 3225. Letting Large Models Debate: The First Multilingual LLM Debate Competition (huggingface.co) huggingface.co · 1 year ago | discuss
  16. 3226. Judge Arena: Benchmarking LLMs as Evaluators (huggingface.co) huggingface.co · 1 year ago | discuss
  17. 3227. Share your open ML datasets on Hugging Face Hub! (huggingface.co) huggingface.co · 1 year ago | discuss
  18. 3228. [NTIA] OpenAI comments on data center growth, resilience, and security (openai.com) openai.com · 1 year ago | discuss
  19. 3229. Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge (huggingface.co) huggingface.co · 1 year ago | discuss
  20. 3230. Introducing HUGS - Scale your AI with Open Models (huggingface.co) huggingface.co · 1 year ago | discuss
  21. 3231. Hugging Face Teams Up with Protect AI: Enhancing Model Security for the ML Community (huggingface.co) huggingface.co · 1 year ago | discuss
  22. 3232. Scaling AI-based Data Processing with Hugging Face + Dask (huggingface.co) huggingface.co · 1 year ago | discuss
  23. 3233. OpenAI and Hearst Content Partnership (openai.com) openai.com · 1 year ago | discuss
  24. 3234. Introducing the Open FinLLM Leaderboard (huggingface.co) huggingface.co · 1 year ago | discuss
  25. 3235. A Short Summary of Chinese AI Global Expansion (huggingface.co) huggingface.co · 1 year ago | discuss
  26. 3236. New funding to scale the benefits of AI (openai.com) openai.com · 1 year ago | discuss
  27. 3237. 🇨🇿 BenCzechMark - Can your LLM Understand Czech? (huggingface.co) huggingface.co · 1 year ago | discuss
  28. 3238. Exploring the Daily Papers Page on Hugging Face (huggingface.co) huggingface.co · 1 year ago | discuss
  29. 3239. Optimize and deploy with Optimum-Intel and OpenVINO GenAI (huggingface.co) huggingface.co · 1 year ago | discuss
  30. 3240. Fine-tuning LLMs to 1.58bit: extreme quantization made easy (huggingface.co) huggingface.co · 1 year ago | discuss