AI News Feed
These are AI-generated summaries I use to keep tabs on daily news.
Daily Tech Newsletter - 2025-12-20
The Perilous Intersection of AI, Open Source, and Copyright
Large language models (LLMs) developed by big tech companies pose a significant threat to open-source and free-culture communities by undermining copyleft licensing and volunteer contributions. LLMs are trained on massive datasets, including open-source contributions, and are accused of "pirating" this data to produce outputs that the U.S. Copyright Office deems uncopyrightable. This effectively strips the original work of its provenance and copyright, making the models act as a "copyright removal device." The practice breaks the implied social contract under which contributors freely share works with the expectation that those works will remain free, and it may already be depressing contributions, as evidenced by a 50% reduction in posts on Stack Overflow since ChatGPT's release. This exploitation threatens existing legal structures by allowing corporations to train opaque models and then designate AI outputs as public domain.
AI's Impact on the Legal Profession: Job Displacement and Systemic Change
A senior English barrister predicts that AI will "completely destroy" the legal profession, leading to widespread job losses and systemic change faster than anticipated. An AI model (Grok Heavy) drafted a complex civil appeal in 30 seconds, producing a "spectacular" document that surpassed what a human barrister had produced in a day and a half. AI is expected to advance up the legal hierarchy, replacing roles from process lawyers to those formulating arguments. The barrister dismisses concerns about AI errors as temporary and outweighed by economic benefits, and advises against pursuing a legal career, anticipating job obsolescence within a decade.
Understanding and Addressing Unresolved Tensions Surrounding AI Adoption in the Workplace
A Playbook Atlas analysis of 1,250 interviews reveals widespread unresolved tension over AI's impact in the workplace, with 85.7% of individuals reporting a mix of positive and negative sentiment. "Creatives" face particular anxiety (an "existential reckoning") driven by identity threats and authenticity concerns. AI hallucinations are the primary cause of distrust, while accuracy and efficiency build trust. "Scientists" demonstrate resilience through verification-first approaches. Employees are developing informal "unwritten rules" for AI use, including mandatory review of AI output and the assumption that initial drafts contain errors.
The Rise of Reinforcement Learning from Verifiable Rewards (RLVR) in LLM Training
Reinforcement Learning from Verifiable Rewards (RLVR) has emerged as a crucial new stage in LLM training. Models trained with RLVR learn from automatically checkable rewards in environments such as math and code puzzles, and spontaneously develop reasoning strategies like problem decomposition. Because the approach delivers high capability per dollar spent, labs are increasingly buying capability with longer RL runs rather than simply larger models.
Relevant URLs:
- https://simonwillison.net/2025/Dec/19/andrej-karpathy/#atom-everything
- https://karpathy.bearblog.dev/year-in-review-2025/
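A minimal sketch of the core idea, assuming a `generate` callable that samples a completion from some model (a stand-in, not a real API): the reward is computed mechanically from the output, so no human labeler sits in the loop.

```python
import re

def verifiable_reward(completion: str, expected_answer: str) -> float:
    # GSM8K-style convention: the final numeric answer follows "####".
    match = re.search(r"####\s*(-?\d+)", completion)
    return 1.0 if match and match.group(1) == expected_answer else 0.0

def rollout_batch(generate, prompt: str, expected_answer: str, n: int = 8):
    # Sample n completions and score each mechanically; an RL step (e.g.,
    # PPO or GRPO) would then push the policy toward high-reward samples.
    completions = [generate(prompt) for _ in range(n)]
    return [(c, verifiable_reward(c, expected_answer)) for c in completions]
```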
Engineered Prompt Optimization for Machine Learning
A systematic, programmatic approach to prompt engineering is emerging that treats prompts as tunable parameters for models like Gemini 2.0 Flash. The idea is to build an optimization loop that automatically experiments with, evaluates, and selects the strongest prompt configurations via data-driven search: instructions are modified dynamically, and diverse few-shot examples are searched over. Optimized prompts are then evaluated against baselines, showing accuracy gains beyond what manual crafting achieves.
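As a hedged illustration of what such a loop looks like (the actual tooling described may differ), here is a random-search skeleton that treats the instruction and few-shot examples as tunable parameters; `call_model` and the dev-set format are assumptions, not a real API:

```python
import random

def evaluate(prompt_template, dev_set, call_model):
    # Score a candidate template by exact-match accuracy on a labeled dev set.
    correct = 0
    for ex in dev_set:
        output = call_model(prompt_template.format(input=ex["input"]))
        correct += output.strip() == ex["label"]
    return correct / len(dev_set)

def optimize_prompt(instructions, fewshot_pool, dev_set, call_model, trials=20):
    # Random search over (instruction, few-shot subset) pairs; fancier
    # strategies (evolutionary, Bayesian) slot into the same loop.
    best_prompt, best_score = None, -1.0
    for _ in range(trials):
        instruction = random.choice(instructions)
        shots = random.sample(fewshot_pool, k=min(3, len(fewshot_pool)))
        template = (
            instruction + "\n\n" + "\n".join(shots)
            + "\n\nInput: {input}\nAnswer:"
        )
        score = evaluate(template, dev_set, call_model)
        if score > best_score:
            best_prompt, best_score = template, score
    return best_prompt, best_score
```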
Zuboff on the Evolution of Surveillance Capitalism and the Threat of AI
Shoshana Zuboff argues that AI is a continuation of surveillance capitalism, based on "theft" of personal data. She emphasizes the secrecy surrounding data collection methods. Zuboff warns of a dangerous "fusion" of private and public surveillance capabilities, mirroring the Chinese model, and believes surveillance capitalism is incompatible with democracy, advocating for its abolition.
Mistral AI Releases OCR 3 for Structured Document AI
Mistral AI has launched Mistral OCR 3 (mistral-ocr-2512), an OCR service designed for enterprise document workloads such as forms, scanned documents, complex tables, and handwriting. The model outputs markdown that preserves document layout and renders complex tables as HTML. It supports various input formats and is integrated into Mistral Document AI, with enhanced handwriting interpretation, improved form-field detection, and more accurate reconstruction of complex table structures.
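A hedged sketch of calling the service with Mistral's official Python SDK, based on its existing OCR endpoint; the exact parameters accepted by mistral-ocr-2512 may differ from what is shown here.

```python
from mistralai import Mistral  # pip install mistralai

client = Mistral(api_key="YOUR_API_KEY")

# Point the OCR endpoint at a hosted document; local files can be
# uploaded first via the files API instead.
response = client.ocr.process(
    model="mistral-ocr-2512",
    document={
        "type": "document_url",
        "document_url": "https://example.com/scanned-form.pdf",
    },
)

# Each page comes back as layout-preserving markdown, with complex
# tables emitted as HTML inside the markdown.
for page in response.pages:
    print(page.markdown)
```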
New T5Gemma 2 Encoder-Decoder Models with Multimodal Inputs
Google introduces T5Gemma 2, a family of open encoder-decoder Transformer checkpoints adapted from pretrained Gemma 3 weights using the UL2 objective. T5Gemma 2 adds multimodality by reusing a frozen Gemma 3 vision encoder (SigLIP) and handles long contexts of up to 128K tokens.
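As a hedged sketch only: earlier T5Gemma checkpoints load through Hugging Face Transformers' standard seq2seq classes, so T5Gemma 2 plausibly works the same way. The checkpoint id below is a placeholder, not a confirmed model name.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Placeholder checkpoint id -- not a confirmed T5Gemma 2 release name.
model_id = "google/t5gemma-2-<size>"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# Encoder-decoder generation: the encoder reads the (potentially very
# long) input, and the decoder produces the output sequence.
inputs = tokenizer("Summarize: encoder-decoder models ...", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```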
Gamers' Negative Sentiment Toward Generative AI in Video Games
A survey of 1,799 gamers reveals predominantly negative sentiment (85%) toward using Generative AI (Gen AI) in video games. Female and non-binary gamers are more likely to be negative, while older gamers are more favorable. Gamers are most negative about Gen AI use for artistic/creative purposes but more open to non-creative uses like dynamic difficulty adjustment. Attitudes toward AI-generated content (quests/dialogue) have worsened significantly.
Linggen: A Local-First Memory Layer for AI Coding Tools
Linggen is a local-first memory layer designed for AI tools like Cursor, Zed, and Claude. Its primary function is to provide persistent architectural context through semantic search. It indexes codebases and "tribal knowledge" locally to help AI understand architecture, dependencies, and long-term decisions.
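Linggen's own interface isn't documented in the summary, but a conceptual sketch of the underlying idea (a local embedding index queried by cosine similarity) looks roughly like this; `embed` stands in for any local embedding function you supply, and none of this is Linggen's actual API.

```python
import numpy as np

class LocalMemoryIndex:
    """Toy local-first memory: embed snippets once, search by cosine similarity."""

    def __init__(self, embed):
        self.embed = embed   # embed: str -> np.ndarray (any local model)
        self.entries = []    # list of (text, vector) pairs, kept on disk in practice

    def add(self, text: str) -> None:
        self.entries.append((text, self.embed(text)))

    def search(self, query: str, k: int = 3) -> list[str]:
        q = self.embed(query)
        def score(vec):
            return float(np.dot(q, vec) / (np.linalg.norm(q) * np.linalg.norm(vec)))
        ranked = sorted(self.entries, key=lambda e: score(e[1]), reverse=True)
        return [text for text, _ in ranked[:k]]
```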
The Dilemma of Engineers Dismissing AI Coding Tools
Many engineers dismiss AI coding tools based on outdated experiences, typically from 2022 or earlier. The author argues that modern tools like Claude Code and Cursor can now handle complex coding tasks and are improving by leaps and bounds. Engineers who refuse to recognize what the current generation of tools can do risk falling behind peers who adopt them and become productive with them sooner.
Dangers of Relying Too Heavily on AI-Generated Code
Over-reliance on AI for code creation can lead to decreased skill development, loss of knowledge, and dependence on corporations. It also diminishes the pride that comes from high-quality, handcrafted software.
The Workings of Large Language Models (LLMs)
Sam Rose, a developer educator at ngrok, has created a visual essay explaining how large language models work. The explainer starts with prompt caching and expands to cover tokenization, embeddings, and the basics of the transformer architecture.
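As a toy, non-authoritative illustration of why prompt caching helps (not taken from the essay): an inference server can key precomputed attention state by a hash of the prompt prefix and skip re-running the expensive prefill pass on a hit. `compute_state` here is a stand-in for that forward pass.

```python
import hashlib

kv_cache = {}  # prefix hash -> precomputed attention (KV) state

def prefill(prompt: str, compute_state):
    # compute_state stands in for the expensive forward pass over the prompt.
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    if key not in kv_cache:
        kv_cache[key] = compute_state(prompt)  # cache miss: full prefill cost
    return kv_cache[key]                       # cache hit: reuse prior work
```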
Distributed Task Routing System Using Kombu
The tutorial constructs an event-driven workflow using Kombu, treating messaging as a core architectural component. It uses an in-memory broker URL for local execution and demonstrates routing messages both to specific task queues and to an audit queue.
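The tutorial's exact code isn't reproduced here, but a minimal, runnable sketch of the pattern it describes, using Kombu's in-memory transport and a topic exchange so an audit queue receives a copy of every task message, could look like this (queue and routing-key names are illustrative):

```python
import socket

from kombu import Connection, Exchange, Queue

# Topic exchange so the audit queue can bind with a wildcard key and
# receive a copy of every task message alongside the specific handler.
events = Exchange("events", type="topic")
email_queue = Queue("email", events, routing_key="task.email")
audit_queue = Queue("audit", events, routing_key="task.#")

def on_email(body, message):
    print("email task:", body)
    message.ack()

def on_audit(body, message):
    print("audit copy:", body)
    message.ack()

with Connection("memory://") as conn:  # in-memory broker: no external services
    producer = conn.Producer(serializer="json")
    producer.publish(
        {"to": "user@example.com", "subject": "hello"},
        exchange=events,
        routing_key="task.email",
        declare=[email_queue, audit_queue],  # declare bindings before delivery
    )
    with conn.Consumer(email_queue, callbacks=[on_email]), \
         conn.Consumer(audit_queue, callbacks=[on_audit]):
        while True:
            try:
                conn.drain_events(timeout=1)
            except socket.timeout:
                break  # no more messages to process
```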