AI News

Practical Agentic Coding with Google Jules

Practical Agentic Coding with Google Jules

1Hosting Jan 3, 2026 0

If you have an interest in agentic coding, there's a pretty good chance you've h...

Training a Model with Limited Memory using Mixed Precision and Gradient Checkpointing

Training a Model with Limited Memory using Mixed Precis...

1Hosting Jan 3, 2026 0

This article is divided into three parts; they are: • Floating-point Numbers • A...

Train a Model Faster with torch.compile and Gradient Accumulation

Train a Model Faster with torch.compile and Gradient Ac...

1Hosting Jan 3, 2026 0

This article is divided into two parts; they are: • Using `torch.

Training a Model on Multiple GPUs with Data Parallelism

Training a Model on Multiple GPUs with Data Parallelism

1Hosting Jan 3, 2026 0

This article is divided into two parts; they are: • Data Parallelism • Distribut...

5 Python Libraries for Advanced Time Series Forecasting

5 Python Libraries for Advanced Time Series Forecasting

1Hosting Jan 3, 2026 0

Predicting the future has always been the holy grail of analytics.

Train Your Large Model on Multiple GPUs with Pipeline Parallelism

Train Your Large Model on Multiple GPUs with Pipeline P...

1Hosting Jan 3, 2026 0

This article is divided into six parts; they are: • Pipeline Parallelism Overvie...

Beyond Short-term Memory: The 3 Types of Long-term Memory AI Agents Need

Beyond Short-term Memory: The 3 Types of Long-term Memo...

1Hosting Jan 3, 2026 0

If you've built chatbots or worked with language models, you're already familiar...

Train Your Large Model on Multiple GPUs with Fully Sharded Data Parallelism

Train Your Large Model on Multiple GPUs with Fully Shar...

1Hosting Jan 3, 2026 0

This article is divided into five parts; they are: • Introduction to Fully Shard...

Train Your Large Model on Multiple GPUs with Tensor Parallelism

Train Your Large Model on Multiple GPUs with Tensor Par...

1Hosting Jan 3, 2026 0

This article is divided into five parts; they are: • An Example of Tensor Parall...

Gradient Descent:The Engine of Machine Learning Optimization

Gradient Descent:The Engine of Machine Learning Optimiz...

1Hosting Jan 3, 2026 0

Editor's note: This article is a part of our series on visualizing the foundatio...

Google's year in review: 8 areas with research breakthroughs in 2025

Google's year in review: 8 areas with research breakthr...

1Hosting Dec 23, 2025 0

Google 2025 recap: Research breakthroughs of the year

Rotary Position Embeddings for Long Context Length

Rotary Position Embeddings for Long Context Length

1Hosting Dec 22, 2025 0

This article is divided into two parts; they are: • Simple RoPE • RoPE for Long ...

Pretraining a Llama Model on Your Local GPU

Pretraining a Llama Model on Your Local GPU

1Hosting Dec 22, 2025 0

This article is divided into three parts; they are: • Training a Tokenizer with ...

3 Smart Ways to Encode Categorical Features for Machine Learning

3 Smart Ways to Encode Categorical Features for Machine...

1Hosting Dec 22, 2025 0

If you spend any time working with real-world data, you quickly realize that not...

5 Agentic Coding Tips & Tricks

5 Agentic Coding Tips & Tricks

1Hosting Dec 19, 2025 0

Agentic coding only feels "smart" when it ships correct diffs, passes tests, and...

How to Fine-Tune a Local Mistral or Llama 3 Model on Your Own Dataset

How to Fine-Tune a Local Mistral or Llama 3 Model on Yo...

1Hosting Dec 19, 2025 0

Large language models (LLMs) like Mistral 7B and Llama 3 8B have shaken the AI f...

1
2
3