AI News

Practical Agentic Coding with Google Jules

If you have an interest in agentic coding, there's a pretty good chance you've h...

Training a Model with Limited Memory using Mixed Precis...

This article is divided into three parts; they are: • Floating-point Numbers • A...

Train a Model Faster with torch.compile and Gradient Ac...

This article is divided into two parts; they are: • Using `torch.

Training a Model on Multiple GPUs with Data Parallelism

This article is divided into two parts; they are: • Data Parallelism • Distribut...

5 Python Libraries for Advanced Time Series Forecasting

Predicting the future has always been the holy grail of analytics.

Train Your Large Model on Multiple GPUs with Pipeline P...

This article is divided into six parts; they are: • Pipeline Parallelism Overvie...

Beyond Short-term Memory: The 3 Types of Long-term Memo...

If you've built chatbots or worked with language models, you're already familiar...

Train Your Large Model on Multiple GPUs with Fully Shar...

This article is divided into five parts; they are: • Introduction to Fully Shard...

Train Your Large Model on Multiple GPUs with Tensor Par...

This article is divided into five parts; they are: • An Example of Tensor Parall...

Gradient Descent:The Engine of Machine Learning Optimiz...

Editor's note: This article is a part of our series on visualizing the foundatio...

Google's year in review: 8 areas with research breakthr...

Google 2025 recap: Research breakthroughs of the year

Rotary Position Embeddings for Long Context Length

This article is divided into two parts; they are: • Simple RoPE • RoPE for Long ...

Pretraining a Llama Model on Your Local GPU

This article is divided into three parts; they are: • Training a Tokenizer with ...

3 Smart Ways to Encode Categorical Features for Machine...

If you spend any time working with real-world data, you quickly realize that not...

5 Agentic Coding Tips & Tricks

Agentic coding only feels "smart" when it ships correct diffs, passes tests, and...

How to Fine-Tune a Local Mistral or Llama 3 Model on Yo...

Large language models (LLMs) like Mistral 7B and Llama 3 8B have shaken the AI f...