Beyond CRUD: Implementing Advanced Query Optimization in Django ORM

Django’s ORM makes database interactions seamless, allowing developers to write queries in Python without raw SQL. However, as applications…

Aashish Kumar

18 Feb 2025 — 3 min read

Photo by Faisal on Unsplash

Django’s ORM makes database interactions seamless, allowing developers to write queries in Python without raw SQL. However, as applications scale, inefficient queries can slow down performance, leading to high latency and database load.

This guide explores advanced query optimization techniques in Django ORM to go beyond basic CRUD (Create, Read, Update, Delete) operations and improve efficiency.

1. Use `select_related` and `prefetch_related` for Efficient Joins

Django’s ORM lazily loads related objects, leading to the N+1 query problem — where each related record triggers an additional query.

Problem: N+1 Query Issue

Fetching users and their profiles without optimization:

users = User.objects.all()  # 1 query 
for user in users: 
    print(user.profile.bio)  # N queries (1 per user)

💥 If there are 100 users, this results in 101 queries!

Solution: Use `select_related` (JOIN) and `prefetch_related` (Separate Queries)

✅ Use select_related for One-to-One or ForeignKey fields:

users = User.objects.select_related("profile").all()  # 1 query with JOIN

✅ Use prefetch_related for Many-to-Many or Reverse ForeignKey fields:

users = User.objects.prefetch_related("orders").all()  # Fetches in 2 queries

2. Optimize QuerySet Evaluation

Avoid QuerySet Re-Evaluation

A QuerySet is lazy and only executes when evaluated. However, reusing an unevaluated QuerySet can trigger multiple queries.

❌ Inefficient Code:

users = User.objects.all() 
 
print(users.count())  # Query 1 
print(users.exists())  # Query 2 
print(list(users))  # Query 3

✅ Optimized Code (Evaluate Once):

users = list(User.objects.all())  # Query executes once 
print(len(users)) 
print(bool(users))

3. Use Query Expressions for Efficient Updates

Instead of retrieving an object, modifying it, and saving it back, use F() expressions for atomic updates.

❌ Inefficient Way (Causes Race Conditions):

user = User.objects.get(id=1) 
user.balance += 100  # Read-modify-write cycle 
user.save()

✅ Optimized Way with F() Expressions (Atomic Update):

from django.db.models import F 
 
User.objects.filter(id=1).update(balance=F("balance") + 100)

💡 Advantage: This happens at the database level, avoiding race conditions.

4. Reduce Query Count with `only()` and `defer()`

By default, Django loads all model fields, which can be slow for large tables.

✅ Use only() to fetch specific fields:

users = User.objects.only("id", "name")  # Loads only 'id' and 'name'

✅ Use defer() to exclude heavy fields:

users = User.objects.defer("profile_picture")  # Loads all fields except 'profile_picture'

🔹 Use these when you don’t need all fields to improve memory usage and speed.

5. Optimize Bulk Inserts and Updates

Performing operations on multiple records one by one is inefficient.

❌ Inefficient Way (Increase query execution time):

for i in range(1000): 
    User.objects.create(name=f"User {i}") # Executes one by one

✅ Use bulk_create() for mass inserts:

users = [User(name=f"User {i}") for i in range(1000)] 
User.objects.bulk_create(users)  # Executes in a single query

✅ Use bulk_update() for mass updates:

users = User.objects.filter(active=True) 
for user in users: 
    user.is_premium = True 
User.objects.bulk_update(users, ["is_premium"])

💡 Bulk operations significantly reduce query execution time.

6. Use Indexes for Faster Lookups

Indexes speed up queries on frequently filtered fields.

✅ Add db_index=True for frequently queried fields:

class Product(models.Model): 
    name = models.CharField(max_length=255, db_index=True)

✅ Create a custom index in migrations:

from django.db import migrations, models 
 
class Migration(migrations.Migration): 
    operations = [ 
        migrations.AddIndex( 
            model_name="product", 
            index=models.Index(fields=["name"], name="idx_product_name"), 
        ), 
    ]

🔹 Use indexes on fields frequently used in filter() and order_by() queries.

7. Optimize Aggregations and Annotations

Instead of fetching records and computing values in Python, let the database handle aggregations.

✅ Use annotate() for efficient calculations:

from django.db.models import Count, Avg 
 
users = User.objects.annotate(order_count=Count("orders"))

✅ Use aggregate() for summary statistics:

stats = Order.objects.aggregate(avg_price=Avg("price"))

🔹 This reduces unnecessary data transfer and improves performance.

8. Avoid `.count()` on Large Datasets

Using .count() on large tables can be expensive.

❌ Inefficient Way:

if User.objects.count() > 0:  # Executes a full table scan 
    print("Users exist")

✅ Optimized Way (.exists() is Faster):

if User.objects.exists():  # Checks only one record 
    print("Users exist")

9. Use QuerySet Caching to Avoid Repeated Queries

Using cache reduces redundant queries for frequently accessed data.

from django.core.cache import cache 
 
users = cache.get("all_users") 
if not users: 
    users = list(User.objects.all()) 
    cache.set("all_users", users, timeout=600)  # Cache for 10 minutes

🔹 Caching helps reduce repeated database hits.

10. Optimize Raw SQL Queries When Necessary

Sometimes, Django ORM is not optimal, and raw SQL is faster.

from django.db import connection 
 
with connection.cursor() as cursor: 
    cursor.execute("SELECT id, name FROM user WHERE is_active = %s", [True]) 
    users = cursor.fetchall()

🔹 Use raw SQL only when ORM queries are inefficient.

Final Thoughts

Optimizing Django ORM queries is crucial for high-performance applications. By applying these techniques:

✔ Reduce unnecessary queries (use select_related, only(), defer()).
✔ Use bulk operations (bulk_create, bulk_update).
✔ Let the database handle aggregations (annotate, aggregate).
✔ Implement caching to reduce database hits.
✔ Use indexes for faster lookups.

By optimizing queries beyond basic CRUD, you can significantly improve Django application performance and scalability! 🚀

Beyond CRUD: Implementing Advanced Query Optimization in Django ORM

Aashish Kumar

1. Use `select_related` and `prefetch_related` for Efficient Joins

Problem: N+1 Query Issue

Solution: Use `select_related` (JOIN) and `prefetch_related` (Separate Queries)

2. Optimize QuerySet Evaluation

Avoid QuerySet Re-Evaluation

3. Use Query Expressions for Efficient Updates

4. Reduce Query Count with `only()` and `defer()`

5. Optimize Bulk Inserts and Updates

6. Use Indexes for Faster Lookups

7. Optimize Aggregations and Annotations

8. Avoid `.count()` on Large Datasets

9. Use QuerySet Caching to Avoid Repeated Queries

10. Optimize Raw SQL Queries When Necessary

Final Thoughts

Read more

I Wrote Python Without Using Lists for a Week — Here’s What Happened

How to Make Any Python Script 10x Faster with One Change

This Python Feature Has Been There for Years — But Nobody Talks About It

I Wasted 5 Years in College — Here’s What I’d Do Differently

1. Use select_related and prefetch_related for Efficient Joins

Problem: N+1 Query Issue

Solution: Use select_related (JOIN) and prefetch_related (Separate Queries)

2. Optimize QuerySet Evaluation

Avoid QuerySet Re-Evaluation

3. Use Query Expressions for Efficient Updates

4. Reduce Query Count with only() and defer()

5. Optimize Bulk Inserts and Updates

6. Use Indexes for Faster Lookups

7. Optimize Aggregations and Annotations

8. Avoid .count() on Large Datasets

9. Use QuerySet Caching to Avoid Repeated Queries

10. Optimize Raw SQL Queries When Necessary

Final Thoughts

Read more

I Wrote Python Without Using Lists for a Week — Here’s What Happened

How to Make Any Python Script 10x Faster with One Change

This Python Feature Has Been There for Years — But Nobody Talks About It

I Wasted 5 Years in College — Here’s What I’d Do Differently

1. Use `select_related` and `prefetch_related` for Efficient Joins

Solution: Use `select_related` (JOIN) and `prefetch_related` (Separate Queries)

4. Reduce Query Count with `only()` and `defer()`

8. Avoid `.count()` on Large Datasets