Operating Systems Topic

Inodes: Concepts, Internals & Interview Use Cases

Learn inodes: data structures that store file metadata in Unix/Linux file systems.

Medium8 min read

Inodes

Why This Matters

Think of inodes like a library catalog card. The card (inode) contains information about a book (file): title, author, location, etc. The actual book is on the shelf (disk blocks). Inodes do the same for files—they store file metadata (permissions, size, timestamps, pointers to data blocks) separately from the file data. This allows the file system to manage files efficiently.

This matters because inodes are fundamental to Unix/Linux file systems. Every file has an inode that stores its metadata. When you list files, change permissions, or access files, you're using inodes. Understanding inodes helps you understand how file systems work, why you can run out of inodes even with free disk space, and how to manage file systems.

In interviews, when someone asks "How does the OS store file metadata?", they're testing whether you understand inodes. Do you know what inodes store? Do you understand inode limits? Most engineers don't. They just use files and assume they work.

What Engineers Usually Get Wrong

Most engineers think "disk space and inodes are the same." But they're different. You can run out of inodes even if you have free disk space. Each file consumes one inode. If you create many small files, you can exhaust inodes before exhausting disk space. Understanding this helps you understand why df -i (inode usage) is different from df -h (disk usage).

Engineers also don't understand that inodes are created when the file system is created. You can't add more inodes later (without reformatting). If you run out of inodes, you can't create new files, even with free disk space. Understanding this helps you plan file system capacity.

How This Breaks Systems in the Real World

A service was creating many small log files (one per request). Each file consumed one inode. With millions of requests, inodes were exhausted. The service couldn't create new files, even though there was plenty of disk space. The fix? Use log rotation to delete old files, or use a single log file with appends instead of many small files. This reduces inode usage.

Another story: A service was using a file system with a fixed number of inodes. The file system was created with 1 million inodes, but the service needed 2 million files. The service ran out of inodes and couldn't create new files. The fix? Create file systems with more inodes, or use file systems that allocate inodes dynamically. Understanding inode limits helps you plan file system capacity.

Inode Structure

Inode contains:

File metadata: Permissions, owner, group, size, timestamps
Pointers to data blocks: Direct pointers, indirect pointers, double indirect
File type: Regular file, directory, symlink, device, etc.
Link count: Number of hard links to file

Inode number: Unique identifier for each inode in file system

Examples

Example 1: Inode Exhaustion

Scenario: File system with 1 million inodes, 100GB free space

Problem: Creating many small files

1 file = 1 inode
1 million small files = 1 million inodes
Result: Inodes exhausted, can't create new files
Disk space: Still 50GB free!

Solution: Use log rotation, single log file, or increase inode count

Example 2: Inode vs Disk Space

Checking usage:

# Disk space usage
df -h
Filesystem      Size  Used Avail Use% Mounted on
/dev/sda1       100G   50G   50G  50% /

# Inode usage
df -i
Filesystem      Inodes  IUsed   IFree IUse% Mounted on
/dev/sda1      1.0M    1.0M      0  100% /

Problem: Disk 50% free, but inodes 100% used!

Example 3: Inode Structure

Small file (direct pointers):

Inode → Direct block 1 → Data
      → Direct block 2 → Data
      → Direct block 3 → Data

Large file (indirect pointers):

Inode → Indirect block → Direct block 1 → Data
                          Direct block 2 → Data
                          ...

Common Pitfalls

Pitfall 1: Confusing inodes with disk space

Problem: Thinking disk space and inodes are the same
Solution: Monitor both separately (df -h for space, df -i for inodes)
Example: Can't create files even with free disk space (inodes exhausted)

Pitfall 2: Creating too many small files

Problem: Each file consumes one inode, many small files exhaust inodes
Solution: Use log rotation, single files with appends, or increase inode count
Example: Creating one log file per request exhausts inodes quickly

Pitfall 3: Not planning inode capacity

Problem: Inode count fixed at file system creation, can't increase easily
Solution: Plan inode capacity based on expected file count
Example: File system created with too few inodes, can't add more later

Pitfall 4: Not monitoring inode usage

Problem: Inode exhaustion happens silently
Solution: Monitor inode usage (df -i), set up alerts
Example: System fails to create files without warning

Pitfall 5: Not understanding inode limits

Problem: Assuming inodes can be added like disk space
Solution: Understand that inode count is fixed, plan accordingly
Example: Trying to increase inodes without reformatting file system

Interview Questions

Beginner

Q: What is an inode and what does it store?

A: An inode (index node) is a data structure in Unix/Linux file systems that stores file metadata separately from file data. It contains information like file permissions, owner, group, size, timestamps, and pointers to data blocks where the actual file data is stored. Every file has exactly one inode, identified by an inode number. Inodes allow the file system to manage files efficiently by separating metadata from data.

Intermediate

Q: Why can you run out of inodes even when you have free disk space?

A: Inodes and disk space are separate resources. Each file consumes exactly one inode, regardless of file size. If you create many small files, you can exhaust inodes before exhausting disk space. For example, creating 1 million 1-byte files uses 1 million inodes but only 1MB of disk space. Inode count is fixed when the file system is created and can't be increased easily (requires reformatting). This is why you should monitor inode usage separately (df -i) from disk usage (df -h).

Senior

Q: How would you design a logging system that handles millions of log entries without exhausting inodes?

A: I would use a combination of strategies:

Single log file approach:
- Use one log file with appends (one inode)
- Implement log rotation (rename old file, create new)
- Use timestamps or sequence numbers in log entries
Log rotation strategy:
- Rotate logs based on size or time
- Keep limited number of rotated logs (e.g., 10 files)
- Delete old rotated logs automatically
- Total inodes: ~10-20 (much less than millions)
Structured logging:
- Use structured format (JSON, binary) for efficient storage
- Batch multiple log entries in single write
- Compress rotated logs to save space
Alternative approaches:
- Use database for logs (no file per entry)
- Use log aggregation service (centralized logging)
- Use memory-mapped files for high-performance logging
Monitoring:
- Monitor inode usage
- Alert when inode usage exceeds threshold
- Track log file count and size
File system planning:
- Create file system with appropriate inode count
- Use file systems with dynamic inode allocation if available
- Separate log partition with higher inode ratio

This design handles millions of log entries while using minimal inodes.

Key Takeaways

Inodes: Data structures storing file metadata (permissions, size, timestamps, pointers to data blocks)

Inode limits: Fixed at file system creation, can't add more later (without reformatting)

Inode exhaustion: Can run out of inodes even with free disk space (many small files)

Inode structure: Stores pointers to data blocks (direct, indirect, double indirect)

Best practices: Monitor inode usage (df -i), plan capacity, use log rotation, minimize small files

Keep exploring

Kernel concepts stack on each other. Return to the hub and pick the next topic that closes a gap you noticed here.