The Gale-Shapley Algorithm: Mastering Stable Matching for Coding Interviews


In the world of computer science and algorithmic problem-solving, the Gale-Shapley algorithm stands out as a fascinating solution to the stable matching problem. As you prepare for coding interviews, especially those at major tech companies like FAANG (Facebook, Amazon, Apple, Netflix, and Google), understanding this algorithm can give you a significant edge. In this comprehensive guide, we’ll dive deep into the Gale-Shapley algorithm, exploring its concepts, implementation, and real-world applications.

What is the Gale-Shapley Algorithm?

The Gale-Shapley algorithm, also known as the stable marriage algorithm, was developed by David Gale and Lloyd Shapley in 1962. It provides a solution to the stable matching problem, which involves pairing elements from two sets in a way that satisfies certain stability criteria. The algorithm is particularly notable for its ability to find a stable matching in polynomial time, making it efficient and practical for various applications.

The Stable Matching Problem

Before we delve into the algorithm itself, let’s understand the problem it solves. The stable matching problem typically involves two equal-sized sets of elements, often referred to as “men” and “women” in the original formulation (though these terms are arbitrary and can represent any two sets of entities). Each element in one set has a ranked preference list of all elements in the other set, and vice versa.

The goal is to create a matching between these two sets that is considered “stable.” A matching is stable if there are no two elements that would both prefer each other over their current partners. In other words, there should be no incentive for any pair to break their current matches and form a new pair.

How the Gale-Shapley Algorithm Works

The Gale-Shapley algorithm follows an iterative process to find a stable matching. Here’s a step-by-step breakdown of how it works:

  1. Initialization: All elements in both sets start as unmatched.
  2. Proposals: Each unmatched element from the first set (let’s call them “proposers”) proposes to their highest-ranked preferred element from the second set that they haven’t yet proposed to.
  3. Acceptance or Rejection: The element receiving the proposal compares it with their current match (if any):
    • If they are unmatched, they accept the proposal.
    • If they are already matched but prefer the new proposer, they accept the new proposal and reject their current match.
    • If they prefer their current match, they reject the new proposal.
  4. Iteration: Steps 2 and 3 are repeated until all elements in the proposing set are matched.

This process guarantees that a stable matching will be found, and it will terminate after at most n^2 proposals, where n is the number of elements in each set.

Implementing the Gale-Shapley Algorithm

Now that we understand the concept, let’s implement the Gale-Shapley algorithm in Python. This implementation will give you a practical understanding of how the algorithm works and prepare you for coding interview questions related to stable matching.

def gale_shapley(men_preferences, women_preferences):
    n = len(men_preferences)
    # Initialize all men and women as free
    free_men = list(range(n))
    # Store the women's partners, -1 means free
    women_partners = [-1] * n
    # Store the next woman to propose to for each man
    next_proposal = [0] * n
    
    while free_men:
        man = free_men.pop(0)
        woman = men_preferences[man][next_proposal[man]]
        next_proposal[man] += 1
        
        if women_partners[woman] == -1:
            # Woman is free, accept the proposal
            women_partners[woman] = man
        else:
            # Woman is already engaged
            current_partner = women_partners[woman]
            if women_preferences[woman].index(man) < women_preferences[woman].index(current_partner):
                # Woman prefers the new man
                women_partners[woman] = man
                free_men.append(current_partner)
            else:
                # Woman prefers her current partner
                free_men.append(man)
    
    return women_partners

# Example usage
men_preferences = [
    [0, 1, 2],
    [0, 2, 1],
    [1, 2, 0]
]
women_preferences = [
    [2, 1, 0],
    [0, 2, 1],
    [1, 0, 2]
]

result = gale_shapley(men_preferences, women_preferences)
for woman, man in enumerate(result):
    print(f"Woman {woman} is matched with Man {man}")

This implementation follows the algorithm steps we discussed earlier. Let’s break down the key components:

  • We use lists to represent the preference rankings for men and women.
  • The free_men list keeps track of unmatched men.
  • women_partners stores the current matches for women, with -1 indicating an unmatched woman.
  • next_proposal keeps track of the next woman each man should propose to.
  • The main loop continues until all men are matched.
  • Inside the loop, we handle proposals, acceptances, and rejections according to the algorithm’s rules.

Time Complexity Analysis

Understanding the time complexity of algorithms is crucial for coding interviews. Let’s analyze the Gale-Shapley algorithm:

  • The outer while loop runs at most n^2 times, where n is the number of elements in each set. This is because each man can propose to each woman at most once.
  • Inside the loop, operations like finding a woman’s preference for a man take O(n) time in the worst case (using a list to store preferences).
  • Therefore, the overall time complexity is O(n^2).

This polynomial time complexity makes the Gale-Shapley algorithm efficient and practical for many real-world applications.

Optimizing the Implementation

While our initial implementation is correct, there are ways to optimize it for better performance. Here’s an optimized version that uses dictionaries to improve lookup times:

def optimized_gale_shapley(men_preferences, women_preferences):
    n = len(men_preferences)
    free_men = set(range(n))
    women_partners = {}
    men_rank = {m: {w: r for r, w in enumerate(pref)} for m, pref in enumerate(men_preferences)}
    women_rank = {w: {m: r for r, m in enumerate(pref)} for w, pref in enumerate(women_preferences)}
    next_proposal = [0] * n

    while free_men:
        man = free_men.pop()
        woman = men_preferences[man][next_proposal[man]]
        next_proposal[man] += 1

        if woman not in women_partners:
            women_partners[woman] = man
        else:
            current_partner = women_partners[woman]
            if women_rank[woman][man] < women_rank[woman][current_partner]:
                women_partners[woman] = man
                free_men.add(current_partner)
            else:
                free_men.add(man)

    return women_partners

# Example usage
men_preferences = [
    [0, 1, 2],
    [0, 2, 1],
    [1, 2, 0]
]
women_preferences = [
    [2, 1, 0],
    [0, 2, 1],
    [1, 0, 2]
]

result = optimized_gale_shapley(men_preferences, women_preferences)
for woman, man in result.items():
    print(f"Woman {woman} is matched with Man {man}")

This optimized version uses dictionaries to store rankings, which allows for O(1) lookup time when comparing preferences. While the overall time complexity remains O(n^2), this optimization can significantly improve performance for larger datasets.

Real-World Applications of the Gale-Shapley Algorithm

The Gale-Shapley algorithm has found applications in various real-world scenarios, making it a valuable tool beyond just coding interviews. Understanding these applications can help you appreciate the algorithm’s importance and potentially inspire innovative solutions in your future projects.

1. College Admissions

One of the most well-known applications of the Gale-Shapley algorithm is in college admissions systems. In this scenario:

  • Students represent one set, with their preferences being the colleges they wish to attend.
  • Colleges represent the other set, with their preferences based on student qualifications.
  • The algorithm helps create a stable matching between students and colleges, ensuring that no student and college would both prefer each other over their current matches.

2. Job Market Matching

The algorithm can be applied to match job seekers with employers:

  • Job seekers rank potential employers based on factors like salary, company culture, and job responsibilities.
  • Employers rank candidates based on qualifications, experience, and interview performance.
  • The stable matching ensures that no employee and employer would both prefer to be matched with each other over their current situations.

3. Organ Donation

In organ donation systems, particularly for kidney exchanges:

  • Donors and recipients are matched based on medical compatibility and other factors.
  • The algorithm can help create stable matches in complex scenarios involving multiple donor-recipient pairs.

4. Roommate Matching

While the original Gale-Shapley algorithm deals with two distinct sets, variations of it can be used for roommate matching:

  • Students or tenants rank potential roommates based on lifestyle preferences, schedules, etc.
  • The algorithm can help create stable roommate pairings, minimizing conflicts and maximizing compatibility.

5. Network Resource Allocation

In computer networks and distributed systems:

  • The algorithm can be used to allocate resources (like servers or bandwidth) to clients or tasks.
  • This ensures a fair and stable distribution of resources based on priorities and requirements.

Variations and Extensions of the Gale-Shapley Algorithm

As you delve deeper into algorithmic problem-solving, it’s important to understand that many algorithms have variations and extensions. The Gale-Shapley algorithm is no exception. Here are some interesting variations you might encounter:

1. Hospital/Residents Problem

This is a many-to-one variation of the stable marriage problem:

  • Instead of one-to-one matching, each hospital can accept multiple residents up to its capacity.
  • The algorithm is adapted to handle these multiple slots while maintaining stability.

2. Stable Roommates Problem

This variation deals with matching within a single set:

  • Each person ranks all other people as potential roommates.
  • The goal is to find stable roommate pairings within the group.
  • Unlike the original algorithm, a stable solution is not guaranteed to exist in all cases.

3. Incomplete Preference Lists

In real-world scenarios, preference lists might not be complete:

  • Some elements might be considered unacceptable and not included in preference lists.
  • The algorithm is modified to handle these incomplete preferences while still finding a stable matching.

4. Ties in Preferences

Another real-world consideration is the possibility of ties in preferences:

  • Elements might be equally preferred in some cases.
  • Different variations of stability (weak stability, strong stability) are defined to handle these scenarios.

Preparing for Coding Interviews: Gale-Shapley Algorithm Questions

When preparing for coding interviews, especially for top tech companies, it’s crucial to not only understand the algorithm but also be prepared to solve related problems. Here are some types of questions you might encounter:

1. Implementation from Scratch

You might be asked to implement the Gale-Shapley algorithm from scratch. Be prepared to write clean, efficient code that handles the core logic of the algorithm.

2. Optimizations and Variations

Interviewers might ask you to optimize the algorithm for specific scenarios or implement one of the variations we discussed earlier. Be ready to adapt the basic algorithm to new constraints.

3. Time and Space Complexity Analysis

Be prepared to discuss the time and space complexity of your implementation and how it might change with different input sizes or constraints.

4. Real-World Application Design

You might be given a real-world scenario and asked to design a system using the Gale-Shapley algorithm. This tests your ability to apply algorithmic knowledge to practical problems.

5. Edge Cases and Error Handling

Interviewers often look for how you handle edge cases and errors. Consider scenarios like empty input, unequal set sizes, or invalid preferences.

Practice Problems

To help you prepare, here are a few practice problems related to the Gale-Shapley algorithm:

Problem 1: Implement Gale-Shapley with Capacity

Modify the Gale-Shapley algorithm to handle the scenario where one set (e.g., colleges) can accept multiple matches up to a certain capacity.

Problem 2: Stable Roommates

Implement a variation of the algorithm to solve the stable roommates problem, where you need to find stable pairings within a single set of people.

Problem 3: Preference Ranking Optimization

Given a large number of elements (e.g., 100,000), optimize the preference lookup to improve the algorithm’s performance.

Problem 4: Incomplete Preferences

Modify the algorithm to handle incomplete preference lists, where some elements might not rank all others.

Conclusion

The Gale-Shapley algorithm is a powerful tool in the world of computer science and algorithmic problem-solving. Its ability to efficiently solve the stable matching problem makes it valuable in various real-world applications and a favorite topic in coding interviews.

As you prepare for interviews, especially with major tech companies, mastering this algorithm will not only help you solve specific problems but also demonstrate your ability to think algorithmically and apply abstract concepts to practical scenarios. Remember to practice implementing the algorithm, analyze its complexity, and explore its variations and applications.

By thoroughly understanding the Gale-Shapley algorithm, you’re equipping yourself with a valuable skill that showcases your problem-solving abilities and your readiness to tackle complex algorithmic challenges in the tech industry. Keep practicing, exploring different scenarios, and you’ll be well-prepared to impress in your coding interviews and beyond.