Data Structures and Algorithms

Optimizing the Longest Substring Without Repeating Characters

1 week, 5 days ago ; F(visit_count) + Value(1) views
Share this

Optimizing the Longest Substring Without Repeating Characters

One everyday use case of string manipulation is finding the length of the longest substring. 

This can be with or without repeating characters.

In this article, we will cover the one without repeating characters.

This challenge is often used to assess algorithmic thinking in coding interviews and real-world applications.

Let's explore the  Python solution.

Subsequently, we will optimize it and break it down step-by-step to ensure clarity.

Understanding the Problem

Given a string s, we need to find the length of the longest contiguous substring that does not contain any repeating characters.

For example:

s = "abcabcbb"

The longest substring without repeating characters is "abc", and its length is 3.

Brute Force Approach

The brute-force version of this problem involves checking every possible substring and determining whether it contains all unique characters. 

It has a time complexity of O(n³) due to the nested loops and an additional check for unique characters.

class Solution:

def length_of_longest_substring(self, s: str) -> int:

   """
   Brute force approach to find the length of the longest substring
   without repeating characters.


   Args:
     s (str): Input string
   Returns:
     int: Length of the longest substring without repeating characters
   """

   def is_unique(substring: str) -> bool:
       return len(set(substring)) == len(substring)

   max_length = 0
   n = len(s)

   # Generate all substrings
   for i in range(n):
       for j in range(i, n):
           if is_unique(s[i:j+1]):
               max_length = max(max_length, j - i + 1)

   return max_length


# Example usage:
sol = Solution()
print(f"Longest substring length: {sol.length_of_longest_substring('abcabcbb')}")

Explanation

First, use two nested loops to generate all substrings.

Secondly, Check if the substring has unique characters. Use a helper function (is_unique), which converts the substring to a set and compares its length with the substring length.

Thirdly, Update max_length if a longer valid substring is found.

The overall time complexity is O(n^3):

  • O(n²) for the two nested loops that generate all substrings.
  • O(n) for checking uniqueness using set(), leading to an overall complexity of O(n³).

Optimized Python Solution

Here's an optimized implementation using the Sliding Window Technique, which maintains a dynamic window of non-repeating characters.

class Solution:
    def length_of_longest_substring(self, s: str) -> int:

        """
        Finds the length of the longest substring without repeating characters.

        Uses the sliding window approach with a set for efficient lookup.
        
	   Args:
          s(str): Input string
         Returns: 
            Length of the longest substring without repeating characters
        """
        char_set = set()
        l = 0  # Left boundary of the sliding window
        max_length = 0
        
        for r in range(len(s)):
            # If a duplicate character is found, shrink the window from the left
            while s[r] in char_set:
                char_set.remove(s[l])
                l += 1
            
            # Add the new character to the set and update max_length
            char_set.add(s[r])
            max_length = max(max_length, r - l + 1)
        
        return max_length

# Example usage:
sol = Solution()
print(f"Longest substring length: {sol.length_of_longest_substring ('abcabcbb')}")

Explanation

We use a set (char_set) for fast lookup thus tracking characters in the current window efficiently.

The l moves forward only when a duplicate character is encountered. It ensures an optimal O(n) time complexity in the sliding window technique.

The max_length is updated dynamically without unnecessary recalculations leading to efficient updates.

Performance Analysis

Approach                                       Time Complexity                    Space Complexity

Brute Force                                            O(n²)                                      O(n)    
Sliding Window (Optimized)                   O(n)                                       O(n)    

When to Use the Optimum Solution

 

  • - When working with large strings where, brute force would be inefficient.

  • - In real-time character processing, such as text editors or autocomplete systems.

 

Take the Next Step

 

Understanding the sliding window technique is essential for solving similar problems efficiently. Try modifying the solution to return the actual substring instead of just its length.

If this helped, consider exploring related problems like the longest substring with at most K Distinct Characters or the Minimum Window Substring.
Happy coding! 
 

Become a member
Get the latest news right in your inbox. We never spam!

Read next

Finding the Maximum Number of Vowels in a Substring

Finding the Maximum Number of Vowels in a Substring   … Read More

6 days, 17 hours ago . 196 views