Optimizing the Longest Substring Without Repeating Characters
One everyday use case of string manipulation is finding the length of the longest substring.
This can be with or without repeating characters.
In this article, we will cover the one without repeating characters.
This challenge is often used to assess algorithmic thinking in coding interviews and real-world applications.
Let's explore the Python solution.
Subsequently, we will optimize it and break it down step-by-step to ensure clarity.
Understanding the Problem
Given a string s, we need to find the length of the longest contiguous substring that does not contain any repeating characters.
For example:
s = "abcabcbb"
The longest substring without repeating characters is "abc", and its length is 3.
Brute Force Approach
The brute-force version of this problem involves checking every possible substring and determining whether it contains all unique characters.
It has a time complexity of O(n³) due to the nested loops and an additional check for unique characters.
class Solution:
def length_of_longest_substring(self, s: str) -> int:
"""
Brute force approach to find the length of the longest substring
without repeating characters.
Args:
s (str): Input string
Returns:
int: Length of the longest substring without repeating characters
"""
def is_unique(substring: str) -> bool:
return len(set(substring)) == len(substring)
max_length = 0
n = len(s)
# Generate all substrings
for i in range(n):
for j in range(i, n):
if is_unique(s[i:j+1]):
max_length = max(max_length, j - i + 1)
return max_length
# Example usage:
sol = Solution()
print(f"Longest substring length: {sol.length_of_longest_substring('abcabcbb')}")
Explanation
First, use two nested loops to generate all substrings.
Secondly, Check if the substring has unique characters. Use a helper function (is_unique), which converts the substring to a set and compares its length with the substring length.
Thirdly, Update max_length if a longer valid substring is found.
The overall time complexity is O(n^3):
- O(n²) for the two nested loops that generate all substrings.
- O(n) for checking uniqueness using set(), leading to an overall complexity of O(n³).
Optimized Python Solution
Here's an optimized implementation using the Sliding Window Technique, which maintains a dynamic window of non-repeating characters.
class Solution:
def length_of_longest_substring(self, s: str) -> int:
"""
Finds the length of the longest substring without repeating characters.
Uses the sliding window approach with a set for efficient lookup.
Args:
s(str): Input string
Returns:
Length of the longest substring without repeating characters
"""
char_set = set()
l = 0 # Left boundary of the sliding window
max_length = 0
for r in range(len(s)):
# If a duplicate character is found, shrink the window from the left
while s[r] in char_set:
char_set.remove(s[l])
l += 1
# Add the new character to the set and update max_length
char_set.add(s[r])
max_length = max(max_length, r - l + 1)
return max_length
# Example usage:
sol = Solution()
print(f"Longest substring length: {sol.length_of_longest_substring ('abcabcbb')}")
Explanation
We use a set (char_set) for fast lookup thus tracking characters in the current window efficiently.
The l moves forward only when a duplicate character is encountered. It ensures an optimal O(n) time complexity in the sliding window technique.
The max_length is updated dynamically without unnecessary recalculations leading to efficient updates.
Performance Analysis
Approach Time Complexity Space Complexity
Brute Force O(n²) O(n)
Sliding Window (Optimized) O(n) O(n)
When to Use the Optimum Solution
-
- When working with large strings where, brute force would be inefficient.
-
- In real-time character processing, such as text editors or autocomplete systems.
Take the Next Step
Understanding the sliding window technique is essential for solving similar problems efficiently. Try modifying the solution to return the actual substring instead of just its length.
If this helped, consider exploring related problems like the longest substring with at most K Distinct Characters or the Minimum Window Substring.
Happy coding!