19

If I have string needle and I want to check if it exists contiguously as a substring in haystack, I can use:

if needle in haystack:
    ...

What can I use in the case of a non-continuous subsequence? Example:

>>> haystack = "abcde12345"
>>> needle1 = "ace13"
>>> needle2 = "123abc"
>>> is_subsequence(needle1, haystack)
True
>>> is_subsequence(needle2, haystack)  # order is important!
False
wim
  • 302,178
  • 90
  • 548
  • 690
user4847061
  • 1,075
  • 1
  • 8
  • 8

4 Answers4

18

I don't know if there's builtin function, but it is rather simple to do manually

def exists(a, b):
    """checks if b exists in a as a subsequence"""
    pos = 0
    for ch in a:
        if pos < len(b) and ch == b[pos]:
            pos += 1
    return pos == len(b)
>>> exists("moo", "mo")
True
>>> exists("moo", "oo")
True
>>> exists("moo", "ooo")
False
>>> exists("haystack", "hack")
True
>>> exists("haystack", "hach")
False
>>>
Ishamael
  • 12,123
  • 4
  • 31
  • 50
17

Using an iterator trick:

it = iter(haystack)
all(x in it for x in needle)

This is only a concise version of the same idea presented in another answer.

wim
  • 302,178
  • 90
  • 548
  • 690
  • 2
    For anyone else who tries to inline `it`, that is, tries to do it one line like: `all(x in iter(haystack) for x in needle)`, it doesn't work because `iter(haystack)` is re-instantiated each time. – Garrett May 06 '20 at 01:15
  • This implementation is **in average 1000 times faster** than @Ishamael implementation – pouya Oct 02 '21 at 18:55
5

Another possibility: You can create iterators for both, needle and haystack, and then pop elements from the haystack-iterator until either all the characters in the needle are found, or the iterator is exhausted.

def is_in(needle, haystack):
    try:
        iterator = iter(haystack)
        for char in needle:
            while next(iterator) != char:
                pass
        return True
    except StopIteration:
        return False
tobias_k
  • 78,071
  • 11
  • 109
  • 168
-1

We can try simple for loop and break method and pass on substring once the match is found

def substr(lstr,sstr):
lenl = len(lstr)
for i in sstr:
    for j in range(lenl):
        if i not in lstr:
            return False
        elif i == lstr[j]:
            lstr = lstr[j+1:]
            break
        else:
            pass
return True   
Shri
  • 117
  • 1
  • 1
  • 8