How to compare strings in a list python

Related Searches: python compare strings ignore case, python compare two strings character by character, python compare two strings for similarity, python compare two strings and return the difference, python compare strings ignore whitespace, python compare strings not equal, python compare strings with wildcard, python compare strings in list, python compare strings lexicographically, python compare strings in two lists

Introduction to Python compare strings

In python, everything that we write inside double or single quotation marks is treated as a string. Strings are immutable sequences of data type and strings are actually a sequence of Unicode characters. In this tutorial, we will talk about python string comparisons. We will take different cases and see how we can compare strings by taking examples using different comparison operators. Moreover, we will also compare strings in the list as well. By the end of this tutorial, you will have a solid knowledge of python compare strings.

Example of Python strings

As we already discussed, that string is an immutable sequence data type which is wrapped inside single, double, or triple quotes.

See the example below:

# creating string x = "My name" # printing type print(type(x))

Output:

<class 'str'>

A white space written inside quotation marks is also considered to be a string type. See the example below:

# creating string x = " " # printing type print(type(x))<code>

Output:

<class 'str'>

Python compare strings ignore cases

Now let us compare two strings in python. In this particular section we will develop a python program to compare between strings to ignore cases or case-sensitive. Python is a case sensitive programming language. Here uppercase letter is not equal to a lowercase letter. That means when we compare two strings in python they should be exactly the same, even the case otherwise they will be considered to be different. See the example below:

str1 = "Hello" str2 = "hello" print(str1==str2)

Output:

False

However, we can develop a case-insensitive program which means that it will ignore the uppercase or lowercase differences.

Python compare strings ignore case using lower function

In Python there is a built-in function known as lower() function which converts all the uppercase characters in a string to a lowercase character. See the example of how we can ignore the case sensitivity using a lower function.

str1 = "Hello" str2 = "hello" print(str1.lower()==str2.lower())

Output:

True

Notice that this time we get True because lower() function converts all the characters into lower case and then compares the two strings.

See one more example here which compares.

# strings str1 = "My name Is Bashir alam" str2 = "my name is bashir alam" # if statement to compare two strings if str1.lower() == str2.lower():    print("they are same") else:    print("they are not same")

Output:

they are same

Python compare strings ignore case using upper method

We also have a built-in function known as upper() which converts all the lower case characters to upper case. It works in a similar way we lower works. See the example below:

# strings str1 = "Hello" str2 = "hello" # check using upper function print(str1.upper()==str2.upper())

Output:

True

In a similar way, we can use them in if-else statements as well. See one more example below:

# strings str1 = "My name Is Bashir alam" str2 = "my name is bashir alam" # if statement to compare two strings if str1.upper() == str2.upper():    print("they are same") else:    print("they are not same")

Output:

they are same

Python compare strings ignore case using casefold function

The casefold function is another python built-in function that works similar to lower and upper methods. But unlike lower() and upper() which performs a strict string comparison by removing all the case distinctions, the casefold method is used for caseless matching. For example the german letter ß is equal to ss in english. If we apply an upper or lower method to compare this character with ss, it will return False, but the casefold will convert ß to ss and return True. See the following example.

# strings str1 = "ß" str2 = "ss" # check using upper function print(str1.casefold()==str2.casefold())

Output:

True

Python compare two strings characters by characters

Another way to compare two strings is to compare character by character. As we know strings are iterable so we can iterate over them and compare each character. There can be many ways to iterate but here we will see the simplest one. See the example below which used for loop to iterate over string and compares both:

# function to compare two strings def stringCompare(str1, str2):    # for loop to iterate    for i in range(len(str1)):        # check each characters        if str1[i]!=str2[i]:            # return false is one of the character is not equal            return False        else:            pass    # if all characters are equal, it returns true    return True print(stringCompare("my name is bashir alam", "my name is bashir alam"))

Output:

True

In the above example, the function takes two strings as an input and uses a for loop and then compares each character of strings by specifying their index number.

In a similar way we can compare character by character by ignoring the case difference. See the example which ignores the case difference.

# function to compare two strings def stringCompare(str1, str2):    # for loop to iterate    for i in range(len(str1)):        # check each characters        if str1[i].lower()!=str2[i].lower():            # return false is one of the character is not equal            return False        else:            pass    # if all characters are equal, it returns true    return True print(stringCompare("My NamE is BAshir ALam", "my name is bashir alam"))

Output:

True

We still get True because the lower() method converts all the upper case into lowercase and then compares the characters.

Python compare strings and return the difference

There might be a situation where we wanted to find out the difference between two strings, or we wanted to compare two strings but it returns False so we wanted to figured out the difference between those two. In python there can be many ways to achieve this but here we will take the most simplest one. See the following code.

# function to print the difference def compareString(str1, str2):    # creating empty variables    result1 = ''    result2 = ''    #handle the case where one string is longer than the other    maxlen=len(str2) if len(str1)<len(str2) else len(str1)    #loop through the characters    for i in range(maxlen):        #use a slice rather than index in case one string longer than other        letter1=str1[i:i+1]        letter2=str2[i:i+1]        #create string with differences        if letter1 != letter2:            # adding letters to result            result1+=letter1            result2+=letter2    # printing the difference    print ("Letters different in str1: ",result1)    print ("Letters different in str2: ",result2) # calling the function compareString("Bashir", 'bashir')

Output:

Letters different in str1:  B Letters different in str2:  b

Notice that in the given string only the B was different and this program successfully returns the different value.  Here is another way of applying the same logic. See the code below which returns the difference values.

# function to print the difference def compareString(str1, str2):    # creating empty strings    result1 = ""    result2 = ""    # looping str1 and str2    for a, b in zip(str1, str2):        # check if equal or not        if a != b:            result1+=a            result2+=b    return result1 +"  and  " +result2 # calling the function print(compareString("Bashir", 'bAshir'))

Output:

Ba  and  bA

As we mentioned earlier, there can be many more methods to achieve the same result. You can choose any of them depending on your coding skills.

Python compare strings ignore whitespaces

Now let's see the following simple example where we will try to compare two strings, one with whitespace before it and one without any white space.  See the example below:

# strings str1 = "B" str2 = " B" # compare two strings print(str1==str2)

Output:

False

We get False because whitespaces are also considered to be characters in python. Suppose we don't want our program to consider whitespaces, in python we can achieve this by a simple piece of code. Python provides us a built-in function known as strip() which helps us to achieve the above-mentioned logic and it ignores the white spaces.

# strings str1 = "B" str2 = " B" # compare two strings using strip function print(str2.strip() == str1.strip())

Output:

True

This time it returns True because the strip function ignores the whitespaces and then we compare the strings.

See one more example which contains a lot of white spaces.

# strings str1 = "This is Bashir Alam" str2 = "        This is Bashir Alam" # compare two strings using strip function if str1.strip()==str2.strip():    print("both are same") else:    print("Both are different")

Output:

both are same

Python compare strings not equal operator

So far we have compared the strings using python equality sign ( == ). However, python also supports us to compare strings based on not equality sign (!=). This will return true if both of the strings are not equal otherwise it will return False. See the example below:

# strings str1 = "Bashir" str2 = "Alam" # compare strings print(str1!=str2)

Output:

True

It returns True because both of the strings are not equal and our assumption is True.

In Python is not is also an alternative to compare two strings. It will also return True if both the strings are not the same.  See the example which uses is not instead of != operator

# strings str1 = "Bashir" str2 = "Alam" # compare strings print(str1 is not str2)

Output:

True

Python compare strings with wildcard

The asterisk (*) is used to specify any number of characters. It is typically used at the end of a root word. This is great when you want to search for variable endings of a root word. For example, searching for clean* would tell the database to look at all the possible words ending to root clean. Result might be cleaning, cleaner or cleaned depending on our list of words.

While question mark (?) is used to represent a single character anywhere in the word. It is most useful when there are variable spellings for a word, and we want to search for all variants at once. For example, searching for l?st might return list or lost depending on the words available.

Now let us take an example of both asterisk and question mark to search for different words. See the python code below:

# Regular expression library import re # list of different words WordList = ["list", "clean", "lost","cleaner"           , "worker", "working"] # for loop to iterate our list of words for word in WordList:        # The . symbol is used in place of ? symbol        if re.search('l.st', word) :            # printing the word                print (word)

Output:

list lost

Now let us take the example of asterisk and see what we get for clean*. See the example below:

# Regular expression library import re # list of different words WordList = ["list", "clean", "lost","cleaner"            , "worker", "working"] # for loop to iterate our list of words for word in WordList:        # The .+ symbol is used in place of * symbol        if re.search('clean.+', word) :            # printing the word            print (word)

Output:

cleaner

Python compare strings in a list

Python provides us various ways to compare two lists. Comparison is the process when the data items of one side are checked against another data item to see whether they are the same or not. Here we will compare the data of two sets and see if they are equal or not.

See the example below which direct equality sign to compare two lists containing strings as a data set.

# lists containing strings list1 = ["a", "b", "c", "d"] list2 = ["a", "b", "c", "d"]  # compares two lists if list1 == list2:    print("The list1 and list2 are equal") else:    print("The list1 and list2 are not equal")

Output:

The list1 and list2 are equal

Suppose we want to compare two sets and we assume that it should return True for duplicates values as well. We can achieve this by using the set() method, which will first convert our list to set and delete any duplicate elements. See the example below:

# lists containing strings list1 = ["a", "b", "c", "d", "d"] list2 = ["a", "b", "c", "d"]  # compares two lists if set(list1) == set(list2):    print("The list1 and list2 are equal") else:    print("The list1 and list2 are not equal")

Output:

The list1 and list2 are equal

Notice that list1 contains duplicates elements but still we get True for comparison because set() method removed the duplicate values.

Another method of comparing two sets of strings might be to loop over them and compare each element and if they return True if all are the same. See the python program below which compares the list of strings using for loop.

# function to compare list def compareList(str1, str2):    # for loop to iterate    for i in range(len(str1)):        # check if data is not equal        if str1[i]!=str2[i]:            # return False            return False        else:            pass    # return True    return True # calling the function print(compareList(["a", "b", "c", "d"] ,["a", "b", "c", "d"] ))

Output:

True

Python compare strings lexicographically

We use comparison operators to compare strings lexicographically in Python. We already have used equal and not equal operators to compare strings. Here is a list of comparison operators that are used to compare two strings.

  • Equal to == : check if two strings are equal
  • Not equal to !=: check if two strings are not equal
  • Greater then > : check if a string is greater than other
  • Less than < : check if a string is less than other

As we already have used the first two operators, in this section we will focus on the last two comparison operators. See the following example which compares two strings.

# strings str1 = "B" str2 = "b" # comparing print(str1 > str2) print(str2 > str1)

Output:

False True

Notice that the upper case B is smaller than the lower case b because in the ASCII  set, the numeric value of upper B is 66 while that of lower one is 98. In python all the uppercase alphabets have smaller numeric values ( starting from 65 ) and all the lower case alphabets have larger numeric values ( starting from 97).

See the example below which uses less than operator. This time it will return True for uppercase letters.

# strings str1 = "B" str2 = "b" # comparing print(str1 < str2) print(str2 < str1)

Output:

True False

Now let us compare two lists containing strings dataset. See the example below:

# strings str1 = ["bashir", "alam"] str2 = ["bashir", "Alam"] # comparing print(str1 < str2) print(str2 < str1)

Output:

False True

Summary

Strings are sequences of characters that can include numbers, letters, symbols and whitespaces. Everything in python that is written inside single, double or triple quotation marks will be considered as string.  Python strings are an important data type because they allow us to interact with text-based data in our programs. In this tutorial we learned about string comparison. We covered various ways to compare strings in python using different comparison operators and examples.

Further Reading Section

python compare strings
Python string
comparison operators