How do you extract a substring within column that contains people's Name & title that are in "Myles, Mr. Thomas Francis" format and only want "Mr."

Question

How do you extract a substring within column that contains people's Name & title that are in "Myles, Mr. Thomas Francis" format and only want "Mr."

86 views Asked by Noumenax At 17 August 2024 at 16:29

enter image description here enter image description hereWant to add matched results as new column of dataframe within a python function

I tried using re.search() expression

for i in input_df["Name"]:
   Title[i] = re.search(".$",i)

I get Type_error and not sure how to write pattern to get desired result

Original Q&A

There are 2 answers

hught On 11 May 2023 at 14:27

re.search returns a "match object" if a match is found, or None if no match is found. So you may want to do something like:

for i in input_df["Name"]:
    x = re.search("Mr[s]*\.",i)
    if x:
        Title[i] = (x.group())

Your code sets the value of Title[i] to a match object instead of a string, which is probably where the type error is coming from. Use the .group() method to return just the matching part of the string. Use the if statement to handle cases where no match was found.

As for the regex, I don't know what your data looks like exactly but you could try matching one more more capital letters, followed by zero or more lower case letters, followed by a period. Eg re.search("[A-Z]+[a-z]*\.",i)

Be warned regex can trip up over edge cases, so check carefully.

**Tim Biegeleisen** · Accepted Answer · 2023-05-11 13:44:13

You could use str.extract here with the regex pattern \b[A-Z][a-z]+\.:

input_df["Title"] = input_df["Name"].str.extract(r'\b([A-Z][a-z]+\.)')

For a more sophisticated option, you could also use str.replace:

input_df["Title"] = input_df["Name"].str.replace(r'^.*,\s+|\s+.*$', '', regex=True)

TechQA.

How do you extract a substring within column that contains people's Name & title that are in "Myles, Mr. Thomas Francis" format and only want "Mr."

There are 2 answers

Related Questions in PYTHON

Related Questions in STRING

Related Questions in DATAFRAME

Related Questions in SUBSTRING

Related Questions in EXTRACT

Popular Questions

Popular Tags

Trending Questions