Welcome to ShenZhenJia Knowledge Sharing Community for programmer and developer-Open, Learning and Share
menu search
person
Welcome To Ask or Share your Answers For Others

Categories

I am trying to separate the time from AM in a string. The code

content= "11:20pm"
content = re.findall(r"[^Wd_]+|d+", content)
print(content)

I expect ['11:20','pm'] but with my code I get ['11','20','pm'] What should I do?

question from:https://stackoverflow.com/questions/66051771/what-should-be-the-regex-expresesion

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
thumb_up_alt 0 like thumb_down_alt 0 dislike
125 views
Welcome To Ask or Share your Answers For Others

1 Answer

You may match the time by adding (?::d+)? pattern to the d+ alternative:

content= "11:20pm"
content = re.findall(r"[^Wd_]+|d+(?::d+)?", content)
print(content) # => ['11:20', 'pm']

See the Python demo and the regex demo.

Note you might want to extend the pattern to also match float values, and if yes, you would need to use r"[^Wd_]+|d+(?:[:.]d+)?".

Details:

  • [^Wd_]+ - one or more letters
  • | - or
  • d+ - one or more digits
  • (?::d+)? - an optional sequence of a `:~ and one or more digits.

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
thumb_up_alt 0 like thumb_down_alt 0 dislike
Welcome to ShenZhenJia Knowledge Sharing Community for programmer and developer-Open, Learning and Share
...