如何使用正则python中的关键字列表提取单词？我正在尝试使用Python中的正则拨号提取位置。现在我正在这样做： def get_location（s）： s = s.Strip（strip_chars）关键字=“ at |外部|接近” location_pattern =“（？p

Question

它仅给出“ AT” AS输出，但也应该提供建筑物。

captures = match.capturesdict()

我无法用来提取其他示例的捕获。

当我这样做时。它似乎在起作用。有人可以解释我做错了什么吗？

```
这里的主要问题是您需要放置
```
location_pattern = 'at|outside\s\w+
{keywords}
```
。这是一个示意图示例：
```
(?:{keywords})

a|b|c\s+\w+

Answer 1

a

+++++（a | b | c）\ s+s+\ w+w+w+w+

a

b

<whitespace(s)>

c`，然后它尝试匹配whitespaces，然后匹配字chars.

请参阅更新的代码（在线

demo）：

. When you put the alternation list into a group,

输出：

, it matches either

注意，

, or

由于

or

到处都不匹配，因此不起作用，并且必须遵循空格和字符。您可以以相同的方式修复它：

import regex as re
def get_location(s):
    STRIP_CHARS = '*'
    s = s.strip(STRIP_CHARS)
    keywords = "at|outside|near"
    location_pattern = "(?P<location>((?P<place>(?:{keywords})\s+[A-Za-z]+)))".format(keywords = keywords)
    location_regex = re.compile(location_pattern, re.IGNORECASE | re.UNICODE)

    for match in location_regex.finditer(s):
        match_str = match.group(0)
        indices = match.span(0)
        print ("Match", match)
        match_str = match.group(0)
        indices = match.span(0)
        print (match_str)
        captures = match.capturesdict()
        print(captures)

get_location("Im at building 3")

.

如果将关键字放入组中，则('Match', <regex.Match object; span=(3, 14), match='at building'>) at building {'place': ['at building'], 'location': ['at building']}

将效果很好（请参见上面的输出）。

location_pattern = 'at|outside\s\w+

如何使用正则python中的关键字列表提取单词？我正在尝试使用Python中的正则拨号提取位置。现在我正在这样做： def get_location（s）： s = s.Strip（strip_chars）关键字=“ at |外部|接近” location_pattern =“（？p

问题描述投票：0回答：1

1个回答

最新问题

如何使用正则python中的关键字列表提取单词？ 我正在尝试使用Python中的正则拨号提取位置。 现在我正在这样做： def get_location（s）： s = s.Strip（strip_chars） 关键字=“ at |外部|接近” location_pattern =“（？p

问题描述 投票：0回答：1

1个回答

最新问题

如何使用正则python中的关键字列表提取单词？我正在尝试使用Python中的正则拨号提取位置。现在我正在这样做： def get_location（s）： s = s.Strip（strip_chars）关键字=“ at |外部|接近” location_pattern =“（？p

问题描述投票：0回答：1