分隔字符串

Question

给出一个字符串，我想生成所有可能的组合。换句话说，所有可能的方式都是在字符串中放置逗号。

例如：

input:  ["abcd"]
output: ["abcd"]
        ["abc","d"]
        ["ab","cd"]
        ["ab","c","d"]
        ["a","bc","d"]
        ["a","b","cd"]
        ["a","bcd"]
        ["a","b","c","d"]

我对如何生成所有可能的列表有些困惑。组合只会给我列出一组字符串的子集长度，排列会提供所有可能的排序方式。

由于遍历切片，我只能用列表中的一个逗号来处理所有情况，但是我不能用“ ab”，“ c”，“ d”和“ a”之类的两个逗号来处理情况。” b“，” cd“

我的尝试（切片）：

test="abcd"

for x in range(len(test)):
     print test[:x],test[x:]

Answer 1

怎么样：

from itertools import combinations

def all_splits(s):
    for numsplits in range(len(s)):
        for c in combinations(range(1,len(s)), numsplits):
            split = [s[i:j] for i,j in zip((0,)+c, c+(None,))]
            yield split

此后：

>>> for x in all_splits("abcd"):
...     print(x)
...     
['abcd']
['a', 'bcd']
['ab', 'cd']
['abc', 'd']
['a', 'b', 'cd']
['a', 'bc', 'd']
['ab', 'c', 'd']
['a', 'b', 'c', 'd']

Answer 2

您当然可以使用itertools，但我认为直接编写递归生成器会更容易：

def gen_commas(s):
    yield s
    for prefix_len in range(1, len(s)):
        prefix = s[:prefix_len]
        for tail in gen_commas(s[prefix_len:]):
            yield prefix + "," + tail

然后

print list(gen_commas("abcd"))

打印

['abcd', 'a,bcd', 'a,b,cd', 'a,b,c,d', 'a,bc,d', 'ab,cd', 'ab,c,d', 'abc,d']

我不确定为什么我发现这更容易。也许只是因为直接进行操作非常简单；-）

Answer 3

您可以生成可以放置逗号的n-1个位置的幂集：

what's a good way to combinate through a set?

然后在每个位置插入逗号。

Answer 4

使用itertools：

import itertools
input_str =  "abcd"
for k in range(1,len(input_str)):
    for subset in itertools.combinations(range(1,len(input_str)), k): 
        s = list(input_str)
        for i,x in enumerate(subset): s.insert(x+i, ",")
        print "".join(s)

Gives：

a,bcd
ab,cd
abc,d
a,b,cd
a,bc,d
ab,c,d
a,b,c,d

也是递归版本：

def commatoze(s,p=1):
    if p == len(s):
        print s
        return
    commatoze(s[:p] + ',' + s[p:], p + 2)
    commatoze(s, p + 1)

input_str =  "abcd"
commatoze(input_str)

Answer 5

您可以求解integer composition problem，并使用合成来指导在哪里拆分列表。只需一点点动态编程就可以很容易地解决整数组成。

def composition(n):
    if n == 1: 
        return [[1]] 
    comp = composition (n - 1) 
    return [x + [1] for x in comp] + [y[:-1] + [y[-1]+1] for y in comp]

def split(lst, guide):
    ret = []
    total = 0
    for g in guide:
        ret.append(lst[total:total+g])
        total += g
    return ret

lst = list('abcd')
for guide in composition(len(lst)):
    print split(lst, guide)

生成整数成分的另一种方法：

from itertools import groupby
def composition(n):
    for i in xrange(2**(n-1)):
        yield [len(list(group)) for _, group in groupby('{0:0{1}b}'.format(i, n))]

Answer 6

给出

import more_itertools as mit

代码

list(mit.partitions("abcd"))

输出

[[['a', 'b', 'c', 'd']],
 [['a'], ['b', 'c', 'd']],
 [['a', 'b'], ['c', 'd']],
 [['a', 'b', 'c'], ['d']],
 [['a'], ['b'], ['c', 'd']],
 [['a'], ['b', 'c'], ['d']],
 [['a', 'b'], ['c'], ['d']],
 [['a'], ['b'], ['c'], ['d']]]

通过more_itertools安装more_itertools。

分隔字符串

问题描述投票：23回答：6

6个回答

最新问题

分隔字符串

问题描述 投票：23回答：6

6个回答

最新问题

问题描述投票：23回答：6