首页猿问为什么我收到...

为什么我收到 RuntimeError：生成器引发了 StopIteration？以及如何解决？

Python

心有法竹 2022-01-18 21:35:20

我正在制作存储在列表 docToken 中的令牌的 Bigrams。print(docToken[520])输出：['sleepy', 'account', 'just', 'man', 'tired', 'twitter', 'case', 'romney', 'candidate', 'looks']list(nltk.bigrams(docToken[520]))输出：[('sleepy', 'account'), ('account', 'just'), ('just', 'man'), ('man', 'tired'), ('tired', 'twitter '), ('twitter', 'case'), ('case', 'romney'), ('romney', 'candidate'), ('candidate', 'looks')]当我nltk.bigrams(docToken[i])在循环中使用时，我在范围> = 1000上遇到以下错误：bigram=[]for i in range(5000): ls=list(nltk.bigrams(docToken[i])) for j in ls: bigram.append(list(j))当第一个循环中的 range(500) 时它工作得很好，但是当 Range 为 1000 或更大时，它给了我以下错误：StopIteration Traceback (most recent call last) ~\Anaconda3\lib\site-packages\nltk\util.py in ngrams(sequence, n, pad_left, pad_right, left_pad_symbol, right_pad_symbol) 467 while n > 1: --> 468 history.append(next(sequence)) 469 n -= 1StopIteration: The above exception was the direct cause of the following exception:RuntimeError Traceback (most recent call last)<ipython-input-76-8982951528bd> in <module>() 1 bigram=[] 2 for i in range(5000):----> 3 ls=list(nltk.bigrams(docToken[i])) 4 for j in ls: 5 bigram.append(list(j))~\Anaconda3\lib\site-packages\nltk\util.py in bigrams(sequence, **kwargs) 489 """ 490 --> 491 for item in ngrams(sequence, 2, **kwargs): 492 yield item 493 RuntimeError: generator raised StopIteration

查看完整描述

3 回答

三国纷争

TA贡献1804条经验获得超7个赞

我也面临同样的错误。一个可能的原因可能是其中一个元素docToken是一个空列表。

例如，当i=2第二个元素是空列表时，以下代码会引发相同的错误。

from nltk import bigrams

docToken= [['the', 'wildlings', 'are', 'dead'], [], ['do', 'the', 'dead', 'frighten', 'you', 'ser', 'waymar']]

for i in range(3):

print (i)

print (list(nltk.bigrams(docToken[i])))

输出：

[('the', 'wildlings'), ('wildlings', 'are'), ('are', 'dead')]

---------------------------------------------------------------------------

StopIteration Traceback (most recent call last)

~\AppData\Local\Continuum\anaconda3\lib\site-packages\nltk\util.py in ngrams(sequence, n, pad_left, pad_right, left_pad_symbol, right_pad_symbol)

467 while n > 1:

--> 468 history.append(next(sequence))

469 n -= 1

StopIteration:

The above exception was the direct cause of the following exception:

RuntimeError Traceback (most recent call last)

<ipython-input-58-91f35cae32ed> in <module>

2 for i in range(3):

3 print (i)

----> 4 list(nltk.bigrams(docToken[i]))

~\AppData\Local\Continuum\anaconda3\lib\site-packages\nltk\util.py in bigrams(sequence, **kwargs)

489 """

490

--> 491 for item in ngrams(sequence, 2, **kwargs):

492 yield item

493

RuntimeError: generator raised StopIteration

您可以从中过滤出空列表docToken，然后创建二元组：

docToken= [['the', 'wildlings', 'are', 'dead'], [], ['do', 'the', 'dead', 'frighten', 'you', 'ser', 'waymar']]

docToken = [x for x in docToken if x]

bigram = []

for i in range(len(docToken)):

bigram.append(["_".join(w) for w in bigrams(docToken[i])])

bigram

输出：

[['the_wildlings', 'wildlings_are', 'are_dead'],

['do_the',

'the_dead',

'dead_frighten',

'frighten_you',

'you_ser',

'ser_waymar']]

另一个可能的原因可能是您nltk在 python 3.7 中使用了 3.3。

请使用 nltk 3.4，它是第一个支持 Python 3.7 的版本，您的问题应该在这个版本中得到解决。

请看这里。

反对回复 2022-01-18

红糖糍粑

TA贡献1815条经验获得超6个赞

我通过从 3.3 -> 3.4 升级 nltk 来解决这个问题

做简单：

pip install nltk==3.4

反对回复 2022-01-18

ITMISS

TA贡献1871条经验获得超8个赞

我无法解决此错误。不知道为什么nltk.bigrams(docToken[i])会生成这个，但我能够使用以下代码创建二元组。

bigram={}

for i in range(size):

ls=[]

for j in range(len(docToken[i])-1):

for k in range(j,len(docToken[i])-1):

ls.append([docToken[i][j],docToken[i][k+1]])

bigram[i]=ls

反对回复 2022-01-18

3 回答
0 关注
455 浏览

关注

添加回答

0/150

提交

取消

热搜

最近搜索清空

为什么我收到 RuntimeError：生成器引发了 StopIteration？以及如何解决？

为什么我收到 RuntimeError：生成器引发了 StopIteration？以及如何解决？

3 回答

添加回答