首页 > 解决方案 > 为词频代码编码时出现 Web 抓取错误

问题描述

我正在做一个网络抓取项目,我是 python 新手,请在此代码的基础上帮助我。

 word_frequency = {}
        for token in k:
                if token not in word_frequency.keys():
                    word_frequency[token] = 
                else:
                    word_frequency[token] += 
        max_frequency = max(word_frequency.values)
        max_frequency
        word_highest = nlargest(100, word_frequency,key = word_frequency.get)
        word_highest
    #plotting line graph
    keys = word_frequency.keys()
    values = word_frequency.values()
    
    keys = list(keys)[:21] #first 20 most frequent words
    values = list(values)[:21]# frequency of first 20 most frequent words
    
    plt.plot(keys, values)
    plt.xticks(rotation = 80)
    plt.xlabel('Words')
    plt.ylabel('Frequency')
    plt.title('Word Frequency')
    plt.show()

我应该怎么办?我的语法错误是:

     NameError                                 Traceback (most recent call last)
        <ipython-input-2-1783d4bb5793> in <module>()
              2 #determining up the frequency
              3 word_frequency = {}
        ----> 4 for token in k:
              5         if token not in word_frequency.keys():
              6             word_frequency[token] = 1# if word occured only once put value as 1
        
        NameError: name 'k' is not defined
        TypeError                                 Traceback (most recent call last)
        <ipython-input-5-293f321e384b> in <module>()
              1 # finding the maximum frequency
        ----> 2 max_frequency = max(word_frequency.values)
              3 max_frequency
        
        TypeError: 'builtin_function_or_method' object is not iterable
        NameError                                 Traceback (most recent call last)
        <ipython-input-4-29ca2b9912cd> in <module>()
              1 # Accesing first 100 words with largest frequency
        ----> 2 word_highest = nlargest(100, word_frequency,key = word_frequency.get)
              3 word_highest
        
        NameError: name 'nlargest' is not defined
        NameError                                 Traceback (most recent call last)
        <ipython-input-6-06cf5f0bf91a> in <module>()
              6 values = list(values)[:21]# frequency of first 20 most frequent words
              7 
        ----> 8 plt.plot(keys, values)
              9 plt.xticks(rotation = 80)
             10 plt.xlabel('Words')
        
        NameError: name 'plt' is not defined

请帮帮我,我该怎么办?

标签: python

解决方案


推荐阅读