首页 > 解决方案 > sb.lmplot() 的问题

问题描述

我想用 lmplot 探索我的 dataa 之间的线性关系,但它返回错误。有人可以帮我理解我的代码有什么问题吗?我已经清理了我的数据,消除了空数据并以浮点格式转换了 str 格式的值。之后我尝试做 lmplot()。这是我找到 csv 文件的链接;关联

代码

#Analysis of obesity by country

import pandas as pd
import seaborn as sb
import matplotlib.pyplot as plt
import numpy as np 

address = 'C:/Users/Andre/Desktop/Python/firstMN/obesity-cleaned.csv'
dt = pd.read_csv(address)

#eliminate superfluos data
dt.drop(dt['Obesity (%)'][dt['Obesity (%)'].values == 'No data'].index, inplace=True)  

for i in range(len(dt)):
   dt['Obesity (%)'].values[i] = float(dt['Obesity (%)'].values[i].split()[0]) 

#print(dt['Country'], '\n') 
#print(dt['Obesity (%)'])

sb.lmplot('Country', 'Obesity (%)', dt) 
plt.show(

头部数据集

   Unnamed: 0      Country  Year Obesity (%)         Sex
0           0  Afghanistan  1975         0.5  Both sexes
1           1  Afghanistan  1975         0.2        Male
2           2  Afghanistan  1975         0.8      Female
3           3  Afghanistan  1976         0.5  Both sexes
4           4  Afghanistan  1976         0.2        Male
5           5  Afghanistan  1976         0.8      Female
6           6  Afghanistan  1977         0.6  Both sexes
7           7  Afghanistan  1977         0.2        Male
8           8  Afghanistan  1977         0.9      Female
9           9  Afghanistan  1978         0.6  Both sexes

输出

Traceback (most recent call last):
  File "C:\Users\Andre\Desktop\Python\firstMN\obesity.py", line 23, in <module>
    sb.lmplot('Country', 'Obesity (%)', dt)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\regression.py", line 616, in lmplot
    facets.map_dataframe(regplot, x, y, **regplot_kws)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\axisgrid.py", line 828, in map_dataframe
    self._facet_plot(func, ax, args, kwargs)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\axisgrid.py", line 846, in _facet_plot
    func(*plot_args, **plot_kwargs)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\regression.py", line 817, in regplot
    plotter.plot(ax, scatter_kws, line_kws)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\regression.py", line 369, in plot
    self.lineplot(ax, line_kws)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\regression.py", line 412, in lineplot
    grid, yhat, err_bands = self.fit_regression(ax)
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\seaborn\regression.py", line 200, in fit_regression
    grid = np.linspace(x_min, x_max, 100)
  File "<__array_function__ internals>", line 5, in linspace
  File "C:\Users\Andre\AppData\Local\Programs\Python\Python38-32\lib\site-packages\numpy\core\function_base.py", line 129, in linspace
    start = asanyarray(start) * 1.0
numpy.core._exceptions.UFuncTypeError: ufunc 'multiply' did not contain a loop with signature matching types (dtype('<U32'), dtype('<U32')) -> dtype('<U32')

标签: pythonpandasmachine-learningseaborndata-science

解决方案


问题是lmplot您需要两个数字,您不能将“国家”作为 x 值。你需要在这里使用另一个情节。

如果你这样做(虽然它没有意义),它的工作原理

sb.lmplot(x='Year', y='Obesity (%)', data=dt) 
plt.show()

推荐阅读