当我尝试直接从 github 读取 csv 文件时,我得到了
ParserError
:
import pandas as pd
url = 'https://github.com/marcopeix/AppliedTimeSeriesAnalysisWithPython/tree/main/data/jj.csv'
df = pd.read_csv(url)
ParserError: Error tokenizing data. C error: Expected 1 fields in line 41, saw 29
但是如果我下载文件并从磁盘读取它,它就可以正常工作:
df = pd.read_csv('/home/data/jj.csv')
您的页面未链接到原始 CSV 文件,而是链接到 HTML 页面
您应该使用原始 URL:
url = 'https://raw.githubusercontent.com/marcopeix/AppliedTimeSeriesAnalysisWithPython/refs/heads/main/data/jj.csv'
这不是获取实际数据的 URL,而是 Github 的链接。
您应该使用RAW变体:
https://raw.githubusercontent.com/marcopeix/AppliedTimeSeriesAnalysisWithPython/refs/heads/main/data/jj.csv
import pandas as pd
url = 'https://raw.githubusercontent.com/marcopeix/AppliedTimeSeriesAnalysisWithPython/refs/heads/main/data/jj.csv'
df = pd.read_csv(url)