如何使用Python从WSJ中提取每日收盘价？

2024-05-14 • 问答

我使用python 3和pandas将WSJ的每日收盘价解析为EXCEL。但是，无法提取网页屏幕上显示的每日关闭时间。这里是链接：“ https://quotes.wsj.com/index/COMP/historical-prices” 如何将屏幕上的关闭数据下载到Excel中？以及如何使用其他名称（例如comp.xlxs）将“下载电子表格”按钮文件下载到excel中？

Here are the codes:

import requests
import pandas as pd

url = 'https://quotes.wsj.com/index/COMP/historical-prices'

jsonData = requests.get(url).json()

final_df = pd.DataFrame()
for row in jsonData['data']:
    #row = jsonData['data'][1]

    data_row = []
    for idx,colspan in enumerate(row['colspan']):
        colspan_int = int(colspan[0])
        data_row.append(row['td'][idx] * colspan_int)
        flat_list = [item for sublist in data_row for item in sublist]
    temp_row = pd.DataFrame([flat_list])
    final_df = final_df.append(temp_row,sort=True).reset_index(drop=True)


wait2 = input("PRESS ENTER TO CONTINUE.")

关注问题引号：

#
url = 'https://quotes.wsj.com/index/HK/XHKG/HSI/historical-prices/download?num_rows=15&range_days=15&endDate=12/06/2019'
response = requests.get(url)
open('HSI.csv','wb').write(response.content)
read_file = pd.read_csv (r'C:\A-CEO\REPORTS\STOCKS\PROFILE\Python\HSI.csv')
read_file.to_excel (r'C:\A-CEO\REPORTS\STOCKS\PROFILE\Python\HSI.xlsx',index = None,header=True)

#
url = 'https://quotes.wsj.com/index/SPX/historical-prices/download?num_rows=15&range_days=15&endDate=12/06/2019'
response = requests.get(url)
open('SPX.csv','wb').write(response.content)
read_file = pd.read_csv (r'C:\A-CEO\REPORTS\STOCKS\PROFILE\Python\SPX.csv')
read_file.to_excel (r'C:\A-CEO\REPORTS\STOCKS\PROFILE\Python\SPX.xlsx',header=True)


#
url = 'https://quotes.wsj.com/index/COMP/historical-prices/download?num_rows=15&range_days=15&endDate=12/06/2019'
response = requests.get(url)
open('COMP.csv','wb').write(response.content)
read_file = pd.read_csv (r'C:\A-CEO\REPORTS\STOCKS\PROFILE\Python\COMP.csv')
read_file.to_excel (r'C:\A-CEO\REPORTS\STOCKS\PROFILE\Python\COMP.xlsx',header=True)

import requests import pandas as pd import io #original URL had a bunch of other parameters I omitted,only these seem to matter but YMMV url = 'https://quotes.wsj.com/index/COMP/historical-prices/download?num_rows=360&range_days=360&endDate=11/06/2019' response = requests.get(url) #do this if you want the CSV written to your machine open('test_file.csv','wb').write(response.content) # this decodes the content of the downloaded response and presents it to pandas df_test = pd.read_csv(io.StringIO(response.content.decode('utf-8')))

base_url = 'https://quotes.wsj.com/index/{ticker_name}/historical-prices/download?num_rows=360&range_days=360&endDate=11/06/2019' ticker_list = ['COMP','SPX','HK/XHKG/HSI'] for ticker in ticker_list: response = requests.get(base_url.format(ticker_name = ticker)) #do this if you want the CSV written to your machine open('prices_'+ticker.replace('/','-')+'.csv','wb').write(response.content)

如何使用Python从WSJ中提取每日收盘价？

nihao10 回答：如何使用Python从WSJ中提取每日收盘价？

大家都在问