首页 > 解决方案 > 使用 Python 从 Student Doctor Network (SDN) 收集面试问题

问题描述

我是一个 python 新手,我正在开发一个从学生医生网络(一个流行的医学预科学生论坛)上抓取医学院面试问题。我不确定创建一个随机打印来自任何特定学校的面试问题的程序的最简单方法是什么。

带有问题的示例页面:https ://www.studentdoctor.net/schools/school/emory/survey/26/emory-university-school-of-medicine/1

我不确定 BeautifulSoup 是否是正确的选择,但希望能提供任何帮助或指导,以找到为我的程序提取面试问题的最佳方法。谢谢!

标签: pythonweb-scraping

解决方案


您可以使用此示例如何从页面中提取问题:

import requests
from bs4 import BeautifulSoup


url = "https://www.studentdoctor.net/schools/school/emory/survey/26/emory-university-school-of-medicine/1"

soup = BeautifulSoup(requests.get(url).content, "html.parser")
for question in soup.select("h3"):
    q = question.get_text(strip=True).replace("\n", " ")
    if q.endswith("?"):
        print(q)

印刷:

How to tell strength of clinical years?
How did the interview impress you?
What was the stress level of the interview?
How you think you did?
How do you rank this school among ALL other schools?
How long was the interview?
Where did the interview take place?
What was the style of the interview?
What type of interview was it?
How many people interviewed you?
What is one of the specific questions they asked you (question 1)?
What is one of the specific questions they asked you (question 2)?
What is one of the specific questions they asked you (question 3)?
What was the most interesting question?
What was the most difficult question?
How did you prepare for the interview?
What impressed you positively?
What impressed you negatively?
What did you wish you had known ahead of time?
What are your general comments?
Who was the tour given by?
How did the tourguide seem?
How do you rank the facilities?
What is your in-state status?
What was your total time spent traveling?
What was your primary mode of travel?
About how much did you spend on room, food, and travel?
What airport did you use?
Where did you stay?
What is the name of the hotel you stayed in?
Would you recommend the hotel?
How do you rank this school among other schools to which you've applied?
What is your ranking of this school's location?
What is your ranking of this area's cultural life?
How is the responsiveness of the admissions office?
How is the friendliness of the admissions office?
How eco-friendly are the school's application materials and interview process?
What are your suggestions for the admissions office?

推荐阅读