Forum in maintenance, we will be back soon 🙂

Web scraping from Indeed.com

Python Scripting

Last Post by Hasan Aboul Hasan 1 year ago

2 Posts

2 Users

0 Reactions

527 Views

sIVARAM bandaru

(@sivaram-bandaru)

Posts: 43

Trusted Member Customer

Topic starter

Step 1: I did a job search on Indeed.com and used the resulting link in Python, as below.

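from bs4 import BeautifulSoup
import requests

url = "https://www.indeed.com/jobs?q=IT+director&l=Remote&from=searchOnHP&vjk=84953521ad7c4774"
# Set a timeout so a stalled connection fails instead of hanging
req = requests.get(url, timeout=10)

soup = BeautifulSoup(req.text, "html.parser")

# Find the <meta name="description"> tags; "name" must go through attrs
# because find_all() already uses name= for the tag name
job_posts = soup.find_all("meta", attrs={"name": "description"})
# Print the description text, which is stored in the content attribute
for job_post in job_posts:
    title = job_post.get("content")
    print(title)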

Step 2: I verified that the page's HTML contains the description, as below.

<meta http-equiv="content-type" content="text/html; charset=utf-8">

	<meta name="description" content="1,349 IT Director jobs available in Remote on Indeed.com. Apply to Director of Information Technology, Director of Partnerships, Director of Analytics and more!">

	<meta name="referrer" content="origin-when-cross-origin">
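For reference, the description in the markup above lives in the tag's content attribute, not in a child element. A minimal sketch of reading it, assuming only the three tags quoted in Step 2 and using the standard library's html.parser so it runs without extra packages. MetaDescriptionParser is a hypothetical name for this sketch. With BeautifulSoup, the equivalent lookup is soup.find_all("meta", attrs={"name": "description"}).

```python
from html.parser import HTMLParser


class MetaDescriptionParser(HTMLParser):
    """Collect the content attribute of every <meta name="description"> tag."""

    def __init__(self):
        super().__init__()
        self.descriptions = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        if attributes.get("name") == "description":
            self.descriptions.append(attributes.get("content", ""))


# The three <meta> tags quoted in Step 2, used here as offline sample input.
html = """
<meta http-equiv="content-type" content="text/html; charset=utf-8">
<meta name="description" content="1,349 IT Director jobs available in Remote on Indeed.com. Apply to Director of Information Technology, Director of Partnerships, Director of Analytics and more!">
<meta name="referrer" content="origin-when-cross-origin">
"""

parser = MetaDescriptionParser()
parser.feed(html)
description = parser.descriptions[0]
print(description)
```

Only the tag whose name attribute equals "description" is collected; the content-type and referrer tags are skipped.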

Step 3: Running the Python code from Step 1 gives many errors, ending with the lines below.

line 507, in send
raise ConnectTimeout(e, request=request)

What am I doing wrong?

Posted : 09/08/2023 4:29 pm

Topic Tags

web scraping

Hasan Aboul Hasan

(@admin)

Posts: 1253

Member Admin

Indeed.com, like many other websites, has terms of service that prohibit scraping.
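On the error itself: requests raises ConnectTimeout when no connection to the server is established in time, so the failure happens in requests.get, before any of the BeautifulSoup code runs. The two traceback lines do not show whether the site is refusing automated clients or the network is at fault. A minimal sketch of making that failure explicit, assuming a 10-second timeout; fetch_html and its get parameter are hypothetical names, and get exists only so the network call can be swapped out in a test.

```python
import requests


def fetch_html(url, timeout=10, get=requests.get):
    """Return the page body, or None if the request times out.

    Hypothetical helper for illustration; `get` defaults to requests.get.
    """
    try:
        response = get(url, timeout=timeout)
    except requests.exceptions.Timeout as exc:
        # ConnectTimeout and ReadTimeout are both subclasses of Timeout.
        print(f"Request timed out: {exc}")
        return None
    # Raise requests.exceptions.HTTPError for 4xx/5xx status codes.
    response.raise_for_status()
    return response.text
```

A caller can then check for None before handing the text to BeautifulSoup, instead of getting a long traceback.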

Posted : 09/09/2023 7:09 am


© 2024 LearnWithHasan | All Rights Reserved.