Channel: Active questions tagged python - Stack Overflow

↧

SSL Certificate Verification Error When Scraping Website and Inserting Data into MongoDB

March 30, 2024, 5:11 pm

≫ Next: How to remove integers from a mixed numpy array containing sub-arrays and integers?

≪ Previous: How should I troubleshoot "RuntimeWarning: invalid value encountered in arccos" in NumPy?

Problem Description:

I'm attempting to scrape the website at https://www.cbit.ac.in/current_students/acedamic-calendar/ using the requests library along with BeautifulSoup. However, upon making a request to the website, I encounter the following SSL certificate verification error:

requests.exceptions.SSLError:  HTTPSConnectionPool(host='www.cbit.ac.in', port=443):    Max retries exceeded with url:      /current_students/acedamic-calendar/      (Caused by SSLError(SSLCertVerificationError(1,'[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1123)')))

Approach:

To address the SSL verification issue, I've attempted to specify the path to the CA certificate using the verify parameter in the requests.get() function call. The CA certificate path is /Users/rishilboddula/Downloads/cbit.ac.in.cer. Despite this, the SSL verification error persists.

After successfully scraping the website, I intend to store the extracted URLs in a MongoDB collection named ull using the pymongo library. However, due to the SSL verification error, I'm unable to proceed with the scraping and data insertion process.

Request for Assistance:

I'm seeking guidance on resolving the SSL certificate verification error to successfully scrape the website and insert the data into MongoDB. Additionally, if there are any best practices or alternative approaches for handling SSL certificate verification in Python, I would greatly appreciate any insights.

# Import necessary librariesimport requestsfrom bs4 import BeautifulSoupimport pymongo# Specify the path to the CA certificateca_cert_path = '/Users/rishilboddula/Downloads/cbit.ac.in.cer'# Make a request to the website with SSL verificationreq = requests.get('https://www.cbit.ac.in/current_students/acedamic-calendar/', verify=ca_cert_path)# Parse the HTML contentsoup = BeautifulSoup(req.content, 'html.parser')# Extract all URLs from the webpagelinks = soup.find_all('a')urls = [link.get('href') for link in links]# Connect to MongoDBclient = pymongo.MongoClient('mongodb://localhost:27017')db = client["data"]ull = db["ull"]# Insert each URL into the MongoDB collectionfor url in urls:    ull.insert_one({"url": url})

↧

Trending Articles

Bath man appears in court charged with attempted murder of a man...

March 16, 2015, 7:37 am

MACLEAN, Allan

July 30, 2019, 6:00 am

Black Angus Grilled Artichokes

July 16, 2016, 4:37 pm

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Police blotter for Jan. 12

January 12, 2018, 3:30 am

99 God Status for Whatsapp, Facebook

June 5, 2016, 11:46 pm

Rajasthan Board 12th Science Result 2018 name wise- RBSE 12th commerce result...

May 26, 2018, 9:35 pm

Notorious Naushad of Ippa gang nabbed

July 19, 2019, 6:37 am

Child Kidnapping: Amy McNeil was kidnapped on her way to school by 5 adults;...

February 5, 2017, 10:40 am

Sonible Smartlimit v1.1.5-R2R

April 16, 2024, 7:10 am

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

December 22, 2016, 3:50 am

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

February 13, 2020, 3:12 am

Arrow Flash 2 – Sinhala Dubbed – Episode 23 – 20th March 2016

March 20, 2016, 9:39 am

[GET] AI Traffic Goldmine

July 6, 2025, 4:23 am

[E² Plugin] HDF-Radio

January 26, 2025, 9:02 am

Universal Multi-Patch v1.3 By RADIXX11

January 29, 2018, 2:45 pm

IWAN – Thanks and Praise ( Throw Back Thursday )

March 9, 2016, 11:43 pm

RONALD P SONDERGAARD Arrested by Miami-Dade County Corrections on Mar 03, 2017

March 3, 2017, 6:25 am

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

May 17, 2020, 2:04 pm

HSSC Excise & Taxation Inspector Result 2017 Scorecard/ Category Wise Merit List

July 29, 2017, 2:44 am

© 2025 //www.rssing.com