Questions tagged [beautifulsoup]

0 votes
1 replies
Parser BS4 write to file
I'm study python and have a task. I have to write the scraping results to file, but there are some errors. Each string in the result file as "No...
asked 4 months ago
-1 votes
2 replies
Problem with scraping data from website with BeautifulSoup
I am trying to take a movie rating from the website Letterboxd. I have used code like this on other websites and it has worked, but it is not get...
0 votes
1 replies
Get same tags with different content from xml with beautifulsoup
I've got this xml: <dc:type>image fixe</dc:type> <dc:type>image</dc:type> <dc:type>still image</dc:type> <...
asked 4 months ago
2 votes
2 replies
How to scrape multiple webpages without overwriting the results?
New to scraping and trying to scrape multiple webpages from Transfermarkt without overwriting the previous one. Know that this question has been...
1 votes
1 replies
Not able to scrape specific information off BSE website using bs4
I'm trying to scrape the previous close and open stock price from this website. Here's an image as a reference to where the information to scrape...
asked 4 months ago
0 votes
1 replies
Can't figure out how to get BS4 to retrieve youtube view count on /videos page
I am trying to navigate to a page using BS4, lets use /history for an example. I want to gather the viewcount for all the videos that are current...
0 votes
1 replies
How can I scrape data from an HTML table into a Python list/dict?
I'm trying to import data from Baseball Prospectus into a Python table / dictionary (which would be better?). Below is what I have, based on fo...
1 votes
1 replies
What exactly is a BS4 'element', how are elements counted, which parser gets to decide? Obviously confused
I am now confused by something I thought I understood, but turns out I've been taking for granted. One frequently encounters this type of for l...
1 votes
1 replies
I can't make a loop over multiples pages in Beautifulsoup
I'm trying to scrape data estate in this link: https://www.pap.fr/annonce/ventes-maisons- I was able to scrape the data but only from the first...
asked 4 months ago
1 votes
3 replies
I can't display html code - Beautifulsoup
(I'm a beginner in the web scraping) I want to scrap this link: https://www.seloger.com/list.htm?tri=initial&idtypebien=1,2&pxMax=300000...
asked 4 months ago
2 votes
1 replies
Extracting particular text section between tags from HTML
I would like to extract text in a specific section from HTML file (section "Item 1A"). I want to get text start from "Item 1A", in the content se...
asked 4 months ago
0 votes
3 replies
I need to display the third <li> in BeautifulSoup
I am trying to scrape a page Web, the problem that I can't scrape the third item, I managed to display the first item with this code : repo = so...
asked 4 months ago
2 votes
3 replies
I want to get the image link inside a RSS feed description tag
I want to get the image link inside a RSS feed description tag. Using feedparser got the values in the discription tag.But i want to get the ima...
4 votes
2 replies
Link attribute not getting printed in BeautifulSoup object
I am coding a program that will get pull the top news headlines from google news. It is supposed to be printing the headline and the link for the...
asked 4 months ago
-4 votes
1 replies
How can I write all the answer in the function zip or in Json?
I have a response from the site. I want to write it in Json or Zip. But I do not know how to do it. What should I write ? import requests from b...
asked 4 months ago
1 votes
1 replies
How to post a form to aspx site using python
I am trying to do a search query using python on this site https://www.ahpra.gov.au/Registration/Registers-of-Practitioners.aspx?m=Search but am...
0 votes
1 replies
How to convert DataFrame from HTML to SQL using Pandas and use it for a Search field in Flask?
Having gather all results from scrapping a website using BeautifulSoup and have generated an HTML file with all of the lists that were fulfilled...
asked 4 months ago
2 votes
2 replies
How to define the “source.find” part of BeautifulSoup
I need to scrape a list of restaurant links from a food delivery website to afterwards scrape their menus. This is the site i wanna scrape: https...
2 votes
2 replies
How to scrape a website table where the cell values have the same class name?
I am trying to scrape a (football squad) table from Transfermarkt.com for a project but some columns have the same class name and cannot be diffe...
2 votes
5 replies
Scraping hidden text of hotels reviews
I am scraping hotels customers reviews, from yelp platform, but I am struggling to get the text from each review. I try selenium find_element_by...
0 votes
1 replies
Is it possible to search multiple containers in an a single line of code?
I have a scraper that scrapes a page for products. Every container is set up the same way, but they are grouped into several different s. I can w...
asked 4 months ago
1 votes
1 replies
scraping different dates through loop
I have a python code to scrape one page from soccer results and odds website from selenium import webdriver from selenium.webdriver.common.by im...
0 votes
1 replies
How to load XML elements' content separately to Python list?
I have an XML file named 'config.xml': <?xml version="1.0" encoding="UTF-8"?> <config> <set1> <data1> data content...
0 votes
2 replies
How can I loop through multiple unknown number of pages and get their texts after the year is substituted in the url?
I am trying to extract some information based on the year entered in the url. The information extracted is from an unknown number of pages. How...
asked 4 months ago
1 votes
1 replies
Getting strings as a list into a single line with beautifulsoup
I want to get address content in a signle line as it creates a problem when I try to write them to csv text = """ <B721> <PARTY-US>...
asked 4 months ago