Channel: Python: (Beautifulsoup) How to limit extracted text from a html news article to only the news article. - Stack Overflow

Python: (Beautifulsoup) How to limit extracted text from a html news article to only the news article.

September 19, 2016, 2:01 pm

≪ Previous: Answer by Vinnie James for Python: (Beautifulsoup) How to limit extracted text from a html news article to only the news article.

I wrote this test code which uses BeautifulSoup.

url = "http://www.dailymail.co.uk/news/article-3795511/Harry-Potter-sale-half-million-pound-house-Iconic-Privet-Drive-market-suburban-Berkshire-complete-cupboard-stairs-one-magical-boy.html"html = urllib.request.urlopen(url).read()  soup = BeautifulSoup(html,"lxml")for n in soup.find_all('p'):    print(n.get_text())

It works fine but it also retrieves text that is not part of the news article, such as the time it was posted, number of comments, copyrights ect.

I would wish for it to only retrieve text from the news article itself, how would one go about this?

↧

↧

Latest Images

Eco Data 4/26/24

Eco Data 4/26/24

April 25, 2024, 5:00 pm

‘Pay day every day’ may become Shangri-La Group, BPOs’ secret to happy employees

April 25, 2024, 5:51 am

Nonprofit donates custom home in this East Bay city for Marine injured in...

Nonprofit donates custom home in this East Bay city for Marine injured in...

April 23, 2024, 7:00 am

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

April 22, 2024, 6:00 am

Ukraine bans military from online gambling amid addiction concerns

Ukraine bans military from online gambling amid addiction concerns

April 22, 2024, 5:17 am

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

April 20, 2024, 8:08 pm

OCBC Bank Singapore Offers Up to 2.8% p.a. Fixed Deposit Promotion from 21...

April 20, 2024, 12:38 pm

National Poetry Month 2024: Maxine Starr

National Poetry Month 2024: Maxine Starr

April 19, 2024, 9:56 am

Vegan Chicken Pot Pie

Vegan Chicken Pot Pie

April 19, 2024, 9:18 am

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

April 19, 2024, 7:03 am

Trending Articles

A Wall Street guide to watches

August 5, 2015, 7:32 am

Show Dem Camp – Palm Wine Music (Vol. 2 Album)

October 19, 2018, 1:16 am

Who Is Jennifer Hines? Bryan Olesen Wife Is Mother Of 3 Kids

March 5, 2024, 2:19 am

AUDIO | Diamond Platnumz ft Mugabe - LawaMa | Download

July 25, 2014, 8:00 am

Consuelo Ortiga y Rey: The "Crush ng Bayan" in Rizal's Time

August 4, 2013, 11:32 pm

Happy Birthday Wishes for Bhabhi in Hindi & English |हैप्पी बर्थडे भाभी

March 13, 2020, 3:01 am

Bar Rescue - The Prime Bar (WildeFire Bistro) Update

September 15, 2019, 6:50 am

Varzish Sport Tv HD Biss Key Frequency Update

January 15, 2017, 9:03 pm

NAT, NCAE, LAPG, SREYA, ELNA and PHIL-IR Materials and Reviewers

February 27, 2017, 6:16 pm

Guntur District Police Officers Mobile Numbers

April 17, 2017, 2:10 am

Monsters Among Us: John Michael Baker killed Shannon Stewart Roberts then...

January 5, 2017, 7:55 pm

Who Is Junior Pope?| Biography| Profile| History Of Nollywood Actor “Pope...

July 26, 2017, 8:45 am

Barbados Postal Codes St. Michael

July 27, 2013, 8:05 am

99 formas de llamarle a tus tetas

May 19, 2017, 5:00 am

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

August 20, 2016, 5:13 pm

Gangland murders in Dublin (1990-94)

April 17, 2020, 1:54 am

[Converge] Fiber Home Super Admin Access

September 12, 2020, 7:10 am

Understanding Xtream IPTV Code 2025 Nono 7

April 25, 2024, 1:58 pm

[THEME] osTicket Awesome ― fully responsive theme

May 29, 2016, 6:25 pm

Mp3 Download: Lil Baby X Gunna - My Jeans (feat. Young Thug)

October 13, 2018, 2:15 am

More Pages to Explore .....

Latest Images

Eco Data 4/26/24

Eco Data 4/26/24

April 25, 2024, 5:00 pm

‘Pay day every day’ may become Shangri-La Group, BPOs’ secret to happy employees

April 25, 2024, 5:51 am

Nonprofit donates custom home in this East Bay city for Marine injured in...

Nonprofit donates custom home in this East Bay city for Marine injured in...

April 23, 2024, 7:00 am

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

April 22, 2024, 6:00 am

Ukraine bans military from online gambling amid addiction concerns

Ukraine bans military from online gambling amid addiction concerns

April 22, 2024, 5:17 am

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

April 20, 2024, 8:08 pm

OCBC Bank Singapore Offers Up to 2.8% p.a. Fixed Deposit Promotion from 21...

April 20, 2024, 12:38 pm

National Poetry Month 2024: Maxine Starr

National Poetry Month 2024: Maxine Starr

April 19, 2024, 9:56 am

Vegan Chicken Pot Pie

Vegan Chicken Pot Pie

April 19, 2024, 9:18 am

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

April 19, 2024, 7:03 am

© 2024 //www.rssing.com