How do I clean html code with multiple unwanted newlines using Python?

I have a lot of HTML pages that have somehow ended up with many stray newline characters: each tag sits on its own line, and some sentences are split at apparently random points. Here is an example of what I am dealing with:

<html><head><title>One of many</title></head><body><h1>Spam is not ham</h1><p>Many plates of Spam</p><p>Use the Fry option to properly cook theSpamuntil done.</p><p>Enquiries for more recipes can be made through theFeed Meoption.</p></body></html>

I used the str.replace() method, with partial success for the opening tags, with this code:

html_filename = 'page.htm'
f = open(html_filename, encoding="utf-8")
file_str = f.readlines()
f.close()
with open(html_filename, 'w', encoding="utf-8") as f:
    for line in file_str:
        if '<h1>\n' in line:
            tmp = line.replace('<h1>\n', '<h1>')
            f.write(tmp)
        elif '<p>\n' in line:
            tmp = line.replace('<p>\n', '<p>')
            f.write(tmp)
        else:
            f.write(line)

and get the following result:

<html><head><title>One of many</title></head><body><h1>Spam is not ham</h1><p>Many plates of Spam</p><p>Use the Fry option to properly cook theSpamuntil done.</p><p>Enquiries for more recipes can be made through theFeed Meoption.</p></body></html>

However, I can't figure out how to resolve the lines with just text or the lines with just an end tag.
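For reference, here is a minimal sketch of one direction that might cover those remaining cases. It works on the whole file contents as a single string instead of line by line, because a line holding only text or only an end tag has to be joined to its neighbours. The function name `collapse_newlines` and the sample string are hypothetical, and the sketch assumes the pages contain no `<pre>` blocks or other content where newlines matter.

```python
import re


def collapse_newlines(html: str) -> str:
    """Join markup that has been split across lines (hypothetical helper)."""
    # Remove whitespace runs containing a newline directly after a tag.
    html = re.sub(r">\s*\n\s*", ">", html)
    # Remove whitespace runs containing a newline directly before a tag.
    html = re.sub(r"\s*\n\s*<", "<", html)
    # Any newline still left sits inside running text: use a single space.
    html = re.sub(r"\s*\n\s*", " ", html)
    return html.strip()


# Made-up sample in the spirit of the pages described above.
sample = (
    "<h1>\nSpam is not ham\n</h1>\n"
    "<p>\nUse the Fry option to\nproperly cook the Spam\nuntil done.\n</p>\n"
)
print(collapse_newlines(sample))
```

To apply this to a file, the text would be read with `f.read()` rather than `f.readlines()` and the result written back. One known limitation: newlines around inline tags such as `<b>` are removed without leaving a space, so words next to those tags end up glued together. An HTML parser would be a more robust choice if that matters.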

