Python and BeautifulSoup4's BS4's Decompose Method To Remove Unwanted Inner Tags

<div class="c-shca-icon-item__body-name"> < a href="/Products/MX72244"> < span class="c-shca-icon-item__body-name-brand"> ADATA < / span> XPG SX8200 PCI - E 3.0 x4 M .2 NVMe SSD, 480 GB < / a>

The problem in the html above is that there is a "span" inside the "a" and this is unusual except for a few products with ADATA.
Otherwise it would print out ADATA with the other product information "XPG SX8200 PCI - E 3.0 x4 M .2 NVMe SSD, 480 GB". So we just want the product info and not the ADATA inside span.

The solution is to use beautifulsoup's "decompose method below".

hddname = prices.find("div", {"class": "c-shca-icon-item__body-name"}) hddname=hddname.find("a") hddname.span.decompose() hddname=hddname.get_text().strip()

Once we have the "a" tag contents which contain a "span inside". We just use the decompose() method.

The format is as follows

soupname.tagname.decompose()

Tags:

python, beautifulsoup, bs, decompose, method, unwanted, tags, div, quot, shca, icon, item__body, href, products, mx, span, adata, xpg, sx, pci, nvme, ssd, gb, html, info, hddname, prices, get_text, contents, contain, format, follows, soupname, tagname,