Python and BeautifulSoup4's BS4's Decompose Method To Remove Unwanted Inner Tags



< a href="/Products/MX72244">
< span class="c-shca-icon-item__body-name-brand">

ADATA
< / span>
XPG SX8200 PCI - E 3.0 x4 M .2 NVMe SSD, 480 GB
< / a>


The problem in the html above is that there is a "span" inside the "a" and this is unusual except for a few products with ADATA.
Otherwise it would print out ADATA with the other product information "XPG SX8200 PCI - E 3.0 x4 M .2 NVMe SSD, 480 GB". So we just want the product info and not the ADATA inside span.

The solution is to use beautifulsoup's "decompose method below".



hddname = prices.find("div", {"class": "c-shca-icon-item__body-name"})
hddname=hddname.find("a")
hddname.span.decompose()
hddname=hddname.get_text().strip()

Once we have the "a" tag contents which contain a "span inside". We just use the decompose() method.

The format is as follows

soupname.tagname.decompose()


Tags:

python, beautifulsoup, bs, decompose, method, unwanted, tags, div, quot, shca, icon, item__body, href, products, mx, span, adata, xpg, sx, pci, nvme, ssd, gb, html, info, hddname, prices, get_text, contents, contain, format, follows, soupname, tagname,

  • Linux Mint 18.2 Create Config File To Start Application Upon Login
  • Dell Wyse Thin Client BIOS Access Key
  • sudoers fle in /etc warning about comments/includes!
  • Centos 7 Reallocate logical volume space to another
  • lvm how to reduce volume size
  • letsencrypt certbot error "Unable to find a virtual host listening on port 80 which is currently needed for Certbot to prove to the CA that you control your domain. Please add a virtual host for port 80."
  • SSH error cannot Forward or Listen "bind: Cannot assign requested address"
  • X11 SSH Linux Forwarding Error
  • Using Microsoft's Group Policy Management Editor (GPME) To Change Default Permissions and Internet Explorer Security Settings
  • Thunderbird How To Disable An Email Account
  • iptables linux firewall recommended rules for public computing
  • LVM How To Create Pool (dynamically allocated and not fully allocated)
  • VMWare Pro Workstation Nic Disconnected and No IP Using NAT
  • Linux How To Create A RamDisk
  • mdadm force resync when resync=PENDING solution
  • Proxmox Breaks Storage/LVM Backing If Killing QEMU-IMG
  • Proxmox trying to acquire lock... TASK ERROR: can't lock file '/var/lock/qemu-server/lock-102.conf' - got timeout
  • Debian 9 SSH root password authentication failure password not working problem / solution
  • QEMU / KVM How To Manually Create Basic Virtual Machine VM
  • Linux wlan0 check all wireless clients