我想要以下功能.
- input : this is test <b> bold text </b> normal text
- expected output: this is test normal text
解决方法
使用BeautifulSoup的解决方案:
- from BeautifulSoup import BeautifulSoup
- def removeTag(soup,tagname):
- for tag in soup.findAll(tagname):
- contents = tag.contents
- parent = tag.parent
- tag.extract()
- s = BeautifulSoup("abcd <b> btag </b> hello <d>dtag</d>")
- removeTag(s,"b")
- print s
- removeTag(s,"d")
- print s
收益:
- >>>
- abcd hello <d>dtag</d>
- abcd hello