Library to extract data from HTML and XML using XPath and CSS
