Class JsoupBasedHtmlParser
java.lang.Object
org.apache.jmeter.protocol.http.parser.BaseParser
org.apache.jmeter.protocol.http.parser.HTMLParser
org.apache.jmeter.protocol.http.parser.JsoupBasedHtmlParser
- All Implemented Interfaces:
- LinkExtractorParser
Parser based on JSOUP
- Since:
- 2.10
 TODO Factor out common code between LagartoBasedHtmlParserand this one (adapter pattern)
- 
Field SummaryFields inherited from class org.apache.jmeter.protocol.http.parser.HTMLParserATT_ARCHIVE, ATT_BACKGROUND, ATT_CODE, ATT_CODEBASE, ATT_DATA, ATT_HREF, ATT_IS_IMAGE, ATT_REL, ATT_SRC, ATT_STYLE, ATT_TYPE, DEFAULT_PARSER, ICON, IE_UA, IE_UA_PATTERN, PARSER_CLASSNAME, PRELOAD, SHORTCUT_ICON, STYLESHEET, TAG_APPLET, TAG_BASE, TAG_BGSOUND, TAG_BODY, TAG_EMBED, TAG_FRAME, TAG_IFRAME, TAG_IMAGE, TAG_INPUT, TAG_LINK, TAG_OBJECT, TAG_SCRIPT
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptiongetEmbeddedResourceURLs(String userAgent, byte[] html, URL baseUrl, URLCollection coll, String encoding) Get the URLs for all the resources that a browser would automatically download following the download of the HTML content, that is: images, stylesheets, javascript files, applets, etc...Methods inherited from class org.apache.jmeter.protocol.http.parser.HTMLParserextractIEVersion, getEmbeddedResourceURLs, getEmbeddedResourceURLs, isEnableConditionalComments, normalizeUrlValueMethods inherited from class org.apache.jmeter.protocol.http.parser.BaseParsergetParser, isReusable
- 
Constructor Details- 
JsoupBasedHtmlParserpublic JsoupBasedHtmlParser()
 
- 
- 
Method Details- 
getEmbeddedResourceURLspublic Iterator<URL> getEmbeddedResourceURLs(String userAgent, byte[] html, URL baseUrl, URLCollection coll, String encoding) throws HTMLParseException Description copied from class:HTMLParserGet the URLs for all the resources that a browser would automatically download following the download of the HTML content, that is: images, stylesheets, javascript files, applets, etc...All URLs should be added to the Collection. Malformed URLs can be reported to the caller by having the Iterator return the corresponding RL String. Overall problems parsing the html should be reported by throwing an HTMLParseException. N.B. The Iterator returns URLs, but the Collection will contain objects of class URLString. - Specified by:
- getEmbeddedResourceURLsin class- HTMLParser
- Parameters:
- userAgent- User Agent
- html- HTML code
- baseUrl- Base URL from which the HTML code was obtained
- coll- URLCollection
- encoding- Charset
- Returns:
- an Iterator for the resource URLs
- Throws:
- HTMLParseException- when parsing the- htmlfails
 
 
-