public class LinkCollector extends BaseCollector implements Collector
DOM, parsedItems| Constructor and Description |
|---|
LinkCollector(String textBasedHTMLSource)
It gives the source to the parent constructor to make the DOM.
|
| Modifier and Type | Method and Description |
|---|---|
void |
parse()
It parsed the DOM.
|
void |
parseByRule(CollectorSelector rule)
It parses the DOM by rule ID.
|
protected void |
validateLinks(boolean noResource)
It removes the non-valid URL-s from the parsed results.
|
collectAttributeValueBy, collectAttributeValueBy, getItems, resetParsedItemspublic LinkCollector(String textBasedHTMLSource)
textBasedHTMLSource - A regular text based HTML code as a String.public void parse()
public void parseByRule(CollectorSelector rule)
parseByRule in interface Collectorrule - The ID of the rule.protected void validateLinks(boolean noResource)
noResource - When this is true, this method will removes the resource links from the parsed results.Copyright © 2015. All rights reserved.