public class TwitterDecidingScope extends org.archive.crawler.deciderules.DecidingScope
In addition, the number of results to be considered is determined by the parameters "pages" and "twitter_results_per_page".
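As a rough illustration of how these two parameters bound the crawl, the sketch below computes an upper limit on the number of results. It assumes that each page holds at most "twitter_results_per_page" results, so the limit is the product of the two values. The class ResultBudget and its method are hypothetical and are not part of TwitterDecidingScope.

```java
// Hypothetical helper for illustration only; not part of TwitterDecidingScope.
final class ResultBudget {
    private ResultBudget() {
    }

    /**
     * Upper bound on the number of results considered, assuming every
     * requested page is completely filled.
     */
    static long maxResults(int pages, int resultsPerPage) {
        if (pages < 0 || resultsPerPage < 0) {
            throw new IllegalArgumentException("pages and resultsPerPage must be non-negative");
        }
        // Widen to long before multiplying so large settings cannot overflow.
        return (long) pages * resultsPerPage;
    }

    public static void main(String[] args) {
        // Three pages of up to 100 results each.
        System.out.println(maxResults(3, 100));
    }
}
```

Fewer results may be returned in practice, for example when the last page of a search is only partly filled.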
Modifier and Type | Field and Description |
---|---|
static String | ATTR_GEOLOCATIONS: Attribute/value pair. |
static String | ATTR_KEYWORDS: Attribute/value pair. |
static String | ATTR_LANG: Attribute/value pair. |
static String | ATTR_PAGES: Attribute/value pair. |
static String | ATTR_QUEUE_KEYWORD_LINKS: Attribute/value pair specifying whether an HTML search for the given keyword(s) should also be queued. |
static String | ATTR_QUEUE_LINKS: Attribute/value pair specifying whether embedded links should be queued. |
static String | ATTR_QUEUE_USER_STATUS: Attribute/value pair specifying whether the status of discovered users should be harvested. |
static String | ATTR_QUEUE_USER_STATUS_LINKS: Attribute/value pair specifying whether all links embedded in a user's status should additionally be queued. |
static String | ATTR_RESULTS_PER_PAGE: Attribute/value pair. |
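The four queueing attributes can be read as independent switches. The sketch below models them as plain booleans and lists what would be queued under a given combination. It is an illustration under assumptions, not the NetarchiveSuite implementation: the class QueueOptions, its field names, and the descriptive strings are hypothetical, and it assumes that links in user statuses are only queued when user statuses themselves are harvested.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model of the four boolean options; for illustration only.
final class QueueOptions {
    final boolean queueLinks;
    final boolean queueUserStatus;
    final boolean queueUserStatusLinks;
    final boolean queueKeywordLinks;

    QueueOptions(boolean queueLinks, boolean queueUserStatus,
                 boolean queueUserStatusLinks, boolean queueKeywordLinks) {
        this.queueLinks = queueLinks;
        this.queueUserStatus = queueUserStatus;
        this.queueUserStatusLinks = queueUserStatusLinks;
        this.queueKeywordLinks = queueKeywordLinks;
    }

    /** Lists the kinds of content that would be queued under these options. */
    List<String> plannedContent() {
        List<String> plan = new ArrayList<>();
        // Assumption: search results are always queued.
        plan.add("tweets returned by the search");
        if (queueLinks) {
            plan.add("links embedded in tweets");
        }
        if (queueUserStatus) {
            plan.add("status of discovered users");
            // Assumption: status links only matter when statuses are harvested.
            if (queueUserStatusLinks) {
                plan.add("links embedded in user statuses");
            }
        }
        if (queueKeywordLinks) {
            plan.add("HTML search for the keywords");
        }
        return plan;
    }

    public static void main(String[] args) {
        QueueOptions options = new QueueOptions(true, true, false, true);
        for (String item : options.plannedContent()) {
            System.out.println(item);
        }
    }
}
```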
Constructor and Description |
---|
TwitterDecidingScope(String name): Creates a scope with the given name. |
Modifier and Type | Method and Description |
---|---|
boolean | addSeed(org.archive.crawler.datamodel.CandidateURI curi): Adds a candidate URI as a seed for the crawl. |
void | initialize(org.archive.crawler.framework.CrawlController controller): Makes any necessary Twitter API calls and queues the content discovered. |
Methods inherited from superclasses (one line per declaring class):
getDecideRule, innerAccepts, kickUpdate
addSeedListener, checkClose, getSeedfile, isSameHost, isSeed, listUsedFiles, refreshSeeds, seedsIterator, seedsIterator, toString
accepts, getFilterOffPosition, returnTrueIfMatches
addElementToDefinition, checkValue, earlyInitialize, getAbsoluteName, getAttribute, getAttribute, getAttribute, getAttributeInfo, getAttributeInfo, getAttributeInfoIterator, getAttributes, getDataContainerRecursive, getDataContainerRecursive, getDefaultValue, getDescription, getElementFromDefinition, getLegalValues, getLocalAttribute, getMBeanInfo, getMBeanInfo, getParent, getPreservedFields, getSettingsHandler, getUncheckedAttribute, getValue, globalSettings, invoke, isInitialized, isOverridden, iterator, removeElementFromDefinition, setAsOrder, setAttribute, setAttribute, setAttributes, setDescription, setPreservedFields, unsetAttribute
public static final String ATTR_KEYWORDS
public static final String ATTR_PAGES
public static final String ATTR_RESULTS_PER_PAGE
public static final String ATTR_GEOLOCATIONS
public static final String ATTR_LANG
public static final String ATTR_QUEUE_LINKS
public static final String ATTR_QUEUE_USER_STATUS
public static final String ATTR_QUEUE_USER_STATUS_LINKS
public static final String ATTR_QUEUE_KEYWORD_LINKS
public TwitterDecidingScope(String name)
Parameters: name - the name of this scope.

public void initialize(org.archive.crawler.framework.CrawlController controller)
Overrides: initialize in class org.archive.crawler.framework.CrawlScope
Parameters: controller - the controller for this crawl.

public boolean addSeed(org.archive.crawler.datamodel.CandidateURI curi)
Overrides: addSeed in class org.archive.crawler.framework.CrawlScope
Parameters: curi - the crawl URI to be added.

Copyright © 2005–2016 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library. All rights reserved.