dk.netarkivet.wayback
Class ExtractWaybackCDXBatchJob.MyAggressiveUrlCanonicalizer

java.lang.Object
  extended by org.archive.wayback.util.url.AggressiveUrlCanonicalizer
      extended by dk.netarkivet.wayback.ExtractWaybackCDXBatchJob.MyAggressiveUrlCanonicalizer
All Implemented Interfaces:
org.archive.wayback.UrlCanonicalizer
Enclosing class:
ExtractWaybackCDXBatchJob

public static class ExtractWaybackCDXBatchJob.MyAggressiveUrlCanonicalizer
extends org.archive.wayback.util.url.AggressiveUrlCanonicalizer

Overrides the standard Wayback canonicalizer (org.archive.wayback.util.url.AggressiveUrlCanonicalizer) so that URL keys are built with our version of UURIFactory (see Bug 1719).
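The override pattern described above can be illustrated with a minimal, self-contained sketch. It does not reproduce the actual implementation: every type below (KeyCanonicalizer, DefaultUriFactory, LocalUriFactory, StandardCanonicalizer, LocalCanonicalizer) is a hypothetical stand-in for the corresponding Wayback or NetarchiveSuite class, and the space-escaping behavior of the local factory is invented purely so the difference is visible. What the real UURIFactory replacement changes is documented only under Bug 1719.

```java
import java.util.Locale;

// Hypothetical stand-in for org.archive.wayback.UrlCanonicalizer.
// The real urlStringToKey declares URIException; this sketch uses an
// unchecked exception so that it compiles without the library.
interface KeyCanonicalizer {
    String urlStringToKey(String urlString);
}

// Hypothetical stand-in for the library's default URI factory.
final class DefaultUriFactory {
    private DefaultUriFactory() { }

    static String parse(String url) {
        if (url == null || url.trim().isEmpty()) {
            throw new IllegalArgumentException("empty URL");
        }
        return url.trim();
    }
}

// Hypothetical stand-in for "our version" of the factory. Escaping spaces
// is an invented difference, chosen only to make the effect observable.
final class LocalUriFactory {
    private LocalUriFactory() { }

    static String parse(String url) {
        return DefaultUriFactory.parse(url).replace(" ", "%20");
    }
}

// Stand-in for the library canonicalizer: parses with the default factory.
class StandardCanonicalizer implements KeyCanonicalizer {
    @Override
    public String urlStringToKey(String urlString) {
        return DefaultUriFactory.parse(urlString).toLowerCase(Locale.ROOT);
    }
}

// Stand-in for the subclass: same contract, but parses with the local factory.
class LocalCanonicalizer extends StandardCanonicalizer {
    @Override
    public String urlStringToKey(String urlString) {
        return LocalUriFactory.parse(urlString).toLowerCase(Locale.ROOT);
    }
}
```

Because only urlStringToKey is overridden, code that holds a reference of the interface type picks up the replacement factory through ordinary dynamic dispatch, with no change at the call site.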


Constructor Summary
ExtractWaybackCDXBatchJob.MyAggressiveUrlCanonicalizer()
           
 
Method Summary
 java.lang.String urlStringToKey(java.lang.String urlString)
           
 
Methods inherited from class org.archive.wayback.util.url.AggressiveUrlCanonicalizer
canonicalize, doStripRegexMatch, main
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ExtractWaybackCDXBatchJob.MyAggressiveUrlCanonicalizer

public ExtractWaybackCDXBatchJob.MyAggressiveUrlCanonicalizer()

Method Detail

urlStringToKey

public java.lang.String urlStringToKey(java.lang.String urlString)
                                throws org.apache.commons.httpclient.URIException
Specified by:
urlStringToKey in interface org.archive.wayback.UrlCanonicalizer
Overrides:
urlStringToKey in class org.archive.wayback.util.url.AggressiveUrlCanonicalizer
Throws:
org.apache.commons.httpclient.URIException