Details
Description
Fix C7b, C7c, C7d, C7g, C8a, C8b, C8c, C9b, C9c, C9e, 10c - Skal hente fra en ekstern tekst fil
C7b, C7g: danishMajorCities_UTF8.txt (danishMajorCities)
C7c, C7d: placenames_UTF8.txt (placenames)
C8a: foreninger_lowercased_UTF8.txt (foreninger_lowercased)
C8b, C8c: foreninger_one_word_lowercased_UTF8.txt (foreninger_one_word_lowercased)
C9b: virksomheder_lowercased_UTF8.txt (virksomheder_lowercased)
C9c, C9e: virksomheder_one_word_lowercased_UTF8.txt (virksomheder_one_word_lowercased)
C10c: DanishNames_UTF8.txt (DanishNames)
Man skal kunne referere til en ekstern liste.
Lige som det eksisterende kode.
criteriaRun-combinedComboJson-alt-seq.pig
captures = FOREACH captures GENERATE CombinedCombo(url, date, text, links, hostname, true, '/home/test/workflow/wordslist/danishMajorCities_UTF8.txt', '/home/test/workflow/wordslist/DanishNames_UTF8.txt', '/home/test/workflow/wordslist/foreninger_lowercased_UTF8.txt', '/home/test/workflow/wordslist/foreninger_one_word_lowercased_UTF8.txt', '/home/test/workflow/wordslist/placenames_UTF8.txt', '/home/test/workflow/wordslist/virksomheder_lowercased_UTF8.txt', '/home/test/workflow/wordslist/virksomheder_one_word_lowercased_UTF8.txt');