Loading...

XML

Word

Printable

Details

Type: Improvement
Resolution: Fixed
Priority: Major
Fix Version/s: 3.18.0, I49
Affects Version/s: 3.14.0, 3.15.0
Component/s: Archive
Labels:
None

Organization:

BNF
Accuracy of estimate:
Rough

Verification:

Hide

1. Start standard distributed NAS-system
2. Ingest 100 domains into system.
3. Make snapshotharvest SH1 (1 MBytes/pr/domain)
4. Activate SH1 and await that it finished
5. Make snapshotharvest SH2, depending on SH1 (3 MBytes/pr/domain)
6. Log on to the server where the indexserver application is running, and prepare to kill the application (find the PID of the indexserver-application):
6. Activate SH2 in the GUI
7. When you see in the indexserver-log that the Indexserver has started working on the
index, shut it down using kill -9 PID
8. Start the indexserver again.
9. See that the harvest SH2 starts and finishes.

Show
1. Start standard distributed NAS-system 2. Ingest 100 domains into system. 3. Make snapshotharvest SH1 (1 MBytes/pr/domain) 4. Activate SH1 and await that it finished 5. Make snapshotharvest SH2, depending on SH1 (3 MBytes/pr/domain) 6. Log on to the server where the indexserver application is running, and prepare to kill the application (find the PID of the indexserver-application): 6. Activate SH2 in the GUI 7. When you see in the indexserver-log that the Indexserver has started working on the index, shut it down using kill -9 PID 8. Start the indexserver again. 9. See that the harvest SH2 starts and finishes.

Description

When our production engineers ran robustness tests, they tried activating a few harvest definitions, let the jobs request a deduplication index, then killed and restarted the IndexServerApplication.

The harvesters kept waiting for a reply to the index request (and won't stop before hitting their timeout) while the IndexServer had obviously forgotten of all previous requests.

This is due to the fact that IndexRequestServer is implemented in a rather crude way, as it starts a thread for each request.

We should make an implementation that keeps track of index requests based on the subset of jobs that is requested, and which would be able to restart processing these requests when restarting the application.

Also harvesters should actively check for index generation completion instead of passively waiting for an answer. An even better solution would be not to submit a job before the relevant index has been generated, because it is a prerequisite to the job successful completion.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

IndexServerApplication0.log.0
25 kB
01/Sep/11 2:12 PM

Issue Links

Trackbacks

2011-09-06 Netarkiv møde DK møde Tidspunkt: 06. sep 12:30 13:00 Actions fra sidste møde 20110830 Netarkiv møde NARK:20110830 Netarkiv møde 3.17.0 status (Mikis) 3.17.0 status https://sbforge.org/jira/secure/TaskBoard.jspa?...

Activity

People

Assignee:: Søren Vejrup Carlsen (Inactive)

Reporter:: Nicolas Giraud (Inactive)

Inspector:: Mikis Seth Sørensen (Inactive)

Watchers:: 0 Start watching this issue

Dates

Created:: 05/Apr/11 4:15 PM

Updated:: 16/Nov/11 3:53 PM

Resolved:: 16/Nov/11 3:53 PM

Time Tracking

Estimated:

70h

Remaining:

Logged:

25h