Item31: Remote crawlers require more reliable publishing mechanism

Priority: CurrentState: AppliesTo: Component: WaitingFor:
Low Confirmed crawler    

Details

Currently, the crawler publishes results by writing an XML file onto the nfs-mount, and signalling the front-end that a new XML file is ready for publishing by tickling a certain url.

This is fine for the situation where all crawler nodes were in the datacenter, with a dedicated backbone link. It is less optimal for remote crawler node's.

Proposed solution: fix the 'publish by posting xml via http' feature.

-- KoenMartens - 09 Apr 2008

ItemTemplate
Summary Remote crawlers require more reliable publishing mechanism
ReportedBy KoenMartens
AppliesTo crawler
Priority Low
CurrentState Confirmed
WaitingFor

Topic revision: r1 - 09 Apr 2008 - 15:11:56 - KoenMartens