Hardcoded task ping timeout kills tasks localizing large amounts of data
Tasks need to localizing large amounts of data
1 AM, 1 Task Attempt
1. Start a MR task. Localization lasts for more than 5 min(feature start)
2. AM Ping Timeout (disconnect)
AM kills the MR task when the localization process takes too long time.
The ping timeout is hardcoded. When it takes more than 5 mins to finish the localization process, AM will try to kill the task.
The ping timeout is hardcoded to 5 minutes. When the AM pings time out the task, AM will try to kill the task.
Incorrect exception handling
Remove the ping timeout check from the AM task heartbeat handler.