Page 1 of 1

Project: 5913 (Run 6, Clone 624, Gen 18) - SHAKE Violations

Posted: Sun Feb 28, 2010 8:53 pm
by Tobit
I receive instant SHAKE Violations on this WU and the server kept reassigning it to me until it dediced to go to sleep for 24 hours.

Code: Select all

[20:04:17] + Attempting to send results [February 28 20:04:17 UTC]
[20:04:17] - Reading file work/wuresults_06.dat from core
[20:04:17]   (Read 44574 bytes from disk)
[20:04:17] Connecting to http://171.64.65.20:8080/
[20:04:18] Posted data.
[20:04:18] Initial: 0000; - Uploaded at ~44 kB/s
[20:04:18] - Averaged speed for that direction ~47 kB/s
[20:04:18] + Results successfully sent
[20:04:18] Thank you for your contribution to Folding@Home.
[20:04:18] + Number of Units Completed: 616

[20:04:22] Trying to send all finished work units
[20:04:22] + No unsent completed units remaining.
[20:04:22] - Preparing to get new work unit...
[20:04:22] + Attempting to get work packet
[20:04:22] - Will indicate memory of 4093 MB
[20:04:22] - Connecting to assignment server
[20:04:22] Connecting to http://assign-GPU.stanford.edu:8080/
[20:04:23] Posted data.
[20:04:23] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[20:04:23] + News From Folding@Home: Welcome to Folding@Home
[20:04:23] Loaded queue successfully.
[20:04:23] Connecting to http://171.64.65.20:8080/
[20:04:23] Posted data.
[20:04:23] Initial: 0000; - Receiving payload (expected size: 69105)
[20:04:24] - Downloaded at ~67 kB/s
[20:04:24] - Averaged speed for that direction ~69 kB/s
[20:04:24] + Received work.
[20:04:24] Trying to send all finished work units
[20:04:24] + No unsent completed units remaining.
[20:04:24] + Closed connections
[20:04:24] 
[20:04:24] + Processing work unit
[20:04:24] Core required: FahCore_14.exe
[20:04:24] Core found.
[20:04:24] Working on queue slot 07 [February 28 20:04:24 UTC]
[20:04:24] + Working ...
[20:04:24] - Calling '.\FahCore_14.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 1520 -version 623'

[20:04:24] 
[20:04:24] *------------------------------*
[20:04:24] Folding@Home GPU Core - Beta
[20:04:24] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[20:04:24] 
[20:04:24] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:04:24] Build host: vspm46
[20:04:24] Board Type: Nvidia
[20:04:24] Core      : 
[20:04:24] Preparing to commence simulation
[20:04:24] - Looking at optimizations...
[20:04:24] - Created dyn
[20:04:24] - Files status OK
[20:04:24] - Expanded 68593 -> 357580 (decompressed 521.3 percent)
[20:04:24] Called DecompressByteArray: compressed_data_size=68593 data_size=357580, decompressed_data_size=357580 diff=0
[20:04:24] - Digital signature verified
[20:04:24] 
[20:04:24] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:04:24] 
[20:04:24] Assembly optimizations on if available.
[20:04:24] Entering M.D.
[20:04:30] Tpr hash work/wudata_07.tpr:  1100426158 1227109992 1701485226 763441525 3563591168
[20:04:32] Working on Protein
[20:04:33] Client config found, loading data.
[20:04:34] mdrun_gpu returned 
[20:04:34] SHAKE violations on GPU
[20:04:34] 
[20:04:34] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:04:36] CoreStatus = 7A (122)
[20:04:36] Sending work to server
[20:04:36] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:04:36] - Read packet limit of 540015616... Set to 524286976.
[20:04:36] - Error: Could not get length of results file work/wuresults_07.dat
[20:04:36] - Error: Could not read unit 07 file. Removing from queue.
[20:04:36] Trying to send all finished work units
[20:04:36] + No unsent completed units remaining.
[20:04:36] - Preparing to get new work unit...
[20:04:36] + Attempting to get work packet
[20:04:36] - Will indicate memory of 4093 MB
[20:04:36] - Connecting to assignment server
[20:04:36] Connecting to http://assign-GPU.stanford.edu:8080/
[20:04:37] Posted data.
[20:04:37] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[20:04:37] + News From Folding@Home: Welcome to Folding@Home
[20:04:37] Loaded queue successfully.
[20:04:37] Connecting to http://171.64.65.20:8080/
[20:04:38] Posted data.
[20:04:38] Initial: 0000; - Receiving payload (expected size: 69105)
[20:04:38] Conversation time very short, giving reduced weight in bandwidth avg
[20:04:38] - Downloaded at ~134 kB/s
[20:04:38] - Averaged speed for that direction ~77 kB/s
[20:04:38] + Received work.
[20:04:38] Trying to send all finished work units
[20:04:38] + No unsent completed units remaining.
[20:04:38] + Closed connections
[20:04:43] 
[20:04:43] + Processing work unit
[20:04:43] Core required: FahCore_14.exe
[20:04:43] Core found.
[20:04:43] Working on queue slot 08 [February 28 20:04:43 UTC]
[20:04:43] + Working ...
[20:04:43] - Calling '.\FahCore_14.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 1520 -version 623'

[20:04:43] 
[20:04:43] *------------------------------*
[20:04:43] Folding@Home GPU Core - Beta
[20:04:43] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[20:04:43] 
[20:04:43] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:04:43] Build host: vspm46
[20:04:43] Board Type: Nvidia
[20:04:43] Core      : 
[20:04:43] Preparing to commence simulation
[20:04:43] - Looking at optimizations...
[20:04:43] - Created dyn
[20:04:43] - Files status OK
[20:04:43] - Expanded 68593 -> 357580 (decompressed 521.3 percent)
[20:04:43] Called DecompressByteArray: compressed_data_size=68593 data_size=357580, decompressed_data_size=357580 diff=0
[20:04:43] - Digital signature verified
[20:04:43] 
[20:04:43] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:04:43] 
[20:04:43] Assembly optimizations on if available.
[20:04:43] Entering M.D.
[20:04:49] Tpr hash work/wudata_08.tpr:  1100426158 1227109992 1701485226 763441525 3563591168
[20:04:51] Working on Protein
[20:04:53] Client config found, loading data.
[20:04:53] mdrun_gpu returned 
[20:04:53] SHAKE violations on GPU
[20:04:53] 
[20:04:53] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:04:53] Starting GUI Server
[20:04:56] CoreStatus = 7A (122)
[20:04:56] Sending work to server
[20:04:56] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:04:56] - Read packet limit of 540015616... Set to 524286976.
[20:04:56] - Error: Could not get length of results file work/wuresults_08.dat
[20:04:56] - Error: Could not read unit 08 file. Removing from queue.
[20:04:56] Trying to send all finished work units
[20:04:56] + No unsent completed units remaining.
[20:04:56] - Preparing to get new work unit...
[20:04:56] + Attempting to get work packet
[20:04:56] - Will indicate memory of 4093 MB
[20:04:56] - Connecting to assignment server
[20:04:56] Connecting to http://assign-GPU.stanford.edu:8080/
[20:04:56] Posted data.
[20:04:56] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[20:04:56] + News From Folding@Home: Welcome to Folding@Home
[20:04:57] Loaded queue successfully.
[20:04:57] Connecting to http://171.64.65.20:8080/
[20:04:57] Posted data.
[20:04:57] Initial: 0000; - Receiving payload (expected size: 69105)
[20:04:58] - Downloaded at ~67 kB/s
[20:04:58] - Averaged speed for that direction ~75 kB/s
[20:04:58] + Received work.
[20:04:58] Trying to send all finished work units
[20:04:58] + No unsent completed units remaining.
[20:04:58] + Closed connections
[20:05:03] 
[20:05:03] + Processing work unit
[20:05:03] Core required: FahCore_14.exe
[20:05:03] Core found.
[20:05:03] Working on queue slot 09 [February 28 20:05:03 UTC]
[20:05:03] + Working ...
[20:05:03] - Calling '.\FahCore_14.exe -dir work/ -suffix 09 -checkpoint 15 -verbose -lifeline 1520 -version 623'

[20:05:03] 
[20:05:03] *------------------------------*
[20:05:03] Folding@Home GPU Core - Beta
[20:05:03] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[20:05:03] 
[20:05:03] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:05:03] Build host: vspm46
[20:05:03] Board Type: Nvidia
[20:05:03] Core      : 
[20:05:03] Preparing to commence simulation
[20:05:03] - Looking at optimizations...
[20:05:03] - Created dyn
[20:05:03] - Files status OK
[20:05:03] - Expanded 68593 -> 357580 (decompressed 521.3 percent)
[20:05:03] Called DecompressByteArray: compressed_data_size=68593 data_size=357580, decompressed_data_size=357580 diff=0
[20:05:03] - Digital signature verified
[20:05:03] 
[20:05:03] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:05:03] 
[20:05:03] Assembly optimizations on if available.
[20:05:03] Entering M.D.
[20:05:09] Tpr hash work/wudata_09.tpr:  1100426158 1227109992 1701485226 763441525 3563591168
[20:05:10] Working on Protein
[20:05:12] Client config found, loading data.
[20:05:12] mdrun_gpu returned 
[20:05:12] SHAKE violations on GPU
[20:05:12] 
[20:05:12] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:05:15] CoreStatus = 7A (122)
[20:05:15] Sending work to server
[20:05:15] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:05:15] - Read packet limit of 540015616... Set to 524286976.
[20:05:15] - Error: Could not get length of results file work/wuresults_09.dat
[20:05:15] - Error: Could not read unit 09 file. Removing from queue.
[20:05:15] Trying to send all finished work units
[20:05:15] + No unsent completed units remaining.
[20:05:15] - Preparing to get new work unit...
[20:05:15] + Attempting to get work packet
[20:05:15] - Will indicate memory of 4093 MB
[20:05:15] - Connecting to assignment server
[20:05:15] Connecting to http://assign-GPU.stanford.edu:8080/
[20:05:16] Posted data.
[20:05:16] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[20:05:16] + News From Folding@Home: Welcome to Folding@Home
[20:05:16] Loaded queue successfully.
[20:05:16] Connecting to http://171.64.65.20:8080/
[20:05:16] Posted data.
[20:05:16] Initial: 0000; - Receiving payload (expected size: 69105)
[20:05:17] - Downloaded at ~67 kB/s
[20:05:17] - Averaged speed for that direction ~73 kB/s
[20:05:17] + Received work.
[20:05:17] Trying to send all finished work units
[20:05:17] + No unsent completed units remaining.
[20:05:17] + Closed connections
[20:05:22] 
[20:05:22] + Processing work unit
[20:05:22] Core required: FahCore_14.exe
[20:05:22] Core found.
[20:05:22] Working on queue slot 00 [February 28 20:05:22 UTC]
[20:05:22] + Working ...
[20:05:22] - Calling '.\FahCore_14.exe -dir work/ -suffix 00 -checkpoint 15 -verbose -lifeline 1520 -version 623'

[20:05:22] 
[20:05:22] *------------------------------*
[20:05:22] Folding@Home GPU Core - Beta
[20:05:22] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[20:05:22] 
[20:05:22] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:05:22] Build host: vspm46
[20:05:22] Board Type: Nvidia
[20:05:22] Core      : 
[20:05:22] Preparing to commence simulation
[20:05:22] - Looking at optimizations...
[20:05:22] - Created dyn
[20:05:22] - Files status OK
[20:05:22] - Expanded 68593 -> 357580 (decompressed 521.3 percent)
[20:05:22] Called DecompressByteArray: compressed_data_size=68593 data_size=357580, decompressed_data_size=357580 diff=0
[20:05:22] - Digital signature verified
[20:05:22] 
[20:05:22] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:05:22] 
[20:05:22] Assembly optimizations on if available.
[20:05:22] Entering M.D.
[20:05:28] Tpr hash work/wudata_00.tpr:  1100426158 1227109992 1701485226 763441525 3563591168
[20:05:29] Working on Protein
[20:05:31] Client config found, loading data.
[20:05:31] Starting GUI Server
[20:05:31] mdrun_gpu returned 
[20:05:31] SHAKE violations on GPU
[20:05:32] 
[20:05:32] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:05:35] CoreStatus = 7A (122)
[20:05:35] Sending work to server
[20:05:35] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:05:35] - Read packet limit of 540015616... Set to 524286976.
[20:05:35] - Error: Could not get length of results file work/wuresults_00.dat
[20:05:35] - Error: Could not read unit 00 file. Removing from queue.
[20:05:35] Trying to send all finished work units
[20:05:35] + No unsent completed units remaining.
[20:05:35] - Preparing to get new work unit...
[20:05:35] + Attempting to get work packet
[20:05:35] - Will indicate memory of 4093 MB
[20:05:35] - Connecting to assignment server
[20:05:35] Connecting to http://assign-GPU.stanford.edu:8080/
[20:05:35] Posted data.
[20:05:35] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[20:05:35] + News From Folding@Home: Welcome to Folding@Home
[20:05:36] Loaded queue successfully.
[20:05:36] Connecting to http://171.64.65.20:8080/
[20:05:36] Posted data.
[20:05:36] Initial: 0000; - Receiving payload (expected size: 69105)
[20:05:37] - Downloaded at ~67 kB/s
[20:05:37] - Averaged speed for that direction ~72 kB/s
[20:05:37] + Received work.
[20:05:37] Trying to send all finished work units
[20:05:37] + No unsent completed units remaining.
[20:05:37] + Closed connections
[20:05:42] 
[20:05:42] + Processing work unit
[20:05:42] Core required: FahCore_14.exe
[20:05:42] Core found.
[20:05:42] Working on queue slot 01 [February 28 20:05:42 UTC]
[20:05:42] + Working ...
[20:05:42] - Calling '.\FahCore_14.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 1520 -version 623'

[20:05:42] 
[20:05:42] *------------------------------*
[20:05:42] Folding@Home GPU Core - Beta
[20:05:42] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[20:05:42] 
[20:05:42] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:05:42] Build host: vspm46
[20:05:42] Board Type: Nvidia
[20:05:42] Core      : 
[20:05:42] Preparing to commence simulation
[20:05:42] - Looking at optimizations...
[20:05:42] - Created dyn
[20:05:42] - Files status OK
[20:05:42] - Expanded 68593 -> 357580 (decompressed 521.3 percent)
[20:05:42] Called DecompressByteArray: compressed_data_size=68593 data_size=357580, decompressed_data_size=357580 diff=0
[20:05:42] - Digital signature verified
[20:05:42] 
[20:05:42] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:05:42] 
[20:05:42] Assembly optimizations on if available.
[20:05:42] Entering M.D.
[20:05:48] Tpr hash work/wudata_01.tpr:  1100426158 1227109992 1701485226 763441525 3563591168
[20:05:48] Working on Protein
[20:05:51] Client config found, loading data.
[20:05:51] mdrun_gpu returned 
[20:05:51] SHAKE violations on GPU
[20:05:51] 
[20:05:51] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:05:52] Starting GUI Server
[20:05:56] CoreStatus = 7A (122)
[20:05:56] Sending work to server
[20:05:56] Project: 5913 (Run 6, Clone 624, Gen 18)
[20:05:56] - Read packet limit of 540015616... Set to 524286976.
[20:05:56] - Error: Could not get length of results file work/wuresults_01.dat
[20:05:56] - Error: Could not read unit 01 file. Removing from queue.
[20:05:56] EUE limit exceeded. Pausing 24 hours.

Re: Project: 5913 (Run 6, Clone 624, Gen 18) - SHAKE Violations

Posted: Sun Feb 28, 2010 9:39 pm
by Zagen30
I can't comment on the scientific errors, but if you restart the client it'll probably give the bad WU one more crack before grabbing another one. At least that's what always happens when my clients have gotten bad WUs that they won't give up on.

Re: Project: 5913 (Run 6, Clone 624, Gen 18) - SHAKE Violations

Posted: Mon Mar 01, 2010 4:04 am
by bruce
I've reported this one as a bad WU.