Page 1 of 1

Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 6:37 am
by anko1
Got this one 5 times, each instant failure and same message, along with removing from queue b/c couldn't get length of results.

Last instance:

Code: Select all

[22:43:12] + Processing work unit
[22:43:12] Core required: FahCore_14.exe
[22:43:12] Core found.
[22:43:12] Working on queue slot 00 [February 27 22:43:12 UTC]
[22:43:12] + Working ...
[22:43:12] - Calling '.\FahCore_14.exe -dir work/ -suffix 00 -priority 96 -cpu 86 -checkpoint 15 -verbose -lifeline 5260 -version 623'

[22:43:12] 
[22:43:12] *------------------------------*
[22:43:12] Folding@Home GPU Core - Beta
[22:43:12] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[22:43:12] 
[22:43:12] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[22:43:12] Build host: vspm46
[22:43:12] Board Type: Nvidia
[22:43:12] Core      : 
[22:43:12] Preparing to commence simulation
[22:43:12] - Looking at optimizations...
[22:43:12] - Created dyn
[22:43:12] - Files status OK
[22:43:12] - Expanded 69084 -> 357580 (decompressed 517.6 percent)
[22:43:12] Called DecompressByteArray: compressed_data_size=69084 data_size=357580, decompressed_data_size=357580 diff=0
[22:43:12] - Digital signature verified
[22:43:12] 
[22:43:12] Project: 5911 (Run 3, Clone 35, Gen 4)
[22:43:12] 
[22:43:13] Assembly optimizations on if available.
[22:43:13] Entering M.D.
[22:43:19] Tpr hash work/wudata_00.tpr:  2601140089 4139656169 2513526555 1158789911 1732238406
[22:43:19] Working on Protein
[22:43:20] mdrun_gpu returned 
[22:43:20] Self-test failure
[22:43:20] 
[22:43:20] Folding@home Core Shutdown: UNSTABLE_MACHINE
[22:43:23] CoreStatus = 7A (122)
[22:43:23] Sending work to server
[22:43:23] Project: 5911 (Run 3, Clone 35, Gen 4)
[22:43:23] - Read packet limit of 540015616... Set to 524286976.
[22:43:23] - Error: Could not get length of results file work/wuresults_00.dat
[22:43:23] - Error: Could not read unit 00 file. Removing from queue.
[22:43:23] EUE limit exceeded. Pausing 24 hours.
Edit - pulled it one more time on restart (after deleting work folder); now happily working on a 5910.

Re: Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 7:00 am
by sortofageek
This WU has not been completed successfully by anyone so far.

Re: Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 8:15 am
by bruce
Have you run MemtestG80?

I believe a self-test failure means your GPU has a hardware problem (including the possibility of a driver problem).

Re: Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 9:16 am
by Nathan_P
bruce wrote:Have you run MemtestG80?

I believe a self-test failure means your GPU has a hardware problem (including the possibility of a driver problem).
I've just had the same WU fail 6 times on a less than 1month old GTS 250, i think its reasonable to say that this one may be a bad wu

Code: Select all

[04:35:15] 
[04:35:15] + Processing work unit
[04:35:15] Core required: FahCore_14.exe
[04:35:15] Core found.
[04:35:15] Working on queue slot 08 [February 28 04:35:15 UTC]
[04:35:15] + Working ...
[04:35:15] - Calling '.\FahCore_14.exe -dir work/ -suffix 08 -priority 96 -checkpoint 15 -verbose -lifeline 2408 -version 623'

[04:35:15] 
[04:35:15] *------------------------------*
[04:35:15] Folding@Home GPU Core - Beta
[04:35:15] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[04:35:15] 
[04:35:15] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[04:35:15] Build host: vspm46
[04:35:15] Board Type: Nvidia
[04:35:15] Core      : 
[04:35:15] Preparing to commence simulation
[04:35:15] - Looking at optimizations...
[04:35:15] - Created dyn
[04:35:15] - Files status OK
[04:35:15] - Expanded 70180 -> 360060 (decompressed 513.0 percent)
[04:35:15] Called DecompressByteArray: compressed_data_size=70180 data_size=360060, decompressed_data_size=360060 diff=0
[04:35:15] - Digital signature verified
[04:35:15] 
[04:35:15] Project: 5910 (Run 10, Clone 70, Gen 39)
[04:35:15] 
[04:35:15] Assembly optimizations on if available.
[04:35:15] Entering M.D.
[04:35:21] Tpr hash work/wudata_08.tpr:  1609103518 1840599143 61034873 3417121864 487604504
[04:35:22] Working on Protein
[04:35:23] Client config found, loading data.
[04:35:23] Starting GUI Server
[04:36:13] Completed 1%
[04:37:26] Completed 2%
[04:38:34] Completed 3%
[04:39:39] Completed 4%
[04:40:55] Completed 5%
[04:42:14] Completed 6%
[04:43:33] Completed 7%
[04:44:49] Completed 8%
[04:46:01] Completed 9%
[04:47:20] Completed 10%
[04:48:32] Completed 11%
[04:49:59] Completed 12%
[04:51:07] Completed 13%
[04:52:20] Completed 14%
[04:53:35] Completed 15%
[04:55:02] Completed 16%
[04:56:17] Completed 17%
[04:57:40] Completed 18%
[04:58:49] Completed 19%
[05:00:01] Completed 20%
[05:01:16] Completed 21%
[05:02:18] - Autosending finished units... [February 28 05:02:18 UTC]
[05:02:18] Trying to send all finished work units
[05:02:18] + No unsent completed units remaining.
[05:02:18] - Autosend completed
[05:02:18] + Working...
[05:02:29] Completed 22%
[05:03:41] Completed 23%
[05:04:53] Completed 24%
[05:06:12] Completed 25%
[05:07:14] Completed 26%
[05:08:40] Completed 27%
[05:10:10] Completed 28%
[05:11:40] Completed 29%
[05:12:55] Completed 30%
[05:14:11] Completed 31%
[05:15:30] Completed 32%
[05:16:42] Completed 33%
[05:18:05] Completed 34%
[05:19:17] Completed 35%
[05:20:44] Completed 36%
[05:22:06] Completed 37%
[05:23:29] Completed 38%
[05:24:41] Completed 39%
[05:26:01] Completed 40%
[05:27:16] Completed 41%
[05:28:28] Completed 42%
[05:29:48] Completed 43%
[05:31:07] Completed 44%
[05:32:23] Completed 45%
[05:33:28] Completed 46%
[05:34:36] Completed 47%
[05:35:48] Completed 48%
[05:37:11] Completed 49%
[05:38:27] Completed 50%
[05:39:46] Completed 51%
[05:41:05] Completed 52%
[05:42:21] Completed 53%
[05:43:40] Completed 54%
[05:44:49] Completed 55%
[05:46:12] Completed 56%
[05:47:34] Completed 57%
[05:48:43] Completed 58%
[05:50:06] Completed 59%
[05:51:29] Completed 60%
[05:52:44] Completed 61%
[05:53:49] Completed 62%
[05:55:09] Completed 63%
[05:56:31] Completed 64%
[05:57:47] Completed 65%
[05:59:13] Completed 66%
[06:00:33] Completed 67%
[06:01:48] Completed 68%
[06:03:07] Completed 69%
[06:04:27] Completed 70%
[06:05:39] Completed 71%
[06:06:55] Completed 72%
[06:08:14] Completed 73%
[06:09:30] Completed 74%
[06:10:45] Completed 75%
[06:12:08] Completed 76%
[06:13:27] Completed 77%
[06:14:46] Completed 78%
[06:15:59] Completed 79%
[06:17:21] Completed 80%
[06:18:30] Completed 81%
[06:19:46] Completed 82%
[06:20:54] Completed 83%
[06:22:13] Completed 84%
[06:23:33] Completed 85%
[06:24:45] Completed 86%
[06:25:57] Completed 87%
[06:27:13] Completed 88%
[06:28:28] Completed 89%
[06:29:44] Completed 90%
[06:31:07] Completed 91%
[06:32:23] Completed 92%
[06:33:38] Completed 93%
[06:34:50] Completed 94%
[06:36:06] Completed 95%
[06:37:25] Completed 96%
[06:38:52] Completed 97%
[06:40:04] Completed 98%
[06:41:16] Completed 99%
[06:42:49] Completed 100%
[06:42:49] Successful run
[06:42:49] DynamicWrapper: Finished Work Unit: sleep=10000
[06:42:59] Reserved 11296 bytes for xtc file; Cosm status=0
[06:42:59] Allocated 11296 bytes for xtc file
[06:42:59] - Reading up to 11296 from "work/wudata_08.xtc": Read 11296
[06:42:59] Read 11296 bytes from xtc file; available packet space=786419168
[06:42:59] xtc file hash check passed.
[06:42:59] Reserved 23472 23472 786419168 bytes for arc file=<work/wudata_08.trr> Cosm status=0
[06:42:59] Allocated 23472 bytes for arc file
[06:42:59] - Reading up to 23472 from "work/wudata_08.trr": Read 23472
[06:42:59] Read 23472 bytes from arc file; available packet space=786395696
[06:42:59] trr file hash check passed.
[06:42:59] Allocated 560 bytes for edr file
[06:42:59] Read bedfile
[06:42:59] edr file hash check passed.
[06:42:59] Allocated 57129 bytes for logfile
[06:42:59] Read logfile
[06:42:59] GuardedRun: success in DynamicWrapper
[06:42:59] GuardedRun: done
[06:42:59] Run: GuardedRun completed.
[06:43:00] - Writing 92969 bytes of core data to disk...
[06:43:00] Done: 92457 -> 43382 (compressed to 46.9 percent)
[06:43:00]   ... Done.
[06:43:00] - Shutting down core 
[06:43:00] 
[06:43:00] Folding@home Core Shutdown: FINISHED_UNIT
[06:43:04] CoreStatus = 64 (100)
[06:43:04] Unit 8 finished with 97 percent of time to deadline remaining.
[06:43:04] Updated performance fraction: 0.965483
[06:43:04] Sending work to server
[06:43:04] Project: 5910 (Run 10, Clone 70, Gen 39)
[06:43:04] - Read packet limit of 540015616... Set to 524286976.


[06:43:04] + Attempting to send results [February 28 06:43:04 UTC]
[06:43:04] - Reading file work/wuresults_08.dat from core
[06:43:04]   (Read 43894 bytes from disk)
[06:43:04] Connecting to http://171.64.65.20:8080/
[06:43:06] Posted data.
[06:43:06] Initial: 0000; - Uploaded at ~21 kB/s
[06:43:06] - Averaged speed for that direction ~19 kB/s
[06:43:06] + Results successfully sent
[06:43:06] Thank you for your contribution to Folding@Home.
[06:43:06] + Number of Units Completed: 925

[06:43:10] Trying to send all finished work units
[06:43:10] + No unsent completed units remaining.
[06:43:10] - Preparing to get new work unit...
[06:43:10] + Attempting to get work packet
[06:43:10] - Will indicate memory of 2047 MB
[06:43:10] - Connecting to assignment server
[06:43:10] Connecting to http://assign-GPU.stanford.edu:8080/
[06:43:11] Posted data.
[06:43:11] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[06:43:11] + News From Folding@Home: Welcome to Folding@Home
[06:43:11] Loaded queue successfully.
[06:43:11] Connecting to http://171.64.65.20:8080/
[06:43:12] Posted data.
[06:43:12] Initial: 0000; - Receiving payload (expected size: 69596)
[06:43:13] - Downloaded at ~67 kB/s
[06:43:13] - Averaged speed for that direction ~62 kB/s
[06:43:13] + Received work.
[06:43:13] Trying to send all finished work units
[06:43:13] + No unsent completed units remaining.
[06:43:13] + Closed connections
[06:43:13] 
[06:43:13] + Processing work unit
[06:43:13] Core required: FahCore_14.exe
[06:43:13] Core found.
[06:43:13] Working on queue slot 09 [February 28 06:43:13 UTC]
[06:43:13] + Working ...
[06:43:13] - Calling '.\FahCore_14.exe -dir work/ -suffix 09 -priority 96 -checkpoint 15 -verbose -lifeline 2408 -version 623'

[06:43:13] 
[06:43:13] *------------------------------*
[06:43:13] Folding@Home GPU Core - Beta
[06:43:13] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[06:43:13] 
[06:43:13] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[06:43:13] Build host: vspm46
[06:43:13] Board Type: Nvidia
[06:43:13] Core      : 
[06:43:13] Preparing to commence simulation
[06:43:13] - Looking at optimizations...
[06:43:13] - Created dyn
[06:43:13] - Files status OK
[06:43:13] - Expanded 69084 -> 357580 (decompressed 517.6 percent)
[06:43:13] Called DecompressByteArray: compressed_data_size=69084 data_size=357580, decompressed_data_size=357580 diff=0
[06:43:13] - Digital signature verified
[06:43:13] 
[06:43:13] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:43:13] 
[06:43:13] Assembly optimizations on if available.
[06:43:13] Entering M.D.
[06:43:19] Tpr hash work/wudata_09.tpr:  2601140089 4139656169 2513526555 1158789911 1732238406
[06:43:20] Working on Protein
[06:43:20] mdrun_gpu returned 
[06:43:20] Self-test failure
[06:43:20] 
[06:43:20] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:43:23] CoreStatus = 7A (122)
[06:43:23] Sending work to server
[06:43:23] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:43:23] - Read packet limit of 540015616... Set to 524286976.
[06:43:23] - Error: Could not get length of results file work/wuresults_09.dat
[06:43:23] - Error: Could not read unit 09 file. Removing from queue.
[06:43:23] Trying to send all finished work units
[06:43:23] + No unsent completed units remaining.
[06:43:23] - Preparing to get new work unit...
[06:43:23] + Attempting to get work packet
[06:43:23] - Will indicate memory of 2047 MB
[06:43:23] - Connecting to assignment server
[06:43:23] Connecting to http://assign-GPU.stanford.edu:8080/
[06:43:24] Posted data.
[06:43:24] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[06:43:24] + News From Folding@Home: Welcome to Folding@Home
[06:43:24] Loaded queue successfully.
[06:43:24] Connecting to http://171.64.65.20:8080/
[06:43:25] Posted data.
[06:43:25] Initial: 0000; - Receiving payload (expected size: 69596)
[06:43:26] - Downloaded at ~67 kB/s
[06:43:26] - Averaged speed for that direction ~63 kB/s
[06:43:26] + Received work.
[06:43:26] Trying to send all finished work units
[06:43:26] + No unsent completed units remaining.
[06:43:26] + Closed connections
[06:43:31] 
[06:43:31] + Processing work unit
[06:43:31] Core required: FahCore_14.exe
[06:43:31] Core found.
[06:43:31] Working on queue slot 00 [February 28 06:43:31 UTC]
[06:43:31] + Working ...
[06:43:31] - Calling '.\FahCore_14.exe -dir work/ -suffix 00 -priority 96 -checkpoint 15 -verbose -lifeline 2408 -version 623'

[06:43:31] 
[06:43:31] *------------------------------*
[06:43:31] Folding@Home GPU Core - Beta
[06:43:31] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[06:43:31] 
[06:43:31] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[06:43:31] Build host: vspm46
[06:43:31] Board Type: Nvidia
[06:43:31] Core      : 
[06:43:31] Preparing to commence simulation
[06:43:31] - Looking at optimizations...
[06:43:31] - Created dyn
[06:43:31] - Files status OK
[06:43:31] - Expanded 69084 -> 357580 (decompressed 517.6 percent)
[06:43:31] Called DecompressByteArray: compressed_data_size=69084 data_size=357580, decompressed_data_size=357580 diff=0
[06:43:31] - Digital signature verified
[06:43:31] 
[06:43:31] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:43:31] 
[06:43:31] Assembly optimizations on if available.
[06:43:31] Entering M.D.
[06:43:37] Tpr hash work/wudata_00.tpr:  2601140089 4139656169 2513526555 1158789911 1732238406
[06:43:38] Working on Protein
[06:43:38] mdrun_gpu returned 
[06:43:38] Self-test failure
[06:43:38] 
[06:43:38] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:43:41] CoreStatus = 7A (122)
[06:43:41] Sending work to server
[06:43:41] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:43:41] - Read packet limit of 540015616... Set to 524286976.
[06:43:41] - Error: Could not get length of results file work/wuresults_00.dat
[06:43:41] - Error: Could not read unit 00 file. Removing from queue.
[06:43:41] Trying to send all finished work units
[06:43:41] + No unsent completed units remaining.
[06:43:41] - Preparing to get new work unit...
[06:43:41] + Attempting to get work packet
[06:43:41] - Will indicate memory of 2047 MB
[06:43:41] - Connecting to assignment server
[06:43:41] Connecting to http://assign-GPU.stanford.edu:8080/
[06:43:42] Posted data.
[06:43:42] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[06:43:42] + News From Folding@Home: Welcome to Folding@Home
[06:43:42] Loaded queue successfully.
[06:43:42] Connecting to http://171.64.65.20:8080/
[06:43:43] Posted data.
[06:43:43] Initial: 0000; - Receiving payload (expected size: 69596)
[06:43:44] - Downloaded at ~67 kB/s
[06:43:44] - Averaged speed for that direction ~64 kB/s
[06:43:44] + Received work.
[06:43:44] Trying to send all finished work units
[06:43:44] + No unsent completed units remaining.
[06:43:44] + Closed connections
[06:43:49] 
[06:43:49] + Processing work unit
[06:43:49] Core required: FahCore_14.exe
[06:43:49] Core found.
[06:43:49] Working on queue slot 01 [February 28 06:43:49 UTC]
[06:43:49] + Working ...
[06:43:49] - Calling '.\FahCore_14.exe -dir work/ -suffix 01 -priority 96 -checkpoint 15 -verbose -lifeline 2408 -version 623'

[06:43:49] 
[06:43:49] *------------------------------*
[06:43:49] Folding@Home GPU Core - Beta
[06:43:49] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[06:43:49] 
[06:43:49] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[06:43:49] Build host: vspm46
[06:43:49] Board Type: Nvidia
[06:43:49] Core      : 
[06:43:49] Preparing to commence simulation
[06:43:49] - Looking at optimizations...
[06:43:49] - Created dyn
[06:43:49] - Files status OK
[06:43:49] - Expanded 69084 -> 357580 (decompressed 517.6 percent)
[06:43:49] Called DecompressByteArray: compressed_data_size=69084 data_size=357580, decompressed_data_size=357580 diff=0
[06:43:49] - Digital signature verified
[06:43:49] 
[06:43:49] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:43:49] 
[06:43:49] Assembly optimizations on if available.
[06:43:49] Entering M.D.
[06:43:55] Tpr hash work/wudata_01.tpr:  2601140089 4139656169 2513526555 1158789911 1732238406
[06:43:56] Working on Protein
[06:43:56] mdrun_gpu returned 
[06:43:56] Self-test failure
[06:43:56] 
[06:43:56] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:44:00] CoreStatus = 7A (122)
[06:44:00] Sending work to server
[06:44:00] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:44:00] - Read packet limit of 540015616... Set to 524286976.
[06:44:00] - Error: Could not get length of results file work/wuresults_01.dat
[06:44:00] - Error: Could not read unit 01 file. Removing from queue.
[06:44:00] Trying to send all finished work units
[06:44:00] + No unsent completed units remaining.
[06:44:00] - Preparing to get new work unit...
[06:44:00] + Attempting to get work packet
[06:44:00] - Will indicate memory of 2047 MB
[06:44:00] - Connecting to assignment server
[06:44:00] Connecting to http://assign-GPU.stanford.edu:8080/
[06:44:00] Posted data.
[06:44:00] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[06:44:00] + News From Folding@Home: Welcome to Folding@Home
[06:44:01] Loaded queue successfully.
[06:44:01] Connecting to http://171.64.65.20:8080/
[06:44:01] Posted data.
[06:44:01] Initial: 0000; - Receiving payload (expected size: 69596)
[06:44:02] - Downloaded at ~67 kB/s
[06:44:02] - Averaged speed for that direction ~65 kB/s
[06:44:02] + Received work.
[06:44:02] Trying to send all finished work units
[06:44:02] + No unsent completed units remaining.
[06:44:02] + Closed connections
[06:44:07] 
[06:44:07] + Processing work unit
[06:44:07] Core required: FahCore_14.exe
[06:44:07] Core found.
[06:44:07] Working on queue slot 02 [February 28 06:44:07 UTC]
[06:44:07] + Working ...
[06:44:07] - Calling '.\FahCore_14.exe -dir work/ -suffix 02 -priority 96 -checkpoint 15 -verbose -lifeline 2408 -version 623'

[06:44:08] 
[06:44:08] *------------------------------*
[06:44:08] Folding@Home GPU Core - Beta
[06:44:08] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[06:44:08] 
[06:44:08] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[06:44:08] Build host: vspm46
[06:44:08] Board Type: Nvidia
[06:44:08] Core      : 
[06:44:08] Preparing to commence simulation
[06:44:08] - Looking at optimizations...
[06:44:08] - Created dyn
[06:44:08] - Files status OK
[06:44:08] - Expanded 69084 -> 357580 (decompressed 517.6 percent)
[06:44:08] Called DecompressByteArray: compressed_data_size=69084 data_size=357580, decompressed_data_size=357580 diff=0
[06:44:08] - Digital signature verified
[06:44:08] 
[06:44:08] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:44:08] 
[06:44:08] Assembly optimizations on if available.
[06:44:08] Entering M.D.
[06:44:14] Tpr hash work/wudata_02.tpr:  2601140089 4139656169 2513526555 1158789911 1732238406
[06:44:14] Working on Protein
[06:44:15] mdrun_gpu returned 
[06:44:15] Self-test failure
[06:44:15] 
[06:44:15] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:44:18] CoreStatus = 7A (122)
[06:44:18] Sending work to server
[06:44:18] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:44:18] - Read packet limit of 540015616... Set to 524286976.
[06:44:18] - Error: Could not get length of results file work/wuresults_02.dat
[06:44:18] - Error: Could not read unit 02 file. Removing from queue.
[06:44:18] Trying to send all finished work units
[06:44:18] + No unsent completed units remaining.
[06:44:18] - Preparing to get new work unit...
[06:44:18] + Attempting to get work packet
[06:44:18] - Will indicate memory of 2047 MB
[06:44:18] - Connecting to assignment server
[06:44:18] Connecting to http://assign-GPU.stanford.edu:8080/
[06:44:19] Posted data.
[06:44:19] Initial: 40AB; - Successful: assigned to (171.64.65.20).
[06:44:19] + News From Folding@Home: Welcome to Folding@Home
[06:44:19] Loaded queue successfully.
[06:44:19] Connecting to http://171.64.65.20:8080/
[06:44:19] Posted data.
[06:44:19] Initial: 0000; - Receiving payload (expected size: 69596)
[06:44:21] - Downloaded at ~33 kB/s
[06:44:21] - Averaged speed for that direction ~59 kB/s
[06:44:21] + Received work.
[06:44:21] Trying to send all finished work units
[06:44:21] + No unsent completed units remaining.
[06:44:21] + Closed connections
[06:44:26] 
[06:44:26] + Processing work unit
[06:44:26] Core required: FahCore_14.exe
[06:44:26] Core found.
[06:44:26] Working on queue slot 03 [February 28 06:44:26 UTC]
[06:44:26] + Working ...
[06:44:26] - Calling '.\FahCore_14.exe -dir work/ -suffix 03 -priority 96 -checkpoint 15 -verbose -lifeline 2408 -version 623'

[06:44:26] 
[06:44:26] *------------------------------*
[06:44:26] Folding@Home GPU Core - Beta
[06:44:26] Version 1.26 (Wed Oct 14 13:09:26 PDT 2009)
[06:44:26] 
[06:44:26] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[06:44:26] Build host: vspm46
[06:44:26] Board Type: Nvidia
[06:44:26] Core      : 
[06:44:26] Preparing to commence simulation
[06:44:26] - Looking at optimizations...
[06:44:26] - Created dyn
[06:44:26] - Files status OK
[06:44:26] - Expanded 69084 -> 357580 (decompressed 517.6 percent)
[06:44:26] Called DecompressByteArray: compressed_data_size=69084 data_size=357580, decompressed_data_size=357580 diff=0
[06:44:26] - Digital signature verified
[06:44:26] 
[06:44:26] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:44:26] 
[06:44:26] Assembly optimizations on if available.
[06:44:26] Entering M.D.
[06:44:32] Tpr hash work/wudata_03.tpr:  2601140089 4139656169 2513526555 1158789911 1732238406
[06:44:32] Working on Protein
[06:44:33] mdrun_gpu returned 
[06:44:33] Self-test failure
[06:44:33] 
[06:44:33] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:44:36] CoreStatus = 7A (122)
[06:44:36] Sending work to server
[06:44:36] Project: 5911 (Run 3, Clone 35, Gen 4)
[06:44:36] - Read packet limit of 540015616... Set to 524286976.
[06:44:36] - Error: Could not get length of results file work/wuresults_03.dat
[06:44:36] - Error: Could not read unit 03 file. Removing from queue.
[06:44:36] EUE limit exceeded. Pausing 24 hours.
[09:11:18] ***** Got a SIGTERM signal (2)
[09:11:18] Killing all core threads

Folding@Home Client Shutdown.

Re: Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 12:56 pm
by toTOW
And there are 4 reports of immediate failure in the DB, so I've marked the WU as bad.

Re: Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 10:40 pm
by anko1
bruce wrote:Have you run MemtestG80?

I believe a self-test failure means your GPU has a hardware problem (including the possibility of a driver problem).
Yes, I've done MemtestG80 with no problems. I also updated the drivers when I was having problems with the GPU taking one core while also doing SMP projects (updating fixed this problem).

Re: Project: 5911 (Run 3, Clone 35, Gen 4) -self test failure

Posted: Sun Feb 28, 2010 10:42 pm
by anko1
toTOW wrote:And there are 4 reports of immediate failure in the DB, so I've marked the WU as bad.

Thanks, toTOW.