5768 (Run 13, Clone 101, Gen 818) EUE

linuxpng
Posts: 4
Joined: Wed May 05, 2010 8:14 pm

5768 (Run 13, Clone 101, Gen 818) EUE

Post by linuxpng »

Seems like I've had this unit before and it always dies before starting. I'm running other units w/o problems.

[19:48:56] + Attempting to send results [May 5 19:48:56 UTC]
[19:48:59] + Results successfully sent
[19:48:59] Thank you for your contribution to Folding@Home.
[19:48:59] + Number of Units Completed: 632

[19:49:03] - Preparing to get new work unit...
[19:49:03] + Attempting to get work packet
[19:49:03] - Connecting to assignment server
[19:49:03] - Successful: assigned to (171.67.108.11).
[19:49:03] + News From Folding@Home: Welcome to Folding@Home
[19:49:03] Loaded queue successfully.
[19:49:04] + Closed connections
[19:49:04] 
[19:49:04] + Processing work unit
[19:49:04] Core required: FahCore_11.exe
[19:49:04] Core found.
[19:49:04] Working on queue slot 06 [May 5 19:49:04 UTC]
[19:49:04] + Working ...
[19:49:04] 
[19:49:04] *------------------------------*
[19:49:04] Folding@Home GPU Core
[19:49:04] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[19:49:04] 
[19:49:04] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[19:49:04] Build host: amoeba
[19:49:04] Board Type: Nvidia
[19:49:04] Core      : 
[19:49:04] Preparing to commence simulation
[19:49:04] - Looking at optimizations...
[19:49:04] DeleteFrameFiles: successfully deleted file=work/wudata_06.ckp
[19:49:04] - Created dyn
[19:49:04] - Files status OK
[19:49:04] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[19:49:04] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[19:49:04] - Digital signature verified
[19:49:04] 
[19:49:04] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:04] 
[19:49:04] Assembly optimizations on if available.
[19:49:04] Entering M.D.
[19:49:10] Tpr hash work/wudata_06.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[19:49:10] 
[19:49:10] Calling fah_main args: 14 usage=100
[19:49:10] 
[19:49:11] Working on Protein
[19:49:12] mdrun_gpu returned 
[19:49:12] Self-test failure
[19:49:12] 
[19:49:12] Folding@home Core Shutdown: UNSTABLE_MACHINE
[19:49:14] CoreStatus = 7A (122)
[19:49:14] Sending work to server
[19:49:14] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:14] - Read packet limit of 540015616... Set to 524286976.
[19:49:14] - Error: Could not get length of results file work/wuresults_06.dat
[19:49:14] - Error: Could not read unit 06 file. Removing from queue.
[19:49:14] - Preparing to get new work unit...
[19:49:14] + Attempting to get work packet
[19:49:14] - Connecting to assignment server
[19:49:15] - Successful: assigned to (171.67.108.11).
[19:49:15] + News From Folding@Home: Welcome to Folding@Home
[19:49:15] Loaded queue successfully.
[19:49:16] + Closed connections
[19:49:21] 
[19:49:21] + Processing work unit
[19:49:21] Core required: FahCore_11.exe
[19:49:21] Core found.
[19:49:21] Working on queue slot 07 [May 5 19:49:21 UTC]
[19:49:21] + Working ...
[19:49:21] 
[19:49:21] *------------------------------*
[19:49:21] Folding@Home GPU Core
[19:49:21] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[19:49:21] 
[19:49:21] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[19:49:21] Build host: amoeba
[19:49:21] Board Type: Nvidia
[19:49:21] Core      : 
[19:49:21] Preparing to commence simulation
[19:49:21] - Looking at optimizations...
[19:49:21] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[19:49:21] - Created dyn
[19:49:21] - Files status OK
[19:49:21] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[19:49:21] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[19:49:21] - Digital signature verified
[19:49:21] 
[19:49:21] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:21] 
[19:49:21] Assembly optimizations on if available.
[19:49:21] Entering M.D.
[19:49:27] Tpr hash work/wudata_07.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[19:49:27] 
[19:49:27] Calling fah_main args: 14 usage=100
[19:49:27] 
[19:49:28] Working on Protein
[19:49:29] mdrun_gpu returned 
[19:49:29] Self-test failure
[19:49:29] 
[19:49:29] Folding@home Core Shutdown: UNSTABLE_MACHINE
[19:49:31] CoreStatus = 7A (122)
[19:49:31] Sending work to server
[19:49:31] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:31] - Read packet limit of 540015616... Set to 524286976.
[19:49:31] - Error: Could not get length of results file work/wuresults_07.dat
[19:49:31] - Error: Could not read unit 07 file. Removing from queue.
[19:49:31] - Preparing to get new work unit...
[19:49:31] + Attempting to get work packet
[19:49:31] - Connecting to assignment server
[19:49:32] - Successful: assigned to (171.67.108.11).
[19:49:32] + News From Folding@Home: Welcome to Folding@Home
[19:49:32] Loaded queue successfully.
[19:49:33] + Closed connections
[19:49:38] 
[19:49:38] + Processing work unit
[19:49:38] Core required: FahCore_11.exe
[19:49:38] Core found.
[19:49:38] Working on queue slot 08 [May 5 19:49:38 UTC]
[19:49:38] + Working ...
[19:49:38] 
[19:49:38] *------------------------------*
[19:49:38] Folding@Home GPU Core
[19:49:38] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[19:49:38] 
[19:49:38] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[19:49:38] Build host: amoeba
[19:49:38] Board Type: Nvidia
[19:49:38] Core      : 
[19:49:38] Preparing to commence simulation
[19:49:38] - Looking at optimizations...
[19:49:38] DeleteFrameFiles: successfully deleted file=work/wudata_08.ckp
[19:49:38] - Created dyn
[19:49:38] - Files status OK
[19:49:38] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[19:49:38] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[19:49:38] - Digital signature verified
[19:49:38] 
[19:49:38] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:38] 
[19:49:38] Assembly optimizations on if available.
[19:49:38] Entering M.D.
[19:49:44] Tpr hash work/wudata_08.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[19:49:44] 
[19:49:44] Calling fah_main args: 14 usage=100
[19:49:44] 
[19:49:45] Working on Protein
[19:49:46] mdrun_gpu returned 
[19:49:46] Self-test failure
[19:49:46] 
[19:49:46] Folding@home Core Shutdown: UNSTABLE_MACHINE
[19:49:48] CoreStatus = 7A (122)
[19:49:48] Sending work to server
[19:49:48] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:48] - Read packet limit of 540015616... Set to 524286976.
[19:49:48] - Error: Could not get length of results file work/wuresults_08.dat
[19:49:48] - Error: Could not read unit 08 file. Removing from queue.
[19:49:48] - Preparing to get new work unit...
[19:49:48] + Attempting to get work packet
[19:49:48] - Connecting to assignment server
[19:49:49] - Successful: assigned to (171.67.108.11).
[19:49:49] + News From Folding@Home: Welcome to Folding@Home
[19:49:49] Loaded queue successfully.
[19:49:50] + Closed connections
[19:49:55] 
[19:49:55] + Processing work unit
[19:49:55] Core required: FahCore_11.exe
[19:49:55] Core found.
[19:49:55] Working on queue slot 09 [May 5 19:49:55 UTC]
[19:49:55] + Working ...
[19:49:55] 
[19:49:55] *------------------------------*
[19:49:55] Folding@Home GPU Core
[19:49:55] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[19:49:55] 
[19:49:55] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[19:49:55] Build host: amoeba
[19:49:55] Board Type: Nvidia
[19:49:55] Core      : 
[19:49:55] Preparing to commence simulation
[19:49:55] - Looking at optimizations...
[19:49:55] DeleteFrameFiles: successfully deleted file=work/wudata_09.ckp
[19:49:55] - Created dyn
[19:49:55] - Files status OK
[19:49:55] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[19:49:55] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[19:49:55] - Digital signature verified
[19:49:55] 
[19:49:55] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:49:55] 
[19:49:55] Assembly optimizations on if available.
[19:49:55] Entering M.D.
[19:50:01] Tpr hash work/wudata_09.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[19:50:01] 
[19:50:01] Calling fah_main args: 14 usage=100
[19:50:01] 
[19:50:02] Working on Protein
[19:50:02] mdrun_gpu returned 
[19:50:02] Self-test failure
[19:50:02] 
[19:50:02] Folding@home Core Shutdown: UNSTABLE_MACHINE
[19:50:05] CoreStatus = 7A (122)
[19:50:05] Sending work to server
[19:50:05] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:50:05] - Read packet limit of 540015616... Set to 524286976.
[19:50:05] - Error: Could not get length of results file work/wuresults_09.dat
[19:50:05] - Error: Could not read unit 09 file. Removing from queue.
[19:50:05] - Preparing to get new work unit...
[19:50:05] + Attempting to get work packet
[19:50:05] - Connecting to assignment server
[19:50:06] - Successful: assigned to (171.67.108.11).
[19:50:06] + News From Folding@Home: Welcome to Folding@Home
[19:50:06] Loaded queue successfully.
[19:50:07] + Closed connections
[19:50:12] 
[19:50:12] + Processing work unit
[19:50:12] Core required: FahCore_11.exe
[19:50:12] Core found.
[19:50:12] Working on queue slot 00 [May 5 19:50:12 UTC]
[19:50:12] + Working ...
[19:50:12] 
[19:50:12] *------------------------------*
[19:50:12] Folding@Home GPU Core
[19:50:12] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[19:50:12] 
[19:50:12] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[19:50:12] Build host: amoeba
[19:50:12] Board Type: Nvidia
[19:50:12] Core      : 
[19:50:12] Preparing to commence simulation
[19:50:12] - Looking at optimizations...
[19:50:12] DeleteFrameFiles: successfully deleted file=work/wudata_00.ckp
[19:50:12] - Created dyn
[19:50:12] - Files status OK
[19:50:12] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[19:50:12] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[19:50:12] - Digital signature verified
[19:50:12] 
[19:50:12] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:50:12] 
[19:50:12] Assembly optimizations on if available.
[19:50:12] Entering M.D.
[19:50:18] Tpr hash work/wudata_00.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[19:50:18] 
[19:50:18] Calling fah_main args: 14 usage=100
[19:50:18] 
[19:50:19] Working on Protein
[19:50:19] mdrun_gpu returned 
[19:50:19] Self-test failure
[19:50:19] 
[19:50:19] Folding@home Core Shutdown: UNSTABLE_MACHINE
[19:50:22] CoreStatus = 7A (122)
[19:50:22] Sending work to server
[19:50:22] Project: 5768 (Run 13, Clone 101, Gen 818)
[19:50:22] - Read packet limit of 540015616... Set to 524286976.
[19:50:22] - Error: Could not get length of results file work/wuresults_00.dat
[19:50:22] - Error: Could not read unit 00 file. Removing from queue.
[19:50:22] EUE limit exceeded. Pausing 24 hours.

Folding@Home Client Shutdown.


--- Opening Log file [May 5 20:07:00 UTC] 


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\barcomb\AppData\Roaming\Folding@Home-gpu2
Arguments: -gpu 1 

[20:07:00] - Ask before connecting: No
[20:07:00] - User name: goobus (Team 169927)
[20:07:00] - User ID: 305680C342E5379C
[20:07:00] - Machine ID: 11
[20:07:00] 
[20:07:00] Loaded queue successfully.
[20:07:00] Initialization complete
[20:07:00] - Preparing to get new work unit...
[20:07:00] + Attempting to get work packet
[20:07:00] - Connecting to assignment server
[20:07:00] - Successful: assigned to (171.67.108.11).
[20:07:00] + News From Folding@Home: Welcome to Folding@Home
[20:07:01] Loaded queue successfully.
[20:07:01] + Closed connections
[20:07:01] 
[20:07:01] + Processing work unit
[20:07:01] Core required: FahCore_11.exe
[20:07:01] Core found.
[20:07:01] Working on queue slot 01 [May 5 20:07:01 UTC]
[20:07:01] + Working ...
[20:07:01] 
[20:07:01] *------------------------------*
[20:07:01] Folding@Home GPU Core
[20:07:01] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[20:07:01] 
[20:07:01] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[20:07:01] Build host: amoeba
[20:07:01] Board Type: Nvidia
[20:07:01] Core      : 
[20:07:01] Preparing to commence simulation
[20:07:01] - Looking at optimizations...
[20:07:01] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[20:07:01] - Created dyn
[20:07:01] - Files status OK
[20:07:01] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[20:07:01] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[20:07:01] - Digital signature verified
[20:07:01] 
[20:07:01] Project: 5768 (Run 13, Clone 101, Gen 818)
[20:07:01] 
[20:07:01] Assembly optimizations on if available.
[20:07:01] Entering M.D.
[20:07:07] Tpr hash work/wudata_01.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[20:07:07] 
[20:07:07] Calling fah_main args: 14 usage=100
[20:07:07] 
[20:07:08] Working on Protein
[20:07:09] mdrun_gpu returned 
[20:07:09] Self-test failure
[20:07:09] 
[20:07:09] Folding@home Core Shutdown: UNSTABLE_MACHINE
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by bruce »

There's no record of you uploading that WU, but it has been completed successfully by others. (In fact, Gen 819 has already been completed.)

What setting do you have configured for Small/Normal/Big? The message "Read packet limit of 540015616... Set to 524286976." often seems to indicate that you need to configure FAH to accept bigger WUs, although it might mean that the Pande Group has incorrectly classified that project's size.

I don't understand how that can be true with a project that dies before it gets started, though.
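
For scale, the two byte counts in that "Read packet limit" message can be converted to mebibytes with a few lines of Python. This is only arithmetic on the numbers in the log. Both values come out exactly 1 KiB short of a round figure (515 MiB and 500 MiB), and both are far above 10 MB. What the client actually does with these limits is not shown here, and reading the client's "10 MB" option as 10 * 1024 * 1024 bytes is an assumption.

```python
MIB = 1024 * 1024  # bytes per mebibyte

requested = 540015616  # from the log: "Read packet limit of 540015616..."
applied = 524286976    # from the log: "Set to 524286976."
threshold = 10 * MIB   # assumption: the "10 MB" option read as 10 MiB

for label, value in (("requested", requested), ("applied", applied)):
    # Show each value in MiB and whether it exceeds the assumed threshold.
    print(f"{label}: {value} bytes = {value / MIB:.3f} MiB, "
          f"over threshold: {value > threshold}")

# Difference between the limit that was read and the limit that was set.
print(f"difference: {(requested - applied) / MIB:.0f} MiB")
```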
linuxpng
Posts: 4
Joined: Wed May 05, 2010 8:14 pm

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by linuxpng »

I'm not using the console client but the Windows systray one. What setting am I looking for, or is this an argument I would need to add manually?
toTOW
Site Moderator
Posts: 6435
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by toTOW »

"Allow receipt of work assignment and return of results greater than 10 MB in size" in the Connection tab.

Edit: Self-test failure can occur in two situations:

- on a bad WU: you'll see the error 6 times (the client pauses for 24 hours after 5 attempts), and then it moves on to a working WU.
- on a bad card: in this case, the error will occur on various WUs (different PRCG).
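
The distinction above (same PRCG every time versus different PRCGs) can be checked against a client log with a rough sketch like the one below. It assumes the log format shown in this thread, where a "Project: ... (Run ..., Clone ..., Gen ...)" line precedes each "Self-test failure" line. The function names `failures_by_prcg` and `diagnose` are made up for illustration, and the verdict is only a heuristic, not an official diagnostic.

```python
import re
from collections import Counter

# Matches lines such as:
# [19:49:04] Project: 5768 (Run 13, Clone 101, Gen 818)
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")

def failures_by_prcg(log_lines):
    """Count 'Self-test failure' lines per (project, run, clone, gen)."""
    counts = Counter()
    current = None  # most recent PRCG seen in the log
    for line in log_lines:
        match = PROJECT_RE.search(line)
        if match:
            current = tuple(int(group) for group in match.groups())
        elif "Self-test failure" in line and current is not None:
            counts[current] += 1
    return counts

def diagnose(counts):
    """Heuristic only: one PRCG hints at a bad WU, several hint at the card."""
    if not counts:
        return "no self-test failures found"
    if len(counts) == 1:
        return "all failures on one PRCG: possibly a bad WU"
    return "failures on several PRCGs: possibly the card or its setup"

# A few lines in the format of the log posted earlier in this thread.
sample = [
    "[19:49:04] Project: 5768 (Run 13, Clone 101, Gen 818)",
    "[19:49:12] Self-test failure",
    "[19:49:14] Project: 5768 (Run 13, Clone 101, Gen 818)",
    "[19:49:21] Project: 5768 (Run 13, Clone 101, Gen 818)",
    "[19:49:29] Self-test failure",
]
counts = failures_by_prcg(sample)
print(dict(counts))
print(diagnose(counts))
```

To run it on a real log, pass the lines of the client's log file to `failures_by_prcg` in place of `sample`.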

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
RAH
Posts: 131
Joined: Sun Dec 02, 2007 6:29 am
Hardware configuration: 1. C2Q 8200@2880 / W7Pro64 / SMP2 / 2 GPU - GTS250/GTS450
2. C2D 6300@3600 / XPsp3 / SMP2 / 1 GPU - GT240
Location: Florida

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by RAH »

What I don't understand is this: if Gen 819 is already finished, and Gen 818 had to be done first for that to happen, why on earth is the AS giving it out?
linuxpng
Posts: 4
Joined: Wed May 05, 2010 8:14 pm

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by linuxpng »

toTOW wrote: "Allow receipt of work assignment and return of results greater than 10 MB in size" in the Connection tab.

Edit: Self-test failure can occur in two situations:

- on a bad WU: you'll see the error 6 times (the client pauses for 24 hours after 5 attempts), and then it moves on to a working WU.
- on a bad card: in this case, the error will occur on various WUs (different PRCG).

Yeah, that option is checked. I get this A LOT with 57xx series units. I've run MemtestG80 on both GPUs of my 295 and both pass with 0 errors.

**edit** Vista x64 SP2, driver 191.07
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by bruce »

RAH wrote: What I don't understand is this: if Gen 819 is already finished, and Gen 818 had to be done first for that to happen, why on earth is the AS giving it out?

What makes you think that's true? Look at the date/time associated with the reports. The log shows it was reassigned around [May 5 19:50:12 UTC]. It was completed by someone about two hours later, at 15:04:29 PDT [22:04:29 UTC]. Gen 819 was completed two hours after that, at 17:03:15 PDT [00:03:15 UTC], and my post was half an hour later, at 17:38 PDT [00:38 UTC].
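
The PDT to UTC conversions in that timeline can be double-checked with a short sketch. It uses a fixed UTC-7 offset for PDT, and the year 2010 is an assumption inferred from the join date shown in the thread. The helper name `pdt_to_utc` and the event labels are illustrative only.

```python
from datetime import datetime, timedelta, timezone

# PDT is UTC-7; a fixed offset avoids needing a time zone database.
PDT = timezone(timedelta(hours=-7), "PDT")

def pdt_to_utc(year, month, day, hour, minute, second=0):
    """Convert a PDT wall-clock time to an aware UTC datetime."""
    local = datetime(year, month, day, hour, minute, second, tzinfo=PDT)
    return local.astimezone(timezone.utc)

# Year 2010 is assumed; times are the ones quoted in the post above.
events = [
    ("Gen 818 reassigned (client log)",
     datetime(2010, 5, 5, 19, 50, 12, tzinfo=timezone.utc)),
    ("Gen 818 completed by someone else", pdt_to_utc(2010, 5, 5, 15, 4, 29)),
    ("Gen 819 completed", pdt_to_utc(2010, 5, 5, 17, 3, 15)),
    ("forum reply posted", pdt_to_utc(2010, 5, 5, 17, 38)),
]

# Print the events in chronological order, all in UTC.
for label, when in sorted(events, key=lambda event: event[1]):
    print(when.strftime("%Y-%m-%d %H:%M:%S"), "UTC", label)
```

Sorting by the UTC value makes the order of events explicit even though the last two fall on the next calendar day in UTC.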
RAH
Posts: 131
Joined: Sun Dec 02, 2007 6:29 am
Hardware configuration: 1. C2Q 8200@2880 / W7Pro64 / SMP2 / 2 GPU - GTS250/GTS450
2. C2D 6300@3600 / XPsp3 / SMP2 / 1 GPU - GT240
Location: Florida

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by RAH »

Last assignment was at 22:07 UTC. After completion.
codysluder
Posts: 1024
Joined: Sun Dec 02, 2007 12:43 pm

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by codysluder »

RAH wrote:Last assignment was at 22:07 UTC. After completion.
Really? It looks like 20:07 UTC to me:
linuxpng wrote:

[20:07:00] - Connecting to assignment server
[20:07:00] - Successful: assigned to (171.67.108.11).
[20:07:00] + News From Folding@Home: Welcome to Folding@Home
[20:07:01] Loaded queue successfully.
[20:07:01] + Closed connections
[20:07:01] 
[20:07:01] + Processing work unit
[20:07:01] Core required: FahCore_11.exe
[20:07:01] Core found.
[20:07:01] Working on queue slot 01 [May 5 20:07:01 UTC]
[20:07:01] + Working ...
[20:07:01] 
[20:07:01] *------------------------------*
[20:07:01] Folding@Home GPU Core
[20:07:01] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[20:07:01] 
[20:07:01] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[20:07:01] Build host: amoeba
[20:07:01] Board Type: Nvidia
[20:07:01] Core      : 
[20:07:01] Preparing to commence simulation
[20:07:01] - Looking at optimizations...
[20:07:01] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[20:07:01] - Created dyn
[20:07:01] - Files status OK
[20:07:01] - Expanded 46662 -> 252912 (decompressed 542.0 percent)
[20:07:01] Called DecompressByteArray: compressed_data_size=46662 data_size=252912, decompressed_data_size=252912 diff=0
[20:07:01] - Digital signature verified
[20:07:01] 
[20:07:01] Project: 5768 (Run 13, Clone 101, Gen 818)
[20:07:01] 
[20:07:01] Assembly optimizations on if available.
[20:07:01] Entering M.D.
[20:07:07] Tpr hash work/wudata_01.tpr:  2856244317 2956492547 3884238936 3127460894 350911855
[20:07:07] 
[20:07:07] Calling fah_main args: 14 usage=100
[20:07:07] 
[20:07:08] Working on Protein
[20:07:09] mdrun_gpu returned 
[20:07:09] Self-test failure
[20:07:09] 
[20:07:09] Folding@home Core Shutdown: UNSTABLE_MACHINE
linuxpng
Posts: 4
Joined: Wed May 05, 2010 8:14 pm

Re: 5768 (Run 13, Clone 101, Gen 818) EUE

Post by linuxpng »

toTOW wrote: "Allow receipt of work assignment and return of results greater than 10 MB in size" in the Connection tab.

Edit: Self-test failure can occur in two situations:

- on a bad WU: you'll see the error 6 times (the client pauses for 24 hours after 5 attempts), and then it moves on to a working WU.
- on a bad card: in this case, the error will occur on various WUs (different PRCG).

Well, I got sick of the failures and just stopped folding for a while. Having read in the EVGA forums about someone else having this problem (and the solution I used), I said "screw it", uninstalled the systray client, removed the respective directories within XXX\appdata\roaming, and installed the command line client.

I've been folding nothing but 57xx units for over 24 hours now w/o issues.

I had already removed queue.dat, the work folder, and fahcore_11.exe several times before and let them get redownloaded, so I have no idea what was actually broken. I never had any MemtestG80 failures either.
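
The manual reset mentioned above (removing queue.dat, the work folder, and the FahCore_11.exe core so the client downloads them again) could be scripted roughly as follows. This is a sketch under assumptions: the function name `reset_client_state` is made up, the client directory has to be supplied by the user, and the client should be stopped first. Removing the work folder discards any unit in progress, and as noted above this step alone did not cure the failures here.

```python
import shutil
import tempfile
from pathlib import Path

# Item names as they appear in this thread; the directory holding them
# is passed in by the caller and is not guessed here.
STATE_ITEMS = ("queue.dat", "work", "FahCore_11.exe")

def reset_client_state(client_dir, dry_run=True):
    """Remove queue.dat, the work folder and FahCore_11.exe from client_dir.

    Returns the paths that were (or, in a dry run, would be) removed.
    """
    base = Path(client_dir)
    removed = []
    for name in STATE_ITEMS:
        target = base / name
        if not target.exists():
            continue
        removed.append(target)
        if dry_run:
            continue  # report only, delete nothing
        if target.is_dir():
            shutil.rmtree(target)
        else:
            target.unlink()
    return removed

# Demonstration on a throwaway directory instead of a real client folder.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "queue.dat").write_text("")
    (Path(tmp) / "work").mkdir()
    (Path(tmp) / "work" / "wudata_01.dat").write_text("")
    planned = reset_client_state(tmp)                 # dry run
    removed = reset_client_state(tmp, dry_run=False)  # actual removal
    leftovers = sorted(p.name for p in Path(tmp).iterdir())

print([p.name for p in planned], leftovers)
```

The dry run is the default so that the list of affected paths can be reviewed before anything is deleted.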