Project: 5765 (Run 0, Clone 153, Gen 249)

Moderators: Site Moderators, FAHC Science Team

Post Reply
HendricksSA
Posts: 339
Joined: Fri Jun 26, 2009 4:34 am

Project: 5765 (Run 0, Clone 153, Gen 249)

Post by HendricksSA »

It isn't my night for Project 5765. Same problem as my other post on Project: 5765 (Run 7, Clone 238, Gen 289). GPU NANs and the EUE limit again. Just posting FYI. I do have another question. If these failures are a result of my hardware (which is not overclocked and runs everything else) then I should follow your advice and check my Nvidia GPU/CPU/memory. Where can I find the various tools you refer to? One link for an Nvidia GPU checker led me to a repository of scientific software but it was only available to Stanford account holders.

Here are the relevant details. Thanks in advance.

Code: Select all

[06:07:32] + Attempting to get work packet
[06:07:32] - Connecting to assignment server
[06:07:32] - Successful: assigned to (171.67.108.11).
[06:07:32] + News From Folding@Home: Welcome to Folding@Home
[06:07:32] Loaded queue successfully.
[06:07:33] + Closed connections
[06:07:33] 
[06:07:33] + Processing work unit
[06:07:33] Core required: FahCore_11.exe
[06:07:33] Core found.
[06:07:33] Working on queue slot 04 [July 24 06:07:33 UTC]
[06:07:33] + Working ...
[06:07:34] 
[06:07:34] *------------------------------*
[06:07:34] Folding@Home GPU Core - Beta
[06:07:34] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[06:07:34] 
[06:07:34] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[06:07:34] Build host: amoeba
[06:07:34] Board Type: Nvidia
[06:07:34] Core      : 
[06:07:34] Preparing to commence simulation
[06:07:34] - Looking at optimizations...
[06:07:34] - Created dyn
[06:07:34] - Files status OK
[06:07:34] - Expanded 46750 -> 252912 (decompressed 540.9 percent)
[06:07:34] Called DecompressByteArray: compressed_data_size=46750 data_size=252912, decompressed_data_size=252912 diff=0
[06:07:34] - Digital signature verified
[06:07:34] 
[06:07:34] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:07:34] 
[06:07:34] Assembly optimizations on if available.
[06:07:34] Entering M.D.
[06:07:40] Working on Protein
[06:07:41] Client config found, loading data.
[06:07:41] mdrun_gpu returned 
[06:07:41] NANs detected on GPU
[06:07:41] 
[06:07:41] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:07:44] CoreStatus = 7A (122)
[06:07:44] Sending work to server
[06:07:44] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:07:44] - Read packet limit of 540015616... Set to 524286976.
[06:07:44] - Error: Could not get length of results file work/wuresults_04.dat
[06:07:44] - Error: Could not read unit 04 file. Removing from queue.
[06:07:44] - Preparing to get new work unit...
[06:07:44] + Attempting to get work packet
[06:07:44] - Connecting to assignment server
[06:07:45] - Successful: assigned to (171.67.108.11).
[06:07:45] + News From Folding@Home: Welcome to Folding@Home
[06:07:45] Loaded queue successfully.
[06:07:46] + Closed connections
[06:07:51] 
[06:07:51] + Processing work unit
[06:07:51] Core required: FahCore_11.exe
[06:07:51] Core found.
[06:07:51] Working on queue slot 05 [July 24 06:07:51 UTC]
[06:07:51] + Working ...
[06:07:51] 
[06:07:51] *------------------------------*
[06:07:51] Folding@Home GPU Core - Beta
[06:07:51] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[06:07:51] 
[06:07:51] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[06:07:51] Build host: amoeba
[06:07:51] Board Type: Nvidia
[06:07:51] Core      : 
[06:07:51] Preparing to commence simulation
[06:07:51] - Looking at optimizations...
[06:07:51] - Created dyn
[06:07:51] - Files status OK
[06:07:51] - Expanded 46750 -> 252912 (decompressed 540.9 percent)
[06:07:51] Called DecompressByteArray: compressed_data_size=46750 data_size=252912, decompressed_data_size=252912 diff=0
[06:07:51] - Digital signature verified
[06:07:51] 
[06:07:51] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:07:51] 
[06:07:51] Assembly optimizations on if available.
[06:07:51] Entering M.D.
[06:07:57] Working on Protein
[06:07:58] Client config found, loading data.
[06:07:58] mdrun_gpu returned 
[06:07:58] NANs detected on GPU
[06:07:58] 
[06:07:58] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:08:01] CoreStatus = 7A (122)
[06:08:01] Sending work to server
[06:08:01] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:01] - Read packet limit of 540015616... Set to 524286976.
[06:08:01] - Error: Could not get length of results file work/wuresults_05.dat
[06:08:01] - Error: Could not read unit 05 file. Removing from queue.
[06:08:01] - Preparing to get new work unit...
[06:08:01] + Attempting to get work packet
[06:08:01] - Connecting to assignment server
[06:08:01] - Successful: assigned to (171.67.108.11).
[06:08:01] + News From Folding@Home: Welcome to Folding@Home
[06:08:01] Loaded queue successfully.
[06:08:02] + Closed connections
[06:08:07] 
[06:08:07] + Processing work unit
[06:08:07] Core required: FahCore_11.exe
[06:08:07] Core found.
[06:08:07] Working on queue slot 06 [July 24 06:08:07 UTC]
[06:08:07] + Working ...
[06:08:07] 
[06:08:07] *------------------------------*
[06:08:07] Folding@Home GPU Core - Beta
[06:08:07] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[06:08:07] 
[06:08:07] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[06:08:07] Build host: amoeba
[06:08:07] Board Type: Nvidia
[06:08:07] Core      : 
[06:08:07] Preparing to commence simulation
[06:08:07] - Looking at optimizations...
[06:08:07] - Created dyn
[06:08:07] - Files status OK
[06:08:07] - Expanded 46750 -> 252912 (decompressed 540.9 percent)
[06:08:07] Called DecompressByteArray: compressed_data_size=46750 data_size=252912, decompressed_data_size=252912 diff=0
[06:08:07] - Digital signature verified
[06:08:08] 
[06:08:08] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:08] 
[06:08:08] Assembly optimizations on if available.
[06:08:08] Entering M.D.
[06:08:14] Working on Protein
[06:08:15] Client config found, loading data.
[06:08:15] Starting GUI Server
[06:08:15] mdrun_gpu returned 
[06:08:15] NANs detected on GPU
[06:08:15] 
[06:08:15] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:08:20] CoreStatus = 7A (122)
[06:08:20] Sending work to server
[06:08:20] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:20] - Read packet limit of 540015616... Set to 524286976.
[06:08:20] - Error: Could not get length of results file work/wuresults_06.dat
[06:08:20] - Error: Could not read unit 06 file. Removing from queue.
[06:08:20] - Preparing to get new work unit...
[06:08:20] + Attempting to get work packet
[06:08:20] - Connecting to assignment server
[06:08:20] - Successful: assigned to (171.67.108.11).
[06:08:20] + News From Folding@Home: Welcome to Folding@Home
[06:08:20] Loaded queue successfully.
[06:08:21] + Closed connections
[06:08:26] 
[06:08:26] + Processing work unit
[06:08:26] Core required: FahCore_11.exe
[06:08:26] Core found.
[06:08:26] Working on queue slot 07 [July 24 06:08:26 UTC]
[06:08:26] + Working ...
[06:08:26] 
[06:08:26] *------------------------------*
[06:08:26] Folding@Home GPU Core - Beta
[06:08:26] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[06:08:26] 
[06:08:26] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[06:08:26] Build host: amoeba
[06:08:26] Board Type: Nvidia
[06:08:26] Core      : 
[06:08:26] Preparing to commence simulation
[06:08:26] - Looking at optimizations...
[06:08:26] - Created dyn
[06:08:26] - Files status OK
[06:08:27] - Expanded 46750 -> 252912 (decompressed 540.9 percent)
[06:08:27] Called DecompressByteArray: compressed_data_size=46750 data_size=252912, decompressed_data_size=252912 diff=0
[06:08:27] - Digital signature verified
[06:08:27] 
[06:08:27] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:27] 
[06:08:27] Assembly optimizations on if available.
[06:08:27] Entering M.D.
[06:08:33] Working on Protein
[06:08:34] Client config found, loading data.
[06:08:34] mdrun_gpu returned 
[06:08:34] NANs detected on GPU
[06:08:34] 
[06:08:34] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:08:37] CoreStatus = 7A (122)
[06:08:37] Sending work to server
[06:08:37] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:37] - Read packet limit of 540015616... Set to 524286976.
[06:08:37] - Error: Could not get length of results file work/wuresults_07.dat
[06:08:37] - Error: Could not read unit 07 file. Removing from queue.
[06:08:37] - Preparing to get new work unit...
[06:08:37] + Attempting to get work packet
[06:08:37] - Connecting to assignment server
[06:08:37] - Successful: assigned to (171.67.108.11).
[06:08:37] + News From Folding@Home: Welcome to Folding@Home
[06:08:37] Loaded queue successfully.
[06:08:38] + Closed connections
[06:08:43] 
[06:08:43] + Processing work unit
[06:08:43] Core required: FahCore_11.exe
[06:08:43] Core found.
[06:08:43] Working on queue slot 08 [July 24 06:08:43 UTC]
[06:08:43] + Working ...
[06:08:44] 
[06:08:44] *------------------------------*
[06:08:44] Folding@Home GPU Core - Beta
[06:08:44] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[06:08:44] 
[06:08:44] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[06:08:44] Build host: amoeba
[06:08:44] Board Type: Nvidia
[06:08:44] Core      : 
[06:08:44] Preparing to commence simulation
[06:08:44] - Looking at optimizations...
[06:08:44] - Created dyn
[06:08:44] - Files status OK
[06:08:44] - Expanded 46750 -> 252912 (decompressed 540.9 percent)
[06:08:44] Called DecompressByteArray: compressed_data_size=46750 data_size=252912, decompressed_data_size=252912 diff=0
[06:08:44] - Digital signature verified
[06:08:44] 
[06:08:44] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:44] 
[06:08:44] Assembly optimizations on if available.
[06:08:44] Entering M.D.
[06:08:50] Working on Protein
[06:08:51] Client config found, loading data.
[06:08:51] mdrun_gpu returned 
[06:08:51] NANs detected on GPU
[06:08:51] 
[06:08:51] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:08:55] CoreStatus = 7A (122)
[06:08:55] Sending work to server
[06:08:55] Project: 5765 (Run 0, Clone 153, Gen 249)
[06:08:55] - Read packet limit of 540015616... Set to 524286976.
[06:08:55] - Error: Could not get length of results file work/wuresults_08.dat
[06:08:55] - Error: Could not read unit 08 file. Removing from queue.
[06:08:55] EUE limit exceeded. Pausing 24 hours.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 5765 (Run 0, Clone 153, Gen 249)

Post by bruce »

Initially memtestg80 was only on the site that required you to register but it's also on the stanford site, itself. if you'll point me to where you found that reference I'll move it to one of the unrestricted sites. I think the best way to find it is to start with the Stanford client download page. Near the bottom it mentions "utilities" which takes you to http://folding.stanford.edu/English/DownloadUtils
Post Reply