Project: 6501 (Run 4, Clone 1, Gen 88)

Post by Fireball0236 »

More ERROR 0x0s: three times in a row. The client then re-downloaded its core and got a new (big) WU, which it's working on at the moment.

Is it just the Linux client that errors out with a 0x0 every time, rather than with one of EUE, Unstable_Machine, NaNs detected, or "cannot continue further"?


Header information:

--- Opening Log file [October 25 04:54:16] 


# Linux Console Edition #######################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/s0192755/Desktop/folding/stavelot
Executable: ./fah6
Arguments: -verbosity 9 

[04:54:16] - Ask before connecting: No
[04:54:16] - User name: Fireball0236 (Team 194596)
[04:54:16] - User ID: XXXX
[04:54:16] - Machine ID: 1
[04:54:16] 
[04:54:16] Loaded queue successfully.
[04:54:16] - Autosending finished units...
[04:54:16] 
[04:54:16] Trying to send all finished work units
[04:54:16] + Processing work unit
[04:54:16] + No unsent completed units remaining.
[04:54:16] Core required: FahCore_78.exe
[04:54:16] - Autosend completed
[04:54:16] Core found.
[04:54:16] Working on Unit 06 [October 25 04:54:16]
[04:54:16] + Working ...

Error log:

[09:20:59] - Preparing to get new work unit...
[09:20:59] + Attempting to get work packet
[09:20:59] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[09:20:59] - Connecting to assignment server
[09:20:59] Connecting to http://assign.stanford.edu:8080/
[09:21:00] Posted data.
[09:21:00] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[09:21:00] + News From Folding@Home: Welcome to Folding@Home
[09:21:00] Loaded queue successfully.
[09:21:00] Connecting to http://171.64.65.62:8080/
[09:21:01] Posted data.
[09:21:01] Initial: 0000; - Receiving payload (expected size: 749467)
[09:21:03] - Downloaded at ~365 kB/s
[09:21:03] - Averaged speed for that direction ~210 kB/s
[09:21:03] + Received work.
[09:21:03] Trying to send all finished work units
[09:21:03] + No unsent completed units remaining.
[09:21:03] + Closed connections
[09:21:03] 
[09:21:03] + Processing work unit
[09:21:03] Core required: FahCore_78.exe
[09:21:03] Core found.
[09:21:03] Working on Unit 07 [October 25 09:21:03]
[09:21:03] + Working ...
[09:21:03] - Calling './FahCore_78.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 32215 -version 602'

[09:21:04] 
[09:21:04] *------------------------------*
[09:21:04] Folding@Home Gromacs Core
[09:21:04] Version 1.90 (March 8, 2006)
[09:21:04] 
[09:21:04] Preparing to commence simulation
[09:21:04] - Looking at optimizations...
[09:21:04] - Created dyn
[09:21:04] - Files status OK
[09:21:04] - Expanded 748955 -> 3748137 (decompressed 500.4 percent)
[09:21:04] - Starting from initial work packet
[09:21:04] 
[09:21:04] Project: 6501 (Run 4, Clone 1, Gen 88)
[09:21:04] 
[09:21:04] Assembly optimizations on if available.
[09:21:04] Entering M.D.
[09:21:11] Protein: UBIQUITIN MODEL1250 in water
[09:21:11] 
[09:21:11] Writing local files
[09:21:11] Extra SSE boost OK.
[09:21:11] Writing local files
[09:21:11] Completed 0 out of 250000 steps  (0%)
[09:27:45] Writing local files
[09:27:45] Completed 2500 out of 250000 steps  (1%)
[09:28:36] CoreStatus = 0 (0)
[09:28:36] Client-core communications error: ERROR 0x0
[09:28:36] Deleting current work unit & continuing...
[09:28:54] Trying to send all finished work units
[09:28:54] + No unsent completed units remaining.
[09:28:54] - Preparing to get new work unit...
[09:28:54] + Attempting to get work packet
[09:28:54] - Connecting to assignment server
[09:28:54] Connecting to http://assign.stanford.edu:8080/
[09:28:55] Posted data.
[09:28:55] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[09:28:55] + News From Folding@Home: Welcome to Folding@Home
[09:28:55] Loaded queue successfully.
[09:28:55] Connecting to http://171.64.65.62:8080/
[09:28:56] Posted data.
[09:28:56] Initial: 0000; - Receiving payload (expected size: 749467)
[09:28:58] - Downloaded at ~365 kB/s
[09:28:58] - Averaged speed for that direction ~241 kB/s
[09:28:58] + Received work.
[09:28:59] + Closed connections
[09:29:04] 
[09:29:04] + Processing work unit
[09:29:04] Core required: FahCore_78.exe
[09:29:04] Core found.
[09:29:04] Working on Unit 08 [October 25 09:29:04]
[09:29:04] + Working ...
[09:29:04] - Calling './FahCore_78.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 32215 -version 602'

[09:29:04] 
[09:29:04] *------------------------------*
[09:29:04] Folding@Home Gromacs Core
[09:29:04] Version 1.90 (March 8, 2006)
[09:29:04] 
[09:29:04] Preparing to commence simulation
[09:29:04] - Looking at optimizations...
[09:29:04] - Created dyn
[09:29:04] - Files status OK
[09:29:04] - Expanded 748955 -> 3748137 (decompressed 500.4 percent)
[09:29:04] - Starting from initial work packet
[09:29:04] 
[09:29:04] Project: 6501 (Run 4, Clone 1, Gen 88)
[09:29:04] 
[09:29:04] Assembly optimizations on if available.
[09:29:04] Entering M.D.
[09:29:11] Protein: UBIQUITIN MODEL1250 in water
[09:29:11] 
[09:29:11] Writing local files
[09:29:11] Extra SSE boost OK.
[09:29:11] Writing local files
[09:29:11] Completed 0 out of 250000 steps  (0%)
[09:35:45] Writing local files
[09:35:46] Completed 2500 out of 250000 steps  (1%)
[09:36:36] CoreStatus = 0 (0)
[09:36:36] Client-core communications error: ERROR 0x0
[09:36:36] Deleting current work unit & continuing...
[09:36:55] Trying to send all finished work units
[09:36:55] + No unsent completed units remaining.
[09:36:55] - Preparing to get new work unit...
[09:36:55] + Attempting to get work packet
[09:36:55] - Connecting to assignment server
[09:36:55] Connecting to http://assign.stanford.edu:8080/
[09:36:56] Posted data.
[09:36:56] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[09:36:56] + News From Folding@Home: Welcome to Folding@Home
[09:36:56] Loaded queue successfully.
[09:36:56] Connecting to http://171.64.65.62:8080/
[09:36:57] Posted data.
[09:36:57] Initial: 0000; - Receiving payload (expected size: 749467)
[09:36:59] - Downloaded at ~365 kB/s
[09:36:59] - Averaged speed for that direction ~266 kB/s
[09:36:59] + Received work.
[09:36:59] + Closed connections
[09:37:04] 
[09:37:04] + Processing work unit
[09:37:04] Core required: FahCore_78.exe
[09:37:04] Core found.
[09:37:04] Working on Unit 09 [October 25 09:37:04]
[09:37:04] + Working ...
[09:37:04] - Calling './FahCore_78.exe -dir work/ -suffix 09 -checkpoint 15 -verbose -lifeline 32215 -version 602'

[09:37:04] 
[09:37:04] *------------------------------*
[09:37:04] Folding@Home Gromacs Core
[09:37:04] Version 1.90 (March 8, 2006)
[09:37:04] 
[09:37:04] Preparing to commence simulation
[09:37:04] - Looking at optimizations...
[09:37:04] - Created dyn
[09:37:04] - Files status OK
[09:37:05] - Expanded 748955 -> 3748137 (decompressed 500.4 percent)
[09:37:05] - Starting from initial work packet
[09:37:05] 
[09:37:05] Project: 6501 (Run 4, Clone 1, Gen 88)
[09:37:05] 
[09:37:05] Assembly optimizations on if available.
[09:37:05] Entering M.D.
[09:37:11] Protein: UBIQUITIN MODEL1250 in water
[09:37:11] 
[09:37:11] Writing local files
[09:37:11] Extra SSE boost OK.
[09:37:11] Writing local files
[09:37:12] Completed 0 out of 250000 steps  (0%)
[09:43:46] Writing local files
[09:43:46] Completed 2500 out of 250000 steps  (1%)
[09:44:36] CoreStatus = 0 (0)
[09:44:36] Client-core communications error: ERROR 0x0
[09:44:36] - Attempting to download new core...
[09:44:36] + Downloading new core: FahCore_78.exe
[09:44:36] Downloading core (/~pande/Linux/x86/Core_78.fah from www.stanford.edu)
[09:44:38] Initial: AFDE; + 10240 bytes downloaded
[09:44:38] Initial: CC14; + 20480 bytes downloaded
[09:44:39] Initial: BEE8; + 30720 bytes downloaded
[09:44:39] Initial: DAAF; + 40960 bytes downloaded
[09:44:39] Initial: 1C7A; + 51200 bytes downloaded
[09:44:39] Initial: 5758; + 61440 bytes downloaded
[09:44:39] Initial: 4CCD; + 71680 bytes downloaded
[09:44:39] Initial: 15EF; + 81920 bytes downloaded
[09:44:39] Initial: 48E8; + 92160 bytes downloaded
[09:44:39] Initial: D320; + 102400 bytes downloaded
[09:44:39] Initial: 82DB; + 112640 bytes downloaded
[09:44:39] Initial: 4576; + 122880 bytes downloaded
[09:44:39] Initial: FB62; + 133120 bytes downloaded
[09:44:39] Initial: 71CD; + 143360 bytes downloaded
[09:44:39] Initial: F63A; + 153600 bytes downloaded
[09:44:39] Initial: 0B66; + 163840 bytes downloaded
[09:44:39] Initial: C516; + 174080 bytes downloaded
[09:44:39] Initial: 3E7D; + 184320 bytes downloaded
[09:44:39] Initial: D29C; + 194560 bytes downloaded
[09:44:39] Initial: E3AD; + 204800 bytes downloaded
[09:44:39] Initial: ACFA; + 215040 bytes downloaded
[09:44:39] Initial: 348C; + 225280 bytes downloaded
[09:44:39] Initial: F2B6; + 235520 bytes downloaded
[09:44:39] Initial: CC9E; + 245760 bytes downloaded
[09:44:39] Initial: 1231; + 256000 bytes downloaded
[09:44:39] Initial: 9693; + 266240 bytes downloaded
[09:44:39] Initial: 4073; + 276480 bytes downloaded
[09:44:39] Initial: 616B; + 286720 bytes downloaded
[09:44:39] Initial: 5E96; + 296960 bytes downloaded
[09:44:39] Initial: 4430; + 307200 bytes downloaded
[09:44:39] Initial: B959; + 317440 bytes downloaded
[09:44:39] Initial: 48AC; + 327680 bytes downloaded
[09:44:39] Initial: 7846; + 337920 bytes downloaded
[09:44:39] Initial: 0B78; + 348160 bytes downloaded
[09:44:39] Initial: 653D; + 358400 bytes downloaded
[09:44:39] Initial: B0D6; + 368640 bytes downloaded
[09:44:39] Initial: 841B; + 378880 bytes downloaded
[09:44:39] Initial: 75AF; + 389120 bytes downloaded
[09:44:39] Initial: B47C; + 399360 bytes downloaded
[09:44:39] Initial: 4DC0; + 409600 bytes downloaded
[09:44:39] Initial: 8F7E; + 419840 bytes downloaded
[09:44:39] Initial: 9EF4; + 430080 bytes downloaded
[09:44:39] Initial: 0181; + 440320 bytes downloaded
[09:44:39] Initial: 503C; + 450560 bytes downloaded
[09:44:40] Initial: 2D30; + 460800 bytes downloaded
[09:44:40] Initial: 8867; + 471040 bytes downloaded
[09:44:40] Initial: CE43; + 481280 bytes downloaded
[09:44:40] Initial: 614C; + 491520 bytes downloaded
[09:44:40] Initial: 96F2; + 501760 bytes downloaded
[09:44:40] Initial: 252D; + 512000 bytes downloaded
[09:44:40] Initial: 97FE; + 522240 bytes downloaded
[09:44:40] Initial: 1024; + 532480 bytes downloaded
[09:44:40] Initial: 0666; + 542720 bytes downloaded
[09:44:40] Initial: 53CF; + 552960 bytes downloaded
[09:44:40] Initial: D31E; + 563200 bytes downloaded
[09:44:40] Initial: 1A46; + 573440 bytes downloaded
[09:44:40] Initial: B2C1; + 583680 bytes downloaded
[09:44:40] Initial: 17AF; + 593920 bytes downloaded
[09:44:40] Initial: BE0D; + 604160 bytes downloaded
[09:44:40] Initial: 79C2; + 614400 bytes downloaded
[09:44:40] Initial: 6B14; + 624640 bytes downloaded
[09:44:40] Initial: 1611; + 634880 bytes downloaded
[09:44:40] Initial: 4B64; + 645120 bytes downloaded
[09:44:40] Initial: E520; + 655360 bytes downloaded
[09:44:40] Initial: ADD2; + 665600 bytes downloaded
[09:44:40] Initial: 4218; + 675840 bytes downloaded
[09:44:40] Initial: 7E58; + 686080 bytes downloaded
[09:44:40] Initial: 913F; + 696320 bytes downloaded
[09:44:40] Initial: A369; + 706560 bytes downloaded
[09:44:40] Initial: 8E3A; + 716800 bytes downloaded
[09:44:40] Initial: D3A6; + 727040 bytes downloaded
[09:44:40] Initial: D3CB; + 737280 bytes downloaded
[09:44:40] Initial: 6736; + 747520 bytes downloaded
[09:44:40] Initial: 071F; + 757760 bytes downloaded
[09:44:40] Initial: AC46; + 768000 bytes downloaded
[09:44:40] Initial: 1B7F; + 778240 bytes downloaded
[09:44:40] Initial: 1E88; + 788480 bytes downloaded
[09:44:40] Initial: 5A90; + 798720 bytes downloaded
[09:44:40] Initial: 5F2E; + 808960 bytes downloaded
[09:44:40] Initial: AC86; + 819200 bytes downloaded
[09:44:40] Initial: 0E27; + 829440 bytes downloaded
[09:44:40] Initial: 9AFA; + 839680 bytes downloaded
[09:44:40] Initial: 5A8B; + 849920 bytes downloaded
[09:44:40] Initial: 9D8E; + 860160 bytes downloaded
[09:44:40] Initial: 63B7; + 870400 bytes downloaded
[09:44:40] Initial: 7E7F; + 880640 bytes downloaded
[09:44:40] Initial: CC68; + 890880 bytes downloaded
[09:44:40] Initial: 0C12; + 901120 bytes downloaded
[09:44:40] Initial: EA6C; + 911360 bytes downloaded
[09:44:40] Initial: 07EE; + 921600 bytes downloaded
[09:44:40] Initial: 45B7; + 931840 bytes downloaded
[09:44:40] Initial: F8C7; + 942080 bytes downloaded
[09:44:40] Initial: DEE6; + 952320 bytes downloaded
[09:44:40] Initial: C4DF; + 962560 bytes downloaded
[09:44:40] Initial: 5CEC; + 972800 bytes downloaded
[09:44:40] Initial: C871; + 983040 bytes downloaded
[09:44:40] Initial: F427; + 993280 bytes downloaded
[09:44:40] Initial: F6DF; + 1003520 bytes downloaded
[09:44:40] Initial: 19B3; + 1013760 bytes downloaded
[09:44:40] Initial: 1DE1; + 1024000 bytes downloaded
[09:44:40] Initial: F17C; + 1034240 bytes downloaded
[09:44:40] Initial: A200; + 1044480 bytes downloaded
[09:44:40] Initial: 93DE; + 1054720 bytes downloaded
[09:44:40] Initial: 5E7D; + 1064960 bytes downloaded
[09:44:40] Initial: F350; + 1075200 bytes downloaded
[09:44:40] Initial: C54F; + 1085440 bytes downloaded
[09:44:40] Initial: 4D25; + 1095680 bytes downloaded
[09:44:40] Initial: 1289; + 1105920 bytes downloaded
[09:44:40] Initial: B74E; + 1116160 bytes downloaded
[09:44:40] Initial: EF43; + 1126400 bytes downloaded
[09:44:40] Initial: 6B45; + 1134407 bytes downloaded
[09:44:40] Verifying core Core_78.fah...
[09:44:40] Signature is VALID
[09:44:40] 
[09:44:40] Trying to unzip core FahCore_78.exe
[09:44:41] Decompressed FahCore_78.exe (3435296 bytes) successfully
[09:44:41] + Core successfully engaged
[09:44:41] Deleting current work unit & continuing...
[09:45:00] Trying to send all finished work units
[09:45:00] + No unsent completed units remaining.
[09:45:00] - Preparing to get new work unit...

~ Fireball0236

Post by sortofageek »

Project: 6501 (Run 4, Clone 1, Gen 88) is a bad WU. I reported it, but didn't get a response confirming that my report was received, so I'm not sure it will stop being assigned.

Post by Fireball0236 »

Seemingly not: I was assigned this WU again twice overnight.

[01:05:44] - Preparing to get new work unit...
[01:05:44] + Attempting to get work packet
[01:05:44] - Connecting to assignment server
[01:05:44] Connecting to http://assign.stanford.edu:8080/
[01:05:46] Posted data.
[01:05:46] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[01:05:46] + News From Folding@Home: Welcome to Folding@Home
[01:05:46] Loaded queue successfully.
[01:05:46] Connecting to http://171.64.65.62:8080/
[01:05:47] Posted data.
[01:05:47] Initial: 0000; - Receiving payload (expected size: 749467)
[01:05:49] - Downloaded at ~365 kB/s
[01:05:50] - Averaged speed for that direction ~280 kB/s
[01:05:50] + Received work.
[01:05:50] Trying to send all finished work units
[01:05:50] + No unsent completed units remaining.
[01:05:50] + Closed connections
[01:05:50] 
[01:05:50] + Processing work unit
[01:05:50] Core required: FahCore_78.exe
[01:05:50] Core found.
[01:05:50] Working on Unit 01 [October 26 01:05:50]
[01:05:50] + Working ...
[01:05:50] - Calling './FahCore_78.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 32215 -version 602'

[01:05:50] 
[01:05:50] *------------------------------*
[01:05:50] Folding@Home Gromacs Core
[01:05:50] Version 1.90 (March 8, 2006)
[01:05:50] 
[01:05:50] Preparing to commence simulation
[01:05:50] - Looking at optimizations...
[01:05:50] - Created dyn
[01:05:50] - Files status OK
[01:05:50] - Expanded 748955 -> 3748137 (decompressed 500.4 percent)
[01:05:50] - Starting from initial work packet
[01:05:50] 
[01:05:50] Project: 6501 (Run 4, Clone 1, Gen 88)
[01:05:50] 
[01:05:50] Assembly optimizations on if available.
[01:05:50] Entering M.D.
[01:05:57] Protein: UBIQUITIN MODEL1250 in water
[01:05:57] 
[01:05:57] Writing local files
[01:05:57] Extra SSE boost OK.
[01:05:57] Writing local files
[01:05:57] Completed 0 out of 250000 steps  (0%)
[01:12:31] Writing local files
[01:12:31] Completed 2500 out of 250000 steps  (1%)
[01:13:22] CoreStatus = 0 (0)
[01:13:22] Client-core communications error: ERROR 0x0
[01:13:22] Deleting current work unit & continuing...
[01:13:41] Trying to send all finished work units
[01:13:41] + No unsent completed units remaining.
[01:13:41] - Preparing to get new work unit...
[01:13:41] + Attempting to get work packet
[01:13:41] - Connecting to assignment server
[01:13:41] Connecting to http://assign.stanford.edu:8080/
[01:13:41] Posted data.
[01:13:41] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[01:13:41] + News From Folding@Home: Welcome to Folding@Home
[01:13:42] Loaded queue successfully.
[01:13:42] Connecting to http://171.64.65.62:8080/
[01:13:43] Posted data.
[01:13:43] Initial: 0000; - Receiving payload (expected size: 749467)
[01:13:45] - Downloaded at ~365 kB/s
[01:13:45] - Averaged speed for that direction ~297 kB/s
[01:13:45] + Received work.
[01:13:45] + Closed connections
[01:13:50] 
[01:13:50] + Processing work unit
[01:13:50] Core required: FahCore_78.exe
[01:13:50] Core found.
[01:13:50] Working on Unit 02 [October 26 01:13:50]
[01:13:50] + Working ...
[01:13:50] - Calling './FahCore_78.exe -dir work/ -suffix 02 -checkpoint 15 -verbose -lifeline 32215 -version 602'

[01:13:50] 
[01:13:50] *------------------------------*
[01:13:50] Folding@Home Gromacs Core
[01:13:50] Version 1.90 (March 8, 2006)
[01:13:50] 
[01:13:50] Preparing to commence simulation
[01:13:50] - Looking at optimizations...
[01:13:50] - Created dyn
[01:13:50] - Files status OK
[01:13:50] - Expanded 748955 -> 3748137 (decompressed 500.4 percent)
[01:13:50] - Starting from initial work packet
[01:13:50] 
[01:13:50] Project: 6501 (Run 4, Clone 1, Gen 88)
[01:13:50] 
[01:13:50] Assembly optimizations on if available.
[01:13:50] Entering M.D.
[01:13:57] Protein: UBIQUITIN MODEL1250 in water
[01:13:57] 
[01:13:57] Writing local files
[01:13:57] Extra SSE boost OK.
[01:13:57] Writing local files
[01:13:57] Completed 0 out of 250000 steps  (0%)
[01:20:31] Writing local files
[01:20:31] Completed 2500 out of 250000 steps  (1%)
[01:21:22] CoreStatus = 0 (0)
[01:21:22] Client-core communications error: ERROR 0x0
[01:21:22] Deleting current work unit & continuing...
[01:21:41] Trying to send all finished work units
[01:21:41] + No unsent completed units remaining.
[01:21:41] - Preparing to get new work unit...
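
To check whether the same WU keeps coming back, something like the rough Python sketch below could tally ERROR 0x0 occurrences per Project/Run/Clone/Gen. It is not part of the Folding@Home client; the function name `count_0x0_failures` and the regular expression are assumptions based only on the line formats visible in the logs above.

```python
import re
from collections import Counter

# Matches lines such as "[09:21:04] Project: 6501 (Run 4, Clone 1, Gen 88)".
# The pattern is an assumption derived from the log excerpts in this thread.
WU_LINE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
ERROR_LINE = "Client-core communications error: ERROR 0x0"


def count_0x0_failures(log_text):
    """Return a Counter mapping (project, run, clone, gen) to ERROR 0x0 count."""
    failures = Counter()
    current_wu = None
    for line in log_text.splitlines():
        match = WU_LINE.search(line)
        if match:
            # Remember which work unit the following lines belong to.
            current_wu = tuple(int(g) for g in match.groups())
        elif ERROR_LINE in line and current_wu is not None:
            failures[current_wu] += 1
            current_wu = None  # the log shows the unit being deleted afterwards
    return failures


# A few lines copied from the log above, used as sample input.
sample = """\
[01:05:50] Project: 6501 (Run 4, Clone 1, Gen 88)
[01:13:22] CoreStatus = 0 (0)
[01:13:22] Client-core communications error: ERROR 0x0
[01:13:50] Project: 6501 (Run 4, Clone 1, Gen 88)
[01:21:22] CoreStatus = 0 (0)
[01:21:22] Client-core communications error: ERROR 0x0
"""

result = count_0x0_failures(sample)
for wu, n in result.items():
    print("Project %d (Run %d, Clone %d, Gen %d): %d x ERROR 0x0" % (*wu, n))
```

In practice the sample string would be replaced by the contents of the client's log file; a WU that shows up with a count above one is a reasonable candidate for a bad-WU report.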

~ Fireball0236

Post by bruce »

I've reported Project: 6501 (Run 4, Clone 1, Gen 88) as a bad WU.