Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Moderators: Site Moderators, FAHC Science Team

Post Reply
anko1
Posts: 438
Joined: Mon Dec 03, 2007 1:31 am
Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional
Location: SF Peninsula

Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Post by anko1 »

This WU completed, and didn't upload, but after the next WU, the program reported no unsent WUs. Unfortunately, I didn't see this until the WU got written over (those 3798s are fast!). Don't know if this is a problem with the new server code being tested, so maybe it should be posted elsewhere?

Code: Select all

[13:52:06] Unit 2 finished with 100 percent of time to deadline remaining.
[13:52:06] Updated performance fraction: 0.998213
[13:52:06] Sending work to server


[13:52:06] + Attempting to send results
[13:52:06] - Reading file work/wuresults_02.dat from core
[13:52:06]   (Read 519090 bytes from disk)
[13:52:06] Connecting to http://171.64.122.139:8080/
[13:52:21] Posted data.
[13:52:21] Initial: 0000; - Uploaded at ~33 kB/s
[13:52:21] - Averaged speed for that direction ~60 kB/s
[13:52:21] + Results successfully sent
[13:52:21] Thank you for your contribution to Folding@Home.
[13:52:21] + Number of Units Completed: 517                                                                   {prior unit (Unit 2) = 517}

[13:52:25] Trying to send all finished work units
[13:52:25] + No unsent completed units remaining.
[13:52:25] - Preparing to get new work unit...
[13:52:25] + Attempting to get work packet
[13:52:25] - Will indicate memory of 2046 MB
[13:52:25] - Connecting to assignment server
[13:52:25] Connecting to http://assign.stanford.edu:8080/
[13:52:25] Posted data.
[13:52:25] Initial: 40AB; - Successful: assigned to (171.64.122.139).
[13:52:25] + News From Folding@Home: Welcome to Folding@Home
[13:52:25] Loaded queue successfully.
[13:52:25] Connecting to http://171.64.122.139:8080/
[13:52:26] Posted data.
[13:52:26] Initial: 0000; - Receiving payload (expected size: 238436)
[13:52:30] - Downloaded at ~58 kB/s
[13:52:30] - Averaged speed for that direction ~169 kB/s
[13:52:30] + Received work.
[13:52:30] Trying to send all finished work units
[13:52:30] + No unsent completed units remaining.
[13:52:30] + Closed connections
[13:52:30] 
[13:52:30] + Processing work unit
[13:52:30] Core required: FahCore_78.exe
[13:52:30] Core found.
[13:52:30] Working on Unit 03 [December 20 13:52:30]
[13:52:30] + Working ...
[13:52:30] - Calling 'FahCore_78.exe -dir work/ -suffix 03 -checkpoint 15 -verbose -lifeline 3988 -version 504'

[13:52:30] 
[13:52:30] *------------------------------*
[13:52:30] Folding@Home Gromacs Core
[13:52:30] Version 1.90 (March 8, 2006)
[13:52:30] 
[13:52:30] Preparing to commence simulation
[13:52:30] - Looking at optimizations...
[13:52:30] - Created dyn
[13:52:30] - Files status OK
[13:52:30] - Expanded 237924 -> 1167708 (decompressed 490.7 percent)
[13:52:30] - Starting from initial work packet
[13:52:30] 
[13:52:30] Project: 3798 (Run 30, Clone 1, Gen 50)
[13:52:30] 
[13:52:30] Assembly optimizations on if available.
[13:52:30] Entering M.D.
[13:52:36] Protein: p3798
[13:52:36] 
[13:52:36] Writing local files
[13:54:18] Extra SSE boost OK.
[13:54:18] Writing local files
[13:54:18] Completed 0 out of 1500 steps  (0)
[13:54:56] Writing local files
[13:54:56] Completed 500 out of 1500 steps  (33)
[13:55:33] Writing local files
[13:55:33] Completed 1000 out of 1500 steps  (67)
[13:56:09] Writing local files
[13:56:09] Completed 1500 out of 1500 steps  (100)
[13:56:09] Writing final coordinates.
[13:56:10] Past main M.D. loop
[13:57:09] 
[13:57:09] Finished Work Unit:
[13:57:09] - Reading up to 550656 from "work/wudata_03.arc": Read 550656
[13:57:10] - Reading up to 0 from "work/wudata_03.xtc": Read 0
[13:57:10] goefile size: 0
[13:57:10] Leaving Run
[13:57:11] - Writing 573212 bytes of core data to disk...
[13:57:11] Done: 572700 -> 518498 (compressed to 90.5 percent)
[13:57:11]   ... Done.
[13:57:11] - Shutting down core
[13:57:11] 
[13:57:11] Folding@home Core Shutdown: FINISHED_UNIT
[13:57:14] CoreStatus = 64 (100)
[13:57:14] Unit 3 finished with 100 percent of time to deadline remaining.
[13:57:14] Updated performance fraction: 0.998242
[13:57:14] Sending work to server


[13:57:14] + Attempting to send results
[13:57:14] - Reading file work/wuresults_03.dat from core
[13:57:14]   (Read 519010 bytes from disk)
[13:57:14] Connecting to http://171.64.122.139:8080/
[14:27:21] - Couldn't send HTTP request to server
[14:27:21] + Could not connect to Work Server (results)
[14:27:21]     (171.64.122.139:8080)
[14:27:21] - Error: Could not transmit unit 03 (completed December 20) to work server.
[14:27:21] - 1 failed uploads of this unit.
[14:27:21]   Keeping unit 03 in queue.
[14:27:21] Trying to send all finished work units


[14:27:21] + Attempting to send results
[14:27:21] - Reading file work/wuresults_03.dat from core
[14:27:21]   (Read 519010 bytes from disk)
[14:27:21] Connecting to http://171.64.122.139:8080/
[14:27:30] Posted data.
[14:27:30] Initial: 0000; - Uploaded at ~56 kB/s
[14:27:30] - Averaged speed for that direction ~59 kB/s
[14:27:30] - Server reports problem with unit.
[14:27:30] + Sent 0 of 1 completed units to the server                                                                        {Unit 3 not sent}
[14:27:30] - Preparing to get new work unit...
[14:27:30] + Attempting to get work packet
[14:27:30] - Will indicate memory of 2046 MB
[14:27:30] - Connecting to assignment server
[14:27:30] Connecting to http://assign.stanford.edu:8080/
[14:27:30] Posted data.
[14:27:30] Initial: 40AB; - Successful: assigned to (171.64.122.139).
[14:27:30] + News From Folding@Home: Welcome to Folding@Home
[14:27:31] Loaded queue successfully.
[14:27:31] Connecting to http://171.64.122.139:8080/
[14:27:31] Posted data.
[14:27:31] Initial: 0000; - Receiving payload (expected size: 239273)
[14:27:33] - Downloaded at ~116 kB/s
[14:27:33] - Averaged speed for that direction ~158 kB/s
[14:27:33] + Received work.
[14:27:33] Trying to send all finished work units
[14:27:33] + No unsent completed units remaining.                                                                                           {?????}
[14:27:33] + Closed connections
[14:27:33] 
[14:27:33] + Processing work unit
[14:27:33] Core required: FahCore_78.exe
[14:27:33] Core found.
[14:27:33] Working on Unit 04 [December 20 14:27:33]
[14:27:33] + Working ...
[14:27:33] - Calling 'FahCore_78.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 3988 -version 504'

[14:27:33] 
[14:27:33] *------------------------------*
[14:27:33] Folding@Home Gromacs Core
[14:27:33] Version 1.90 (March 8, 2006)
[14:27:33] 
[14:27:33] Preparing to commence simulation
[14:27:33] - Looking at optimizations...
[14:27:33] - Created dyn
[14:27:33] - Files status OK
[14:27:33] - Expanded 238761 -> 1167708 (decompressed 489.0 percent)
[14:27:33] - Starting from initial work packet
[14:27:33] 
[14:27:33] Project: 3798 (Run 91, Clone 4, Gen 50)
[14:27:33] 
[14:27:33] Assembly optimizations on if available.
[14:27:33] Entering M.D.
[14:27:39] Protein: p3798
[14:27:39] 
[14:27:39] Writing local files
[14:29:24] Extra SSE boost OK.
[14:29:24] Writing local files
[14:29:24] Completed 0 out of 1500 steps  (0)
[14:29:59] Writing local files
[14:29:59] Completed 500 out of 1500 steps  (33)
[14:30:36] Writing local files
[14:30:36] Completed 1000 out of 1500 steps  (67)
[14:31:12] Writing local files
[14:31:12] Completed 1500 out of 1500 steps  (100)
[14:31:12] Writing final coordinates.
[14:31:12] Past main M.D. loop
[14:32:12] 
[14:32:12] Finished Work Unit:
[14:32:12] - Reading up to 550656 from "work/wudata_04.arc": Read 550656
[14:32:12] - Reading up to 0 from "work/wudata_04.xtc": Read 0
[14:32:12] goefile size: 0
[14:32:12] Leaving Run
[14:32:14] - Writing 573212 bytes of core data to disk...
[14:32:14] Done: 572700 -> 518707 (compressed to 90.5 percent)
[14:32:14]   ... Done.
[14:32:14] - Shutting down core
[14:32:14] 
[14:32:14] Folding@home Core Shutdown: FINISHED_UNIT
[14:32:17] CoreStatus = 64 (100)
[14:32:17] Unit 4 finished with 100 percent of time to deadline remaining.
[14:32:17] Updated performance fraction: 0.998265
[14:32:17] Sending work to server


[14:32:17] + Attempting to send results
[14:32:17] - Reading file work/wuresults_04.dat from core
[14:32:17]   (Read 519219 bytes from disk)
[14:32:17] Connecting to http://171.64.122.139:8080/
[14:32:26] Posted data.
[14:32:26] Initial: 0000; - Uploaded at ~56 kB/s
[14:32:26] - Averaged speed for that direction ~58 kB/s
[14:32:26] + Results successfully sent
[14:32:26] Thank you for your contribution to Folding@Home.
[14:32:26] + Number of Units Completed: 518                                                                         {so Unit 3 not counted}

[14:32:30] Trying to send all finished work units
[14:32:30] + No unsent completed units remaining.
toTOW
Site Moderator
Posts: 6429
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Post by toTOW »

Hi anko1 (team 47815),
Your WU (P3798 R30 C1 G50) was added to the stats database on 2008-12-20 06:18:29 for 5 points of credit.

There is something strange with this WU : it has been completed successfully many, many times ... there's might be an issue in the new server code :(
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
anko1
Posts: 438
Joined: Mon Dec 03, 2007 1:31 am
Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional
Location: SF Peninsula

Re: Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Post by anko1 »

Thanks for checking. I'm surprised, but not complaining ;-) , to get the points since it never showed as being received.
anko1
Posts: 438
Joined: Mon Dec 03, 2007 1:31 am
Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional
Location: SF Peninsula

Re: Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Post by anko1 »

I wonder if for some reason the code doesn't go beyond Gen 50. I just noticed that I did 3798 pretty much solid from 12/20 6:38 till 12/26 12/24 1:20 (about 285 WUs) and they were all Gen 50. Not impossible, but improbable?
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Post by bruce »

anko1 wrote:This WU completed, and didn't upload, but after the next WU, the program reported no unsent WUs. Unfortunately, I didn't see this until the WU got written over (those 3798s are fast!).
The client does not overwrite data in the queue unless it has no choice -- such as when all 10 queue positions contain un-uploaded WUs -- which is highly improbable.
anko1
Posts: 438
Joined: Mon Dec 03, 2007 1:31 am
Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional
Location: SF Peninsula

Re: Project: 3798 (Run 30, Clone 1, Gen 50) - Lost WU?

Post by anko1 »

Thanks for the info, Bruce. There wasn't anything there, so I assumed it got written over, but I guess the queue was empty b/c the unit actually got sent.
Post Reply