Posts by Crystal Pellet

1) Message boards : Number crunching : cancel wu's (Message 153)
Posted 22 Oct 2020 by Crystal Pellet
Post:
We continue regular calculations ...
I suppose your regular calculations are stopped at the first opportunity after 2.5 hours and forced to stop after 3 hours run time.
2) Message boards : Number crunching : cancel wu's (Message 143)
Posted 20 Oct 2020 by Crystal Pellet
Post:
Hi zupa,

All ten tasks of your application 9_Gaia@home are valid and includes cpu times.
http://150.254.66.104/gaiaathome/results.php?hostid=908&offset=0&show_names=0&state=0&appid=96
3) Message boards : Number crunching : cancel wu's (Message 137)
Posted 19 Oct 2020 by Crystal Pellet
Post:
I see. hmm..
I have 1 wu where stop signal break calulation after 1.5 h and credit add to results.
4 out of 16 tasks were stopped after 1.5 hours run time.
4) Message boards : Number crunching : cancel wu's (Message 135)
Posted 19 Oct 2020 by Crystal Pellet
Post:
Results are not reporting their used CPU-time.
5) Message boards : Number crunching : cancel wu's (Message 134)
Posted 19 Oct 2020 by Crystal Pellet
Post:
3_Gaia@home - test for new vesrion of 2_Gaia@home (350 wus) (normal time of calculation: 1h, stop signal 1,5h)
Let's see how these 360 workunits behave.
I've 16 tasks running and 6 ready to start.
6) Message boards : Number crunching : cancel wu's (Message 125)
Posted 19 Oct 2020 by Crystal Pellet
Post:
I will try to change 2_Gaia@home like this:
I will use the kernel signal to terminate after 3h and I will try to save some temporary results so that the program exits properly and you won't lose your credits.
I hope I can do it ...

what do you think about it ?

You could give that a try and I hope those temporary results are still useful for you.
7) Message boards : Number crunching : cancel wu's (Message 120)
Posted 19 Oct 2020 by Crystal Pellet
Post:
2_Gaia@home :
The 2_Gaia@home check system time before start main loop of calculation.
Then, it checks the system time on each loop steps.
The main loop is broken if time difference between start and actual time is greater than 2h.
The 2_Gaia@home is finishing work and prepare results.

The number of lines in the output file is different for different processors.

Also, a surprise for me is why in some cases the loop does not end. :(
These are sporadic cases.
I think I found out why this is happening with 2_Gaia@home

We numerically calculate the motion of the star cluster near the Sun in the gravitational field of the Galaxy.
For each star of cluster we draw clones using the covariance matrix from the Gaia catalog.

Sometimes a random clone requires a very small integration step which increases the computation time for the loop step (2_Gaia@home app).
Unfortunately, we are not able to predict such a situation
:(

So, for us cruchers it's OK to abort tasks running longer than ~3 hours, cause when running longer, you don't get a valid result and we will not get credit for the wasted time.
Problem is that such a task would be sent to another 'victim', so maybe a temporary solution to reduce wasted time
(until you have a better solution within your application) is to reduce the rsc_fpops_bound from 86400000000000 to 21600000000000.
8) Message boards : Number crunching : Computation error: EXIT_TIME_LIMIT_EXCEEDED (Message 118)
Posted 19 Oct 2020 by Crystal Pellet
Post:
Finally the task http://150.254.66.104/gaiaathome/result.php?resultid=2338280 mentioned here

ended with the error EXIT_TIME_LIMIT_EXCEEDED after almost 36 hours runtime.
9) Message boards : Number crunching : cancel wu's (Message 111)
Posted 18 Oct 2020 by Crystal Pellet
Post:
At the moment of the problem, I try to cause the server to interrupt the computation in order to protect the computing time on your processors.
At the moment, it is developing a version of the program, the priority of which will be to protect the computation time (the task will be completed after a certain time, about 2 hours).
This is not happening with this task on my machine:
Application 2_Gaia@home 1.00 
Name 2_7281
State Running
Received Sat 17 Oct 2020 13:38:59 CEST
Report deadline Mon 19 Oct 2020 13:38:58 CEST
Estimated computation size 3,600 GFLOPs
CPU time 1d 01:46:33
CPU time since checkpoint 1d 01:46:33
Elapsed time 1d 01:46:35
Estimated time remaining 00:00:00
Fraction done 100.000%
Virtual memory size 11.54 MB
Working set size 8.90 MB
Directory  slots/1
Process ID 4876
Progress rate 3.960% per hour
Executable 2_Gaia@home[20201017.07]_x86_64-pc-linux-gnu

and it's not (yet) aborted by the server.
10) Message boards : Number crunching : Computation error: EXIT_TIME_LIMIT_EXCEEDED (Message 91)
Posted 13 Oct 2020 by Crystal Pellet
Post:
After 5 hours 26 min 52 sec runtime: http://150.254.66.104/gaiaathome/result.php?resultid=2291547
11) Message boards : Number crunching : App Versions (Message 88)
Posted 13 Oct 2020 by Crystal Pellet
Post:
Same here, all the tasks I had erred out this morning after about 5 hrs run time ...
There must be a very few valids between all those errors, cause the server reports 4 GigaFLOPS average computing for this newest application.
12) Message boards : Number crunching : App Versions (Message 84)
Posted 13 Oct 2020 by Crystal Pellet
Post:
As I can see, all tasks from application 2_Gaia@home[20201012.22] are ending into an error , are completed, waiting for validation or are cancelled by the server ??
13) Questions and Answers : Getting started : Can't create an account or join (Message 82)
Posted 13 Oct 2020 by Crystal Pellet
Post:
Unfortunately, it still doesn't work for new team creation.
For me (us) it worked http://150.254.66.104/gaiaathome/team_display.php?teamid=94 Created 9 Oct 2020
14) Message boards : Number crunching : persistent files (Message 77)
Posted 9 Oct 2020 by Crystal Pellet
Post:
I did 'only' 114 tasks on 1 PC so far and have already loaded 13GB Gaia-project data.
Only 3.5GB left for BOINC and I can't extend the space or move the data.
Since bin-files are not purged after the job, I've to reset the project regulary.




©2024 GAVIP-GC