Questions and Answers :
Bugs :
Are there checkpoints?
Message board moderation
Author | Message |
---|---|
Send message Joined: 13 May 22 Posts: 19 Credit: 146,233 RAC: 0 |
I have restarted my computer several times, and tasks all begin at 00.000% at startup instead of from a checkpoint . Some tasks fail at restart as well. Windows 11 Pro (x64) BOINC 7.16.20 LODA 2205.14 Intel i9-10850 |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
The mining is stateless. Tasks don't need to finish to produce results. All findings are sent immediately to the server. So it is no problem if you restart in between. |
Send message Joined: 13 May 22 Posts: 28 Credit: 16,466 RAC: 0 |
The mining is stateless. Tasks don't need to finish to produce results. All findings are sent immediately to the server. So it is no problem if you restart in between.Except that if you exit BOINC, shutting down the client, the task then starts again from the beginning. It might not be a problem for the project but it does mean that for the tasks I had running yesterday evening, over twelve hours total of computation time was wasted. Adding checkpoints would allow tasks to be resumed from the point they had reached before the com;puter was shut down for the night. |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
Yes, you're right, it's a valid feature request. For now what you could do is to use the BOINC feature "No new tasks" when you plan to shut down your machine. Regarding the checkpoints: we can implement this, but I don't know how to do it in a secure way. Hackers could manipulate checkpoints to do a "fast-forward": it could be used to let miners jump from 0% to 100% without having any work done. I don't know how to prevent this. PS: the tasks should take only about 4h. |
Send message Joined: 17 May 22 Posts: 8 Credit: 1,773,690 RAC: 1 |
In my experience, lack of checkpointing can deter users from participating. I also tend to think that implementing checkpoints shows best regard for volunteer's time and resources. These are just my opinions. I will participate for a while, but as a cruncher who has several computers, I get discouraged when, say a power outage causes several hundred hours of computing to be lost. This has happened to me in the past. It may seem trivial to some, but more important to others. Most mature projects have checkpointing. I do not know how they implement it in a tamper-proof way. If it can't be implemented in LODA, then perhaps shorter task durations would mitigate the loss to volunteers caused by unscheduled reboots. |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
As announced in the last News item, we plan to add checkpoints to LODA. See also this issue on Github: https://github.com/loda-lang/loda-cpp/issues/143 |
Send message Joined: 13 May 22 Posts: 28 Credit: 16,466 RAC: 0 |
As announced in the last News item, we plan to add checkpoints to LODA. See also this issue on Github: I checked link, it says, "No description provided." |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
There is no description in the ticket, but the goal is to implement checkpoints. We plan to add it in the next app version. |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
The new app version (220523) supports checkpointing. |
Send message Joined: 13 May 22 Posts: 9 Credit: 10,580,204 RAC: 3,043 |
The new app version (220523) supports checkpointing. thank you |
Send message Joined: 23 May 22 Posts: 11 Credit: 462,342 RAC: 0 |
So far BoincTasks has yet to indicate a checkpoint has been made on 2205.23 WUs. Sometimes the checkpointing does not report properly but it is actually checkpointing. I've requested Suspend on Checkpoint but have yet to see a WU suspend. How frequent are the checkpoints? |
Send message Joined: 13 May 22 Posts: 18 Credit: 1,192,438 RAC: 1,253 |
Depending on the project, you can try to open the folder containing the files in the slot directory of your task and you may find some txt (or xml) files including more information on the on-going checkpointing. But I don't know how it is managed for LODA (and don't have access to a computer with LODA running on it at the moment). |
Send message Joined: 23 May 22 Posts: 11 Credit: 462,342 RAC: 0 |
Depending on the project, you can try to open the folder containing the files in the slot directory of your task and you may find some txt (or xml) files including more information on the on-going checkpointing. I just clicked through all my slot folders and I didn't find anything for LODA. I also looked through the LODA project folder. |
Send message Joined: 14 May 22 Posts: 7 Credit: 100,055 RAC: 0 |
Checkpointing may have been implemented but tasks are failing if they have been suspended and then restarted. |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
We had an unplanned downtime of our API server. It is now up and running again. Sorry for the inconvenience. |
Send message Joined: 23 May 22 Posts: 11 Credit: 462,342 RAC: 0 |
Still have not seen a checkpoint after restart stabilization. Elapsed Time = Time Since Last Checkpoint. |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
When you restart the BOINC Manager, do the tasks resume from 0% or from where you left off? They should continue from the previous percentage. |
Send message Joined: 23 May 22 Posts: 11 Credit: 462,342 RAC: 0 |
When you restart the BOINC Manager, do the tasks resume from 0% or from where you left off? They should continue from the previous percentage. I suspended 3 WUs: 00:38:47 15.8%, 00:43:48 11.1%, and 00:01:53 0.4%. Then rebooted computer and unsuspended those 3 WUs. They restarted with elapsed time at zero and the same percent complete they suspended with: 00:00:00 15.8%, 00:00:00 11.1%, and 00:00:00 0.4%. Does being "stateless" mean you don't have to write a snapshot of some break-point to storage? |
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198 |
We write checkpoints and it is working also on your machine. The computation of the work units is resumed. The elapsed time is computed by BOINC directly (not our app). |
Send message Joined: 14 May 22 Posts: 6 Credit: 220 RAC: 0 |
We write checkpoints and it is working also on your machine. The computation of the work units is resumed. The elapsed time is computed by BOINC directly (not our app). no! it simply is that silly wrapper which is unable to check this. create a native boinc app and you will be fine. |
©2024 LODA Language