Author | Message |
Dingo
Send message Joined: 13 May 22 Posts: 16 Credit: 1,442,177 RAC: 0
|
I noticed this task that was run on my machine dingo4 and also validated by dingo4. This is not the normal way BOINC works as it could lead to cheating.
Task:
https://boinc.loda-lang.org/loda/workunit.php?wuid=2285
|
|
Sergey Kovalchuk
Send message Joined: 13 May 22 Posts: 14 Credit: 1,206 RAC: 0
|
It's a miner-type application.
Why is a validation quorum set for tasks at all?
What would be compared between two different results?
Example report:
2022-05-14 11:51:03|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b022861.txt.gz
2022-05-14 11:51:07|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b022873.txt.gz
2022-05-14 11:51:07|ALERT|First program for A022873: a(n) = [ a(n-1)/a(1) + a(n-1)/a(2) + ... + a(n-1)/a(n-1) ] for n >= 3. Terms: 2,1,1,2,6,19,61,197,638,2068. Submitted by Sergey Kovalchuk
2022-05-14 11:51:19|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b168810.txt.gz
2022-05-14 11:51:26|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b022867.txt.gz
2022-05-14 11:51:26|ALERT|First program for A022867: a(n) = [ a(n-1)/a(1) + a(n-1)/a(2) + ... + a(n-1)/a(n-1) ] for n >= 3. Terms: 2,2,2,3,5,10,21,45,99,219. Submitted by Sergey Kovalchuk
2022-05-14 11:51:39|INFO |Processed 128566 programs
|
|
Christian Krause Project administrator
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198
|
There is an additional validation on a server (separate from the BOINC infrastructure). I need to look into the BOINC config.
|
|
Catchercradle
Send message Joined: 13 May 22 Posts: 28 Credit: 16,466 RAC: 0
|
The maximum number of attempts for a work unit is, I believe, set when a batch of work is uploaded to the server. I don't know exactly where in the uploaded files it is set, though. I see one of my tasks has _6 at the end of the task name, indicating it is the 7th attempt, which seems a little excessive!
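As a rough illustration of the suffix convention mentioned above (a name ending in _6 being the 7th attempt), the attempt number can be read off a task name as sketched below. The helper name `attempt_number` and the sample name `wu_example_6` are made up for this example, and it assumes the trailing index counts from 0.

```python
def attempt_number(task_name: str) -> int:
    """Return the 1-based attempt number encoded in a task name.

    Assumes the name ends in "_<index>" and that the index counts from 0.
    """
    _, _, suffix = task_name.rpartition("_")
    if not suffix.isdigit():
        raise ValueError(f"no numeric suffix in {task_name!r}")
    return int(suffix) + 1


# Hypothetical task name, used only to show the convention.
print(attempt_number("wu_example_6"))  # prints 7
```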
|
|
Conan
Send message Joined: 13 May 22 Posts: 37 Credit: 3,056,211 RAC: 2,938
|
I am seeing a number of my tasks failing validation due to "Exceeded time limit of 18,000 seconds"
They all run just short of 5 hours.
I have also seen a number of my tasks that fail (probably the "fails validation" tasks) run a lot slower than most of my other tasks.
I have a 12 core 24 thread computer with 32 GB of RAM, at times all 24 threads are using 1 GB of RAM per task.
Perhaps some of my tasks are failing (the ones with the "exit 195" error code) because they need more RAM but hit a hard limit, so they keep stopping and starting while trying to get more RAM and run slower than normal?
By running slower they then hit the 18,000 second time limit and fail to validate, or just fail altogether?
Conan
|
|
Conan
Send message Joined: 13 May 22 Posts: 37 Credit: 3,056,211 RAC: 2,938
|
This "reached time limit of 18,000 seconds" problem is becoming an issue. I have now had 33 validation failures because of it.
On this new host I downloaded 262 work units, 151 have so far completed normally, 33 have validation fails and 27 have the "exit 195 error".
So of the 211 work units processed so far, 60 have failed (28%). That is a very high failure rate, don't you think?
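For reference, the arithmetic behind that percentage can be checked with a few lines, using only the counts quoted above (the variable names are just labels for this sketch):

```python
# Counts taken from the post above.
completed_ok = 151
validation_fails = 33
exit_195_errors = 27

failed = validation_fails + exit_195_errors
processed = completed_ok + failed
failure_rate = failed / processed

print(f"{failed} of {processed} failed ({failure_rate:.1%})")
# prints: 60 of 211 failed (28.4%)
```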
I noticed that the work unit seems to know from just after it starts that it is going to run a long time as the estimated run time is higher than a normal work unit.
Also, after reaching a certain percentage done (anywhere from less than 1% to over 97%), the percentage no longer advances even though the work unit is using the CPU and appears to be working.
Conan
|
|
Christian Krause Project administrator
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198
|
We have fixed the "reached time limit of 18,000 seconds" issue in the new app version 220610. It should not occur anymore. Please let us know in case you still see validation errors.
|
|
Conan
Send message Joined: 13 May 22 Posts: 37 Credit: 3,056,211 RAC: 2,938
|
Thanks for that, Christian. I currently have a number of tasks from the older version 2022.06; when they flush through I will test the 2022.10 version.
Thanks
Conan
[PS] Just an aside: I have seen random work units run to over 8 hours, and one even validated, so they don't all stop at around 4 hours 57 minutes 33 seconds (just under the 18,000-second limit).
|
|
Werinbert
Send message Joined: 14 May 22 Posts: 7 Credit: 100,055 RAC: 0
|
I am still getting error "reached time limit" on my 2206.10 tasks...
|
|
Werinbert
Send message Joined: 14 May 22 Posts: 7 Credit: 100,055 RAC: 0
|
I find it interesting that many of my time limit errors are pretty much the same length of time as my tasks that validate.
|
|
Conan
Send message Joined: 13 May 22 Posts: 37 Credit: 3,056,211 RAC: 2,938
|
I am still getting error "reached time limit" on my 2206.10 tasks...
I just got one on this WU
Timed out at 11,159.30 seconds.
Not the 18,000 seconds I had before, just 3 hours 6 minutes instead of 4 hours 57 minutes.
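To make the time comparisons easier to check, here is a small sketch that converts a limit in seconds to hours, minutes and seconds. The helper name `hms` is just for this example. Note that 18,000 seconds is exactly 5 hours.

```python
def hms(seconds: float) -> str:
    """Format a duration in seconds as 'Hh MMm SSs' (fraction dropped)."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours}h {minutes:02d}m {secs:02d}s"


print(hms(11159.30))  # prints 3h 05m 59s
print(hms(18000))     # prints 5h 00m 00s
```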
So the 2206.10 tasks still have issues.
Conan
[PS] I also had a 2206.10 task that started, got to 0.100%, and was still at 0.100% after 2 1/2 hours with 75 days to go, so I aborted that one.
|
|
Christian Krause Project administrator
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198
|
App version 2206.11 is available with improved configuration. We'll monitor the results.
|
|
DaveW
Send message Joined: 3 Jun 22 Posts: 21 Credit: 100,051 RAC: 0
|
|
|
[AF] Kalianthys
Send message Joined: 14 May 22 Posts: 3 Credit: 6,383,496 RAC: 1
|
App version 2206.11 is available with improved configuration. We'll monitor the results.
This version has too high an error rate.
see here : https://boinc.loda-lang.org/loda/results.php?hostid=727&offset=0&show_names=0&state=6&appid=
<core_client_version>7.16.16</core_client_version>
<![CDATA[
<message>
exceeded elapsed time limit 12233.05 (86400.00G/0.91G)</message>
<stderr_txt>
07:36:55 (694504): wrapper (7.5.26014): starting
07:36:55 (694504): wrapper: running ../../projects/boinc.loda-lang.org_loda/loda-220611-linux-x86 (boinc -H 4)
</stderr_txt>
]]>
Kali
|
|
Christian Krause Project administrator
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198
|
Yes, we also observed timeouts with 220611. To mitigate the issues, we have reduced the task size to 50% in the new app version 220612. We'd appreciate if you try it out. We'll investigate all issues.
|
|
Conan
Send message Joined: 13 May 22 Posts: 37 Credit: 3,056,211 RAC: 2,938
|
Yes, we also observed timeouts with 220611. To mitigate the issues, we have reduced the task size to 50% in the new app version 220612. We'd appreciate if you try it out. We'll investigate all issues.
Thanks Christian,
Running 2206.12 now and have had 3 failures in the first 19 work units.
This one had the same issue that 2206.10 did with the exact same time out time of 11159.30 seconds, a bit like a re-badged work unit but getting the same result.
This WU had the "exit with code 195" issue and I had one other of this type.
But mostly they are running much better than earlier versions, at least so far.
Conan
|
|
Christian Krause Project administrator
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198
|
I found logs only for the first WU that you mentioned. It's strange, because the logs show only 10 mins of runtime. Not sure what happened there.
|
|
Dr Who Fan
Send message Joined: 13 May 22 Posts: 35 Credit: 174,375 RAC: 90
|
Task 447922
Outcome: Validation Error
Application version LODA v2206.12
windows_x86_64
Run time 9 hours 1 min 48 sec
CPU time [BLANK]
Stderr output
<core_client_version>7.16.20</core_client_version>
<![CDATA[
<stderr_txt>
14:16:31 (2120): wrapper (7.7.26016): starting
14:16:31 (2120): wrapper: running ../../projects/boinc.loda-lang.org_loda/loda-220612-windows.exe (boinc -H 2)
00:28:34 (2120): task loda reached time limit 36000
00:28:34 (2120): called boinc_finish(0)
</stderr_txt>
]]>
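The stderr timestamps above cross midnight, so the wall-clock gap between the wrapper start and the time-limit message is easy to misread. A minimal sketch for working it out follows. It assumes the timestamps are local wall-clock times and that less than 24 hours passed; the function name `elapsed` is made up for this example. The wall-clock gap need not match the reported run time, for example if the task was suspended for a while.

```python
from datetime import datetime, timedelta


def elapsed(start: str, end: str) -> timedelta:
    """Wall-clock time between two HH:MM:SS stamps, allowing one midnight wrap."""
    fmt = "%H:%M:%S"
    t0 = datetime.strptime(start, fmt)
    t1 = datetime.strptime(end, fmt)
    if t1 < t0:
        # The end stamp is on the following day.
        t1 += timedelta(days=1)
    return t1 - t0


delta = elapsed("14:16:31", "00:28:34")
print(delta, int(delta.total_seconds()))  # prints 10:12:03 36723
```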
Logs for wu_1655043173_6700_0_r1460745107_0
2022-06-13 14:16:31|INFO |Starting LODA v22.6.12. See https://loda-lang.org/
2022-06-13 14:16:31|INFO |Platform: windows, user name: Dr Who Fan
2022-06-13 14:16:31|INFO |Using LODA home directory "C:\BOINCData/projects/boinc.loda-lang.org_loda\"
2022-06-13 14:16:31|INFO |Checking environment
2022-06-13 14:16:31|WARN |Setting environment variable: COMSPEC=C:\WINDOWS\system32\cmd.exe
2022-06-13 14:16:31|WARN |Setting environment variable: SYSTEMROOT=C:\WINDOWS
2022-06-13 14:16:31|WARN |Setting environment variable: PATH=C:\WINDOWS\system32;C:\Program Files\Git\cmd;C:\Program Files\Git\usr\bin
2022-06-13 14:16:31|WARN |Setting environment variable: TMP=C:\BOINCData/projects/boinc.loda-lang.org_loda\
2022-06-13 14:16:31|WARN |Setting environment variable: TEMP=C:\BOINCData/projects/boinc.loda-lang.org_loda\
2022-06-13 14:16:43|INFO |Fetched https://raw.githubusercontent.com/loda-lang/loda-cpp/main/miners.default.json
2022-06-13 14:16:43|WARN |Sequence list not found: C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\full_check.txt
2022-06-13 14:16:43|INFO |Updating OEIS index (last update 3 days ago)
2022-06-13 14:17:12|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/stripped.gz
2022-06-13 14:17:22|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/names.gz
2022-06-13 14:17:23|INFO |Updating programs repository
Updating files: 36% (552/1531)
Updating files: 37% (567/1531)
Updating files: 38% (582/1531)
Updating files: 39% (598/1531)
Updating files: 40% (613/1531)
Updating files: 41% (628/1531)
Updating files: 42% (644/1531)
Updating files: 43% (659/1531)
Updating files: 44% (674/1531)
Updating files: 45% (689/1531)
Updating files: 46% (705/1531)
Updating files: 47% (720/1531)
Updating files: 48% (735/1531)
Updating files: 49% (751/1531)
Updating files: 50% (766/1531)
Updating files: 51% (781/1531)
Updating files: 52% (797/1531)
Updating files: 53% (812/1531)
Updating files: 54% (827/1531)
Updating files: 55% (843/1531)
Updating files: 56% (858/1531)
Updating files: 57% (873/1531)
Updating files: 58% (888/1531)
Updating files: 59% (904/1531)
Updating files: 60% (919/1531)
Updating files: 61% (934/1531)
Updating files: 62% (950/1531)
Updating files: 63% (965/1531)
Updating files: 64% (980/1531)
Updating files: 65% (996/1531)
Updating files: 66% (1011/1531)
Updating files: 67% (1026/1531)
Updating files: 68% (1042/1531)
Updating files: 69% (1057/1531)
Updating files: 70% (1072/1531)
Updating files: 71% (1088/1531)
Updating files: 71% (1097/1531)
Updating files: 72% (1103/1531)
Updating files: 73% (1118/1531)
Updating files: 74% (1133/1531)
Updating files: 75% (1149/1531)
Updating files: 76% (1164/1531)
Updating files: 77% (1179/1531)
Updating files: 78% (1195/1531)
Updating files: 79% (1210/1531)
Updating files: 80% (1225/1531)
Updating files: 81% (1241/1531)
Updating files: 81% (1242/1531)
Updating files: 82% (1256/1531)
Updating files: 83% (1271/1531)
Updating files: 84% (1287/1531)
Updating files: 85% (1302/1531)
Updating files: 86% (1317/1531)
Updating files: 87% (1332/1531)
Updating files: 88% (1348/1531)
Updating files: 89% (1363/1531)
Updating files: 90% (1378/1531)
Updating files: 91% (1394/1531)
Updating files: 92% (1409/1531)
Updating files: 92% (1411/1531)
Updating files: 93% (1424/1531)
Updating files: 94% (1440/1531)
Updating files: 95% (1455/1531)
Updating files: 96% (1470/1531)
Updating files: 97% (1486/1531)
Updating files: 98% (1501/1531)
Updating files: 99% (1516/1531)
Updating files: 100% (1531/1531)
Updating files: 100% (1531/1531), done.
2022-06-13 14:17:49|INFO |Cleaning up local programs directory
2022-06-13 14:17:49|INFO |Removed 45 old local programs
2022-06-13 14:17:49|INFO |Loading sequences from the OEIS index
2022-06-13 14:17:56|INFO |Loaded 333474/354498 sequences in 7.10s
2022-06-13 14:17:56|INFO |Regenerating program stats (last update 3 days ago)
2022-06-13 14:18:16|INFO |Processed 2049 programs
2022-06-13 14:18:37|INFO |Processed 4130 programs
2022-06-13 14:18:59|INFO |Processed 5574 programs
2022-06-13 14:19:19|INFO |Processed 7687 programs
2022-06-13 14:19:40|INFO |Processed 9897 programs
2022-06-13 14:20:00|INFO |Processed 11973 programs
2022-06-13 14:20:20|INFO |Processed 14043 programs
2022-06-13 14:20:40|INFO |Processed 16158 programs
2022-06-13 14:21:00|INFO |Processed 18020 programs
2022-06-13 14:21:22|INFO |Processed 19565 programs
2022-06-13 14:21:42|INFO |Processed 20152 programs
2022-06-13 14:22:02|INFO |Processed 21380 programs
2022-06-13 14:22:22|INFO |Processed 23332 programs
2022-06-13 14:22:43|INFO |Processed 25360 programs
2022-06-13 14:23:03|INFO |Processed 27279 programs
2022-06-13 14:23:23|INFO |Processed 29164 programs
2022-06-13 14:23:43|INFO |Processed 31018 programs
2022-06-13 14:24:04|INFO |Processed 32905 programs
2022-06-13 14:24:24|INFO |Processed 34771 programs
2022-06-13 14:24:44|INFO |Processed 36679 programs
2022-06-13 14:25:04|INFO |Processed 37815 programs
2022-06-13 14:25:24|INFO |Processed 39723 programs
2022-06-13 14:25:44|INFO |Processed 41610 programs
2022-06-13 14:26:05|INFO |Processed 43537 programs
2022-06-13 14:26:25|INFO |Processed 45454 programs
2022-06-13 14:26:45|INFO |Processed 47234 programs
2022-06-13 14:27:05|INFO |Processed 49131 programs
2022-06-13 14:27:25|INFO |Processed 51053 programs
2022-06-13 14:27:46|INFO |Processed 52272 programs
2022-06-13 14:28:08|INFO |Processed 53696 programs
2022-06-13 14:28:28|INFO |Processed 55506 programs
2022-06-13 14:28:48|INFO |Processed 57396 programs
2022-06-13 14:29:08|INFO |Processed 59309 programs
2022-06-13 14:29:28|INFO |Processed 61200 programs
2022-06-13 14:29:49|INFO |Processed 63085 programs
2022-06-13 14:30:09|INFO |Processed 64348 programs
2022-06-13 14:30:29|INFO |Processed 66097 programs
2022-06-13 14:30:49|INFO |Processed 67909 programs
2022-06-13 14:31:09|INFO |Processed 69736 programs
2022-06-13 14:31:30|INFO |Processed 71580 programs
2022-06-13 14:31:50|INFO |Processed 73400 programs
2022-06-13 14:32:10|INFO |Processed 75247 programs
2022-06-13 14:32:30|INFO |Processed 77082 programs
2022-06-13 14:32:50|INFO |Processed 78215 programs
2022-06-13 14:33:10|INFO |Processed 80006 programs
2022-06-13 14:33:31|INFO |Processed 81946 programs
2022-06-13 14:33:51|INFO |Processed 83831 programs
2022-06-13 14:34:11|INFO |Processed 85734 programs
2022-06-13 14:34:31|INFO |Processed 87683 programs
2022-06-13 14:35:08|INFO |Finished stats generation for 87981 programs
2022-06-13 14:35:11|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\000\A000143.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\018\A018836.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\018\A018842.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\025\A025861.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\025\A025863.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\025\A025897.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\025\A025922.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\029\A029066.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\029\A029068.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\029\A029106.asm
2022-06-13 14:35:12|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\029\A029131.asm
2022-06-13 14:35:13|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\045\A045145.asm
2022-06-13 14:35:14|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\098\A098389.asm
2022-06-13 14:35:20|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\280\A280258.asm
2022-06-13 14:35:20|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\280\A280259.asm
2022-06-13 14:35:20|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\285\A285811.asm
2022-06-13 14:35:22|WARN |Recursion detected in stats for C:\BOINCData/projects/boinc.loda-lang.org_loda\programs\oeis\341\A341397.asm
2022-06-13 14:35:23|INFO |Initialized 5 matchers (ignoring 39157 sequences)
2022-06-13 14:35:23|INFO |Initialized 4 generators (profile: update, overwrite: auto)
2022-06-13 14:35:24|INFO |Mining programs in client mode (extended validation mode)
2022-06-13 14:35:24|INFO |Processed 1 programs
2022-06-13 14:35:26|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b265278.txt.gz
2022-06-13 14:36:30|INFO |Processed 2 programs
2022-06-13 14:37:52|INFO |Processed 19 programs
2022-06-13 14:37:52|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b075818.txt.gz
2022-06-13 14:38:11|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b106138.txt.gz
2022-06-13 14:38:29|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b106146.txt.gz
2022-06-13 14:38:50|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b081147.txt.gz
2022-06-13 14:39:01|INFO |Processed 11 programs
2022-06-13 14:39:01|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b188187.txt.gz
2022-06-13 14:40:09|INFO |Processed 11 programs
2022-06-13 14:40:47|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b018012.txt.gz
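A log like the one above can be scanned for long silent periods, which may help tell a slow validation apart from a stalled process. The sketch below assumes the `timestamp|LEVEL|message` layout seen in these lines and skips anything else (such as the "Updating files" progress lines). The function names are made up for this example.

```python
from datetime import datetime


def parse_line(line: str):
    """Split 'YYYY-MM-DD HH:MM:SS|LEVEL|message'; return None if it does not match."""
    parts = line.split("|", 2)
    if len(parts) != 3:
        return None
    try:
        stamp = datetime.strptime(parts[0], "%Y-%m-%d %H:%M:%S")
    except ValueError:
        return None
    return stamp, parts[1].strip(), parts[2]


def largest_gap(lines):
    """Return (gap_seconds, message_before_gap) for the longest silence."""
    entries = [p for p in map(parse_line, lines) if p is not None]
    best = (0.0, "")
    for (t0, _, msg), (t1, _, _) in zip(entries, entries[1:]):
        gap = (t1 - t0).total_seconds()
        if gap > best[0]:
            best = (gap, msg)
    return best


# A few lines copied from the log above.
sample = [
    "2022-06-13 14:35:24|INFO |Processed 1 programs",
    "2022-06-13 14:35:26|INFO |Fetched http://api.loda-lang.org/miner/v1/oeis/b265278.txt.gz",
    "2022-06-13 14:36:30|INFO |Processed 2 programs",
    "2022-06-13 14:37:52|INFO |Processed 19 programs",
]
print(largest_gap(sample))  # prints (82.0, 'Processed 2 programs')
```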
|
|
DaveW
Send message Joined: 3 Jun 22 Posts: 21 Credit: 100,051 RAC: 0
|
Surely you didn't need to post all that.
|
|
Christian Krause Project administrator
Send message Joined: 9 May 22 Posts: 250 Credit: 449,267 RAC: 198
|
Hi Dr Who Fan,
Thanks for sharing the details of the WU. Based on the logs, I assume the miner was either busy validating an exceptionally slow program or the process somehow got halted. We'll try to improve our logs and progress reporting so that we can analyze such issues better in the future.
I checked your results so far: 57 successful, 2 validate errors, 2 errors. We'll try to improve the success rate further.
|
|