Question: Have the network configuration of OpenCSW been changed?

Dagobert Michelsen dam at opencsw.org
Tue Dec 3 10:43:01 CET 2019


Hi Yusuke,

> Am 03.12.2019 um 10:38 schrieb Yusuke Endoh <mame at ruby-lang.org>:
> 
> Thank you for the answer!
> 
> I suspected the network configuration because:
> 
> * the failures look related to network issue,
> * all OpenCSW machines started failing all at once,
> * the failures have never occurred on any other machines, and
> * I couldn't find any relevant commits whose timestamp is near to that timing.

There have been similar errors in the past which disappeared when I added IPv6 localhost,
but this was added from the start after the reboot, so this is probably not the issue.

> I don't know what the tests do :-)
> To be honest, debugging in OpenCSW is painful because it is slow, so I wanted to isolate the problem before debugging.

I suggest to start with the *x machine which should be reasonably fast as x86 on our VMware farm,
then probably 11s which is a moderately new M3000. The *10s are the slowest ones.


Best regards

  — Dago

> But I understand there has been no network change, so I'll give it a try to debug.  Thank you very much!
> 
> Best regards,
> 
> 2019年12月3日(火) 17:15 Dagobert Michelsen <dam at opencsw.org <mailto:dam at opencsw.org>>:
> Hi Yusuke,
> 
> Am 03.12.2019 um 00:41 schrieb Yusuke Endoh <mame at ruby-lang.org <mailto:mame at ruby-lang.org>>:
>> I'm Yusuke Endoh, a member of Ruby development team.
>> 
>> Since 2 A.M. 1st Dec. 2019 UTC, our Ruby CIs on OpenCSW have failed.
>> The failure looks related to network configuration about localhost.
>> 
>> https://rubyci.org/logs/rubyci.s3.amazonaws.com/unstable10s/ruby-master/log/20191202T171915Z.fail.html.gz <https://rubyci.org/logs/rubyci.s3.amazonaws.com/unstable10s/ruby-master/log/20191202T171915Z.fail.html.gz>
>> 
>> ```
>> DRb::DRbConnError: drbssl://::1:35909 <> - #<Errno::ECONNREFUSED: Connection refused - connect(2) for "::1" port 35909>
>> ```
>> 
>> So, I'd like to ask if something network stuff was changed?
> 
> We rebooted the machine and some of the networking setup doesn’t come up automatically due to
> routing issues between the global zone and local zones. However, I just looked and as far as I can
> say it looks good to me.
> 
>> If it is an intentional change, I'd be happy to tackle to fix the issue on the side of Ruby test suite.
>> I have seen the issue on unstable10s, 10x, 11s, and 11x.
> 
> unstable10x, unstable11s and unstable11x have not been rebooted, these are different machines,
> so if you see the error there too it is not related due to networking on the rebooted host.
> 
>> FWIW, `nslookup localhost` fails to find 127.0.0.1.
>> 
>> ```
>> rubyci at unstable10s [unstable10s]:~ > nslookup localhost                                                
>> Server:         192.168.1.6
>> Address:        192.168.1.6#53
>> 
>> ** server can't find localhost: NXDOMAIN
>> ```
> 
> It is our normal configuration to not resolve localhost via DNS, but you can resolve it locally
> with the resolver lib:
> 
> root at unstable10s [unstable10s]:/root > getent hosts localhost 
> 127.0.0.1       localhost
> root at unstable10s [unstable10s]:/root > nslookup localhost
> Server:         192.168.1.6
> Address:        192.168.1.6#53
> 
> ** server can't find localhost: NXDOMAIN
> 
> zsh: 26934 exit 1     nslookup localhost
> root at unstable10s [unstable10s]:/root > getent hosts localhost
> 127.0.0.1       localhost
> root at unstable10s [unstable10s]:/root > 
> 
> 
> Maybe you can describe what the test actually does and why it technically fails,
> I am not sure at the moment what has changed, be it the build machine configuration
> or something in your code or something else.
> 
> 
> Best regards
> 
>   — Dago
> 
> 
> -- 
> "You don't become great by trying to be great, you become great by wanting to do something,
> and then doing it so hard that you become great in the process." - xkcd #896
> 

-- 
"You don't become great by trying to be great, you become great by wanting to do something,
and then doing it so hard that you become great in the process." - xkcd #896

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opencsw.org/pipermail/buildfarm/attachments/20191203/621579a2/attachment-0001.html>


More information about the buildfarm mailing list