Question: Have the network configuration of OpenCSW been changed?

Dagobert Michelsen dam at opencsw.org
Tue Dec 3 09:15:33 CET 2019


Hi Yusuke,

Am 03.12.2019 um 00:41 schrieb Yusuke Endoh <mame at ruby-lang.org>:
> I'm Yusuke Endoh, a member of Ruby development team.
> 
> Since 2 A.M. 1st Dec. 2019 UTC, our Ruby CIs on OpenCSW have failed.
> The failure looks related to network configuration about localhost.
> 
> https://rubyci.org/logs/rubyci.s3.amazonaws.com/unstable10s/ruby-master/log/20191202T171915Z.fail.html.gz <https://rubyci.org/logs/rubyci.s3.amazonaws.com/unstable10s/ruby-master/log/20191202T171915Z.fail.html.gz>
> 
> ```
> DRb::DRbConnError: drbssl://::1:35909 - #<Errno::ECONNREFUSED: Connection refused - connect(2) for "::1" port 35909>
> ```
> 
> So, I'd like to ask if something network stuff was changed?

We rebooted the machine and some of the networking setup doesn’t come up automatically due to
routing issues between the global zone and local zones. However, I just looked and as far as I can
say it looks good to me.

> If it is an intentional change, I'd be happy to tackle to fix the issue on the side of Ruby test suite.
> I have seen the issue on unstable10s, 10x, 11s, and 11x.

unstable10x, unstable11s and unstable11x have not been rebooted, these are different machines,
so if you see the error there too it is not related due to networking on the rebooted host.

> FWIW, `nslookup localhost` fails to find 127.0.0.1.
> 
> ```
> rubyci at unstable10s [unstable10s]:~ > nslookup localhost                                                
> Server:         192.168.1.6
> Address:        192.168.1.6#53
> 
> ** server can't find localhost: NXDOMAIN
> ```

It is our normal configuration to not resolve localhost via DNS, but you can resolve it locally
with the resolver lib:

root at unstable10s [unstable10s]:/root > getent hosts localhost 
127.0.0.1       localhost
root at unstable10s [unstable10s]:/root > nslookup localhost
Server:         192.168.1.6
Address:        192.168.1.6#53

** server can't find localhost: NXDOMAIN

zsh: 26934 exit 1     nslookup localhost
root at unstable10s [unstable10s]:/root > getent hosts localhost
127.0.0.1       localhost
root at unstable10s [unstable10s]:/root > 


Maybe you can describe what the test actually does and why it technically fails,
I am not sure at the moment what has changed, be it the build machine configuration
or something in your code or something else.


Best regards

  — Dago


-- 
"You don't become great by trying to be great, you become great by wanting to do something,
and then doing it so hard that you become great in the process." - xkcd #896

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opencsw.org/pipermail/buildfarm/attachments/20191203/37a2837b/attachment.html>


More information about the buildfarm mailing list