I always had the impression that you guys are strong with networking stuff. For me it is always just a necessary evil. I'm not stupid, but most network stuff is just not enjoyable for me.
Anyway - I have a pretty nasty issue for the past 2-3 weeks and I am losing it if I cannot solve this shit. I have a Cubietruck (similar to Raspberry Pi 2) with ArchArm and a Desktop with Arch Linux, too. Everything is up and running and ping, ssh and webservices work fine. Owncloud sync is a bit unusual slow with some dropouts (compared to a previous setup with debian on my cubietruck)
BUT transfers of large amounts of data die off after a certain (for me not predictable) amount of time. Imagine the following cases using rsync and ssh:
- rsync ssh-desktop local - on cubietruck - can transport some 100MB - 1GB, dies randomly
- rsync local ssh-desktop - on cubietruck - dies off almost immediately (after 32,765 B)
- rsync ssh-cubie local - on desktop - dies off almost immediately (after some KB)
- rsync local ssh-cubie - on desktop - can transport some 100MB - 1GB, dies randomly
Bottom line is: outgoing transmission from cubietruck is somehow more reliable than an incoming stream. But everything is not at an acceptable state.
Just for completeness, the command looks like this:
rsync -aP user@remoteip:/path/to/file ./
and the thrown error is not very instructive:
rsync: connection unexpectedly closed (983210406 bytes received so far) [receiver]
rsync error: error in rsync protocol data stream (code 12) at io.c(226) [receiver=3.1.2]
rsync: connection unexpectedly closed (8542 bytes received so far) [generator]
rsync error: unexplained error (code 255) at io.c(226) [generator=3.1.2]
Important things to note:
- it is independent of rsync: it also stales with scp
- it is also independent from ssh: transfer also fails with samba
- no difference if iptables is active or not! (no firewall issue)
- no difference if httpd or samba services are running
- no issue with the switch (desktop and cubie are on the same switch, which is again connected to the router) - I tested this via plugging both into the router directly
- I am talking about LAN communication - everything outside the router is not an issue
I tried so many things. Measuring the bandwidth with iperf, look at tcpdump, iftop and netstat output, but I did not find anything that would indicate some issue.
One very interesting observation though: If I trace the route to 8.8.4.4 (googles dns server) with mtr, I get ~30% packet loss from the very first hop once the rsync process is running. There is no loss when the rsync process isn't running.
Instead of dumping any log I could produce in here I am asking you to take me by the hand and walk me through this step by step. It's more about checking that everything is covered, than letting someone else do my work.
I am hoping for some clearance by asking for uninvolved opinions. Maybe I was stupid and have overlooked something very obvious (I tried turning it off and on again).
I am hoping for something simple.
If anything important is missing or something is stupid/unclear - don't hesitate to speak up. I am willing to make it easy for you. let me help you help me! It's getting late over here.
Puppytax:
imguroriginal
