BB Unix Network Monitor - Message

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: {bb} Problems monitoring SMTP Service



Hi Philip,


OK. This doesn't look like a server rejecting the mail. It would appear to be a connection problem. Can you confirm a couple of things:

1. Can you check in the mail log that you can see a connection
from your BBNET machine that corresponds to the start of
a period that BB reports the machine down. I wouldn't rely
too heavily on the timestamps here unless the machines are
perfectly synchronized. Just follow the BBNET connections
in the log and see if there is a gap.



One example from this night:


Aug 31 00:52:08 smtpmachine postfix/smtpd[17060]: connect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:52:08 smtpmachine postfix/smtpd[17060]: disconnect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:52:26 smtpmachine postfix/smtpd[17060]: connect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:52:26 smtpmachine postfix/smtpd[17060]: disconnect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:52:40 smtpmachine postfix/smtpd[17060]: connect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:52:40 smtpmachine postfix/smtpd[17060]: disconnect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:56:48 smtpmachine postfix/smtpd[17208]: connect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:56:48 smtpmachine postfix/smtpd[17208]: disconnect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:56:51 smtpmachine postfix/smtpd[17208]: connect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:56:51 smtpmachine postfix/smtpd[17208]: disconnect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:56:53 smtpmachine postfix/smtpd[17208]: connect from bbnet.domain.tld[xxx.xxx.xxx.xxx]
Aug 31 00:56:53 smtpmachine postfix/smtpd[17208]: disconnect from bbnet.domain.tld[xxx.xxx.xxx.xxx]



At 00:52:07 BBDISPLAY claims smtpmachine's smtp service to be down for 0:04:45. The times should be synchronized quite well since they all synchronize daily against the same ntp server.


2. Can you also give us some idea of the type of hardware that
connects the subnets which are being traversed. There are
some things that will handle TCP and ICMP differently. It's
possible that "ping" is always accepted and "smtp"
occasionally blocked.


It is "the internet", simply. Is is two different colocation centers with very different upstream providers.

3. Are you testing anything on the mail servers other than
"conn" and "smtp"? If so, do these test also report
problems?


Yes, on smtpmachine ssh is tested as well. It also has regular problems, but not that much. With smtp they occur every 20 to 120 minutes, with ssh every few days. And the ssh service does not get a red "down" dot, but a black "unavailable".

Thanks for your help.

Dirk
--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-==-=-=-=-=-=-=-=-=-=-=-=
To unsubscribe from this list, or to subscribe to the bb-digest list
send e-mail to mailto:majordomo@bb4.com with unsubscribe bb -and/or-
subscribe bb-digest in the BODY of the message.


Home | Main Index | Thread Index