What is "backlog" in TCP connections?

python sockets tcp network-programming udp

50,398

Solution 1

_{NOTE : Answers are framed without having any background in Python, but, the questions are irrelevant to language, to be answered.}

What are these queued connections?

In simple words, the backlog parameter specifies the number of pending connections the queue will hold.

When multiple clients connect to the server, the server then holds the incoming requests in a queue. The clients are arranged in the queue, and the server processes their requests one by one as and when queue-member proceeds. The nature of this kind of connection is called queued connection.

Does it make any difference for client requests? (I mean is the server that is running with socket.listen(5) different from the server that is running with socket.listen(1) in accepting connection requests or in receiving data?)

Yes, both cases are different. The first case would allow only 5 clients to be arranged to the queue; whereas in the case of backlog=1, only 1 connection can be hold in the queue, thereby resulting in the dropping of the further connection request!

Why is the minimum value zero? Shouldn't it be at least 1?

I have no idea about Python, but, as per this source, in C, a backlog argument of 0 may allow the socket to accept connections, in which case the length of the listen queue may be set to an implementation-defined minimum value.

Is there a preferred value?

This question has no well-defined answer. I'd say this depends on the nature of your application, as well as the hardware configurations and software configuration too. Again, as per the source, BackLog is silently limited to between 1 and 5, inclusive(again as per C).

Is this backlog defined for TCP connections only or does it apply for UDP and other protocols too?

NO. Please note that there's no need to listen() or accept() for unconnected datagram sockets(UDP). This is one of the perks of using unconnected datagram sockets!

But, do keep in mind, then there are TCP based datagram socket implementations (called TCPDatagramSocket) too which have backlog parameter.

Solution 2

When TCP connection is being established the so called three-way handshake is performed. Both sides exchange some packets and once they do it this connection is called complete and it is ready to be used by the application.

However this three-way handshake takes some time. And during that time the connection is queued and this is the backlog. So you can set the maximum amount of incomplete parallel connections via .listen(no) call (note that according to the posix standard the value is only a hint, it may be totally ignored). If someone tries to establish a connection above backlog limit the other side will refuse it.

So the backlog limit is about pending connections, not established.

Now higher backlog limit will be better in most cases. Note that the maximum limit is OS dependent, e.g. cat /proc/sys/net/core/somaxconn gives me 128 on my Ubuntu.

Solution 3

The function of the parameter appears to be to limit the number of incoming connect requests a server will retain in a queue assuming it can serve the current request and the small amount of queued pending requests in a reasonable amount of time while under high load. Here's a good paragraph I came against that lends a little context around this argument...

Finally, the argument to listen tells the socket library that we want it to queue up as many as 5 connect requests (the normal max) before refusing outside connections. If the rest of the code is written properly, that should be plenty.

https://docs.python.org/3/howto/sockets.html#creating-a-socket

There's text earlier up in the document that suggests clients should dip in and out of a server so you don't build up a long queue of requests in the first place...

When the connect completes, the socket s can be used to send in a request for the text of the page. The same socket will read the reply, and then be destroyed. That’s right, destroyed. Client sockets are normally only used for one exchange (or a small set of sequential exchanges).

The linked HowTo guide is a must read when getting up to speed on network programming with sockets. It really brings into focus some big picture themes about it. Now how the server socket manages this queue as far as implementation details is another story, probably an interesting one. I suppose the motivation for this design is more telling, without it the barrier for inflicting a denial of service attack would be very very low.

As far as the reason for a minimum value of 0 vs 1, we should keep in mind that 0 is still a valid value, meaning queue up nothing. That is essentially to say let there be no request queue, just reject connections outright if the server socket is currently serving a connection. The point of a currently active connection being served should always be kept in mind in this context, it's the only reason a queue would be of interest in the first place.

This brings us to the next question regarding a preferred value. This is all a design decision, do you want to queue up requests or not? If so you may pick a value you feel is warranted based on expected traffic and known hardware resources I suppose. I doubt there's anything formulaic in picking a value. This makes me wonder how lightweight a request is in the first place that you'd face a penalty in queuing anything up on the server.

UPDATE

I wanted to substantiate the comments from user207421 and went to lookup the python source. Unfortunately this level of detail is not to be found in the sockets.py source but rather in socketmodule.c#L3351-L3382 as of hash 530f506.

The comments are very illuminating, I'll copy the source verbatim below and single out the clarifying comments here which are pretty illuminating...

We try to choose a default backlog high enough to avoid connection drops for common workloads, yet not too high to limit resource usage.

and

If backlog is specified, it must be at least 0 (if it is lower, it is set to 0); it specifies the number of unaccepted connections that the system will allow before refusing new connections. If not specified, a default reasonable value is chosen.

/* s.listen(n) method */

static PyObject *
sock_listen(PySocketSockObject *s, PyObject *args)
{
    /* We try to choose a default backlog high enough to avoid connection drops
     * for common workloads, yet not too high to limit resource usage. */
    int backlog = Py_MIN(SOMAXCONN, 128);
    int res;

    if (!PyArg_ParseTuple(args, "|i:listen", &backlog))
        return NULL;

    Py_BEGIN_ALLOW_THREADS
    /* To avoid problems on systems that don't allow a negative backlog
     * (which doesn't make sense anyway) we force a minimum value of 0. */
    if (backlog < 0)
        backlog = 0;
    res = listen(s->sock_fd, backlog);
    Py_END_ALLOW_THREADS
    if (res < 0)
        return s->errorhandler();
    Py_RETURN_NONE;
}

PyDoc_STRVAR(listen_doc,
"listen([backlog])\n\
\n\
Enable a server to accept connections.  If backlog is specified, it must be\n\
at least 0 (if it is lower, it is set to 0); it specifies the number of\n\
unaccepted connections that the system will allow before refusing new\n\
connections. If not specified, a default reasonable value is chosen.");

Going further down the rabbithole into the externals I trace the following source from socketmodule...

 res = listen(s->sock_fd, backlog);

This source is over at socket.h and socket.c using linux as a concrete platform backdrop for discussion purposes.

/* Maximum queue length specifiable by listen.  */
#define SOMAXCONN   128
extern int __sys_listen(int fd, int backlog);

There's more info to be found in the man page

http://man7.org/linux/man-pages/man2/listen.2.html

int listen(int sockfd, int backlog);

And the corresponding docstring

listen() marks the socket referred to by sockfd as a passive socket, that is, as a socket that will be used to accept incoming connection requests using accept(2).

The sockfd argument is a file descriptor that refers to a socket of type SOCK_STREAM or SOCK_SEQPACKET.

The backlog argument defines the maximum length to which the queue of pending connections for sockfd may grow. If a connection request arrives when the queue is full, the client may receive an error with an indication of ECONNREFUSED or, if the underlying protocol supports retransmission, the request may be ignored so that a later reattempt at connection succeeds.

One additional source identifies the kernel as being responsible for the backlog queue.

The second argument backlog to this function specifies the maximum number of connections the kernel should queue for this socket.

They briefly go on to relate how the unaccepted / queued connections are partitioned in the backlog (a useful figure is included on the linked source).

To understand the backlog argument, we must realize that for a given listening socket, the kernel maintains two queues:

An incomplete connection queue, which contains an entry for each SYN that has arrived from a client for which the server is awaiting completion of the TCP three-way handshake. These sockets are in the SYN_RCVD state (Figure 2.4).

A completed connection queue, which contains an entry for each client with whom the TCP three-way handshake has completed. These sockets are in the ESTABLISHED state (Figure 2.4). These two queues are depicted in the figure below:

When an entry is created on the incomplete queue, the parameters from the listen socket are copied over to the newly created connection. The connection creation mechanism is completely automatic; the server process is not involved.

50,398

jxramos

Bioengineer by education, Software Developer by trade. I have primarily worked on medical instruments doing everything from GUI design, refactoring, machine learning and tool making, computer vision, scripting, and a bit of software architecture for the components and layers I happen to work upon. I work hard to provide for my family who I love without limit and software has been where I was given a first opportunity to contribute to this thing we call industry.

Updated on July 09, 2022

Comments

jxramos almost 2 years
Below, you see a python program that acts as a server listening for connection requests to port 9999:
```
# server.py 
import socket                                         
import time

# create a socket object
serversocket = socket.socket(
            socket.AF_INET, socket.SOCK_STREAM) 

# get local machine name
host = socket.gethostname()                           

port = 9999                                           

# bind to the port
serversocket.bind((host, port))                                  

# queue up to 5 requests
serversocket.listen(5)                                           

while True:
    # establish a connection
    clientsocket,addr = serversocket.accept()      

    print("Got a connection from %s" % str(addr))
    currentTime = time.ctime(time.time()) + "\r\n"
    clientsocket.send(currentTime.encode('ascii'))
    clientsocket.close()
```
The questions is what is the function of the parameter of socket.listen() method (i.e. 5).

Based on the tutorials around the internet:

The backlog argument specifies the maximum number of queued connections and should be at least 0; the maximum value is system-dependent (usually 5), the minimum value is forced to 0.

But:
1. What are these queued connections?
2. Does it make any difference for client requests? (I mean is the server that is running with socket.listen(5) different from the server that is running with socket.listen(1) in accepting connection requests or in receiving data?)
3. Why is the minimum value zero? Shouldn't it be at least 1?
4. Is there a preferred value?
5. Is this backlog defined for TCP connections only or does it apply for UDP and other protocols too?
- Rick over 3 years
  
  This answer explains far more better than the accepted one.
user207421 almost 6 years

The server will refuse it if the server is Windows. TCP on Unix, Linux, etc, simply drops the SYN, which may cause the connecting side to retry, and if the condition persists will cause connection timeout, not refusal.
user207421 almost 6 years

The server will refuse it if the server is Windows. TCP on Unix, Linux, etc, simply drops the SYN, which may cause the connecting side to retry, and if the condition persists will cause connection timeout, not refusal.
Am_I_Helpful almost 6 years

@EJP - I agree with the point you made. But, it should be the connection timeout at the client (connecting) side, but, the server side would be dropping the requests(SYN). Please check if it is clear to you now!
user207421 almost 5 years

And it is not the number of incomplete connections. They are on a different queue. It is the number of completed connections the application hasn't yet accepted. Answer is completely incorrect. See the accepted answer for the truth.
user207421 almost 5 years

Your source is incorrect. The backlog ques is for completed connections. Connect requests go on a different queue, and are moved to the backlog queue when completed.
user207421 almost 5 years

In fact it is a poor quality resource all round. It is wrong about several matters, not just this one. The claim about HTTP only using one transfer per socket is spectacularly incorrect, as is the claim that you're supposed to use shutdown() before closing
user207421 almost 5 years

That's no different from what I wrote, except that you've still left out the Windows case. If you think otherwise please state why.
jxramos almost 5 years

Wow that would indeed change things drastically. I wonder why nothing was ever caught about that document, it's been on the python documentation for sometime. It was written by Gordon McMillan.
jxramos almost 5 years

@user207421 I just looked at the c source, the backlog does appear to be surrounding incoming unaccepted requests.
jxramos almost 5 years

I'm picking up on the terminology you're employing about complete vs incomplete connections. From the similar question asked for c that terminology pops up relating to the three way handshake stuff. I wonder if that's to say that these complete/incomplete connections are a partitioning of the unaccepted connections. I think that's what the user SegFault is saying in their answer.
jxramos almost 5 years

This answer provides a succinct summary: The second argument to listen() function specifies the total queue length for a given listening socket of 2 queues - (1) complete connection queue - 3 way handshake completed for connection (2) incomplete connection queue - SYN received from client waiting for completion of 3 way TCP handshake.
Am_I_Helpful almost 5 years

@user207421 - Would you please quote the line from my post, where you think I've left the Windows case? I have already agreed to the point you made in my previous comment! Also, when I mean connections being dropped, doesn't it cover the refused (not accepted) connection as well? I think anyone can infer that.
jfs about 4 years

it is worth mentioning tcp_abort_on_overflow veithen.io/2014/01/01/how-tcp-backlog-works-in-linux.html
keramat almost 2 years

@Am_I_Helpful Can you elaborate what exactly you mean by pending connections?
Am_I_Helpful almost 2 years

@keramat - As the server can't serve all the connections concurrently, it manages a queue for storing them, to be served later. Those connection requests which are still in the queue and not processed by the server are 'pending'.