Port 5901, generally by convention Port 5900 + XDisplaynumber, is the TCP Port on which the VNC service listens. That's their relation. Actually you may use arbitrary TCP Ports with arbitrary Xdisplaynumbers.
The VNC service is meant to transports input (mouse, keyboard) from the client to the server and output (an image) from the server to the client. This does not need to have anything to with X. For example in virtualization VNC is often used to make the "physical" terminal of a virtual machine remotely accessible.
The X11 System is much more complex than that and programs use it to interact with the Xserver and with each other. In wording in the X world is a little unintuitive at first so the meaning of output, seat, screen, display, server, client need to be learned and deserve some consideration.
To your question: an Xdisplay is the logical core of any X instance. When you start a program that program is a Xclient and connects to the provided display of an Xserver. :1.0
is actually an abreviation and means "display 1 at localhost". You can provide a hostname or IP-address e.g. DISPLAY=192.168.1.2:3.0 to have the program you are about start connect to Xdisplay 3 at the (remote) computer with the given IP-address.
X and VNC are two totally different services implemented for different purposes with different goals. (One may argue that some/most VNC goals were also goals when developing X, but that's rather academic and very in depth.)
When using Xvncserver both concepts are combined back to back. So there is an Xserver with which Xclients communicate "the normal way" but instead of using the hardware available to interact with a user, the Xvncserverit implements a VNC server, to which a VNC client can connect.