xref: /freebsd/share/man/man4/divert.4 (revision 24fc79b0a4a82c4090cfb59ca9798079072445f7)
17f3dea24SPeter Wemm.\" $FreeBSD$
20b992c1dSWolfram Schneider.\"
324fc79b0SAndre Oppermann.Dd October 22, 2004
493e0e116SJulian Elischer.Dt DIVERT 4
53d45e180SRuslan Ermilov.Os
693e0e116SJulian Elischer.Sh NAME
793e0e116SJulian Elischer.Nm divert
893e0e116SJulian Elischer.Nd kernel packet diversion mechanism
993e0e116SJulian Elischer.Sh SYNOPSIS
1032eef9aeSRuslan Ermilov.In sys/types.h
1132eef9aeSRuslan Ermilov.In sys/socket.h
1232eef9aeSRuslan Ermilov.In netinet/in.h
1393e0e116SJulian Elischer.Ft int
1493e0e116SJulian Elischer.Fn socket PF_INET SOCK_RAW IPPROTO_DIVERT
1593e0e116SJulian Elischer.Sh DESCRIPTION
1693e0e116SJulian ElischerDivert sockets are similar to raw IP sockets, except that they
1793e0e116SJulian Elischercan be bound to a specific
1893e0e116SJulian Elischer.Nm
1993e0e116SJulian Elischerport via the
2093e0e116SJulian Elischer.Xr bind 2
216d249eeeSSheldon Hearnsystem call.
226d249eeeSSheldon HearnThe IP address in the bind is ignored; only the port
2393e0e116SJulian Elischernumber is significant.
2493e0e116SJulian ElischerA divert socket bound to a divert port will receive all packets diverted
2593e0e116SJulian Elischerto that port by some (here unspecified) kernel mechanism(s).
2693e0e116SJulian ElischerPackets may also be written to a divert port, in which case they
2793e0e116SJulian Elischerre-enter kernel IP packet processing.
2893e0e116SJulian Elischer.Pp
2993e0e116SJulian ElischerDivert sockets are normally used in conjunction with
30b5c508fbSRuslan Ermilov.Fx Ns 's
31b5c508fbSRuslan Ermilovpacket filtering implementation and the
3293e0e116SJulian Elischer.Xr ipfw 8
336d249eeeSSheldon Hearnprogram.
346d249eeeSSheldon HearnBy reading from and writing to a divert socket, matching packets
3593e0e116SJulian Elischercan be passed through an arbitrary ``filter'' as they travel through
3693e0e116SJulian Elischerthe host machine, special routing tricks can be done, etc.
3793e0e116SJulian Elischer.Sh READING PACKETS
3893e0e116SJulian ElischerPackets are diverted either as they are ``incoming'' or ``outgoing.''
3993e0e116SJulian ElischerIncoming packets are diverted after reception on an IP interface,
4093e0e116SJulian Elischerwhereas outgoing packets are diverted before next hop forwarding.
4193e0e116SJulian Elischer.Pp
4293e0e116SJulian ElischerDiverted packets may be read unaltered via
4393e0e116SJulian Elischer.Xr read 2 ,
4493e0e116SJulian Elischer.Xr recv 2 ,
4593e0e116SJulian Elischeror
4693e0e116SJulian Elischer.Xr recvfrom 2 .
4793e0e116SJulian ElischerIn the latter case, the address returned will have its port set to
4871678683SGiorgos Keramidassome tag supplied by the packet diverter, (usually the ipfw rule number)
499355ecfcSJulian Elischerand the IP address set to the (first) address of
5009b4b086SMike Pritchardthe interface on which the packet was received (if the packet
5193e0e116SJulian Elischerwas incoming) or
5293e0e116SJulian Elischer.Dv INADDR_ANY
53a10c9747SDaniel Harris(if the packet was outgoing).
54a10c9747SDaniel HarrisThe interface name (if defined
55a10c9747SDaniel Harrisfor the packet) will be placed in the 8 bytes following the address,
56a10c9747SDaniel Harrisif it fits.
5793e0e116SJulian Elischer.Sh WRITING PACKETS
5893e0e116SJulian ElischerWriting to a divert socket is similar to writing to a raw IP socket;
5993e0e116SJulian Elischerthe packet is injected ``as is'' into the normal kernel IP packet
6093e0e116SJulian Elischerprocessing and minimal error checking is done.
6193e0e116SJulian ElischerPackets are written as either incoming or outgoing:
6293e0e116SJulian Elischerif
6393e0e116SJulian Elischer.Xr write 2
6493e0e116SJulian Elischeror
6593e0e116SJulian Elischer.Xr send 2
6693e0e116SJulian Elischeris used to deliver the packet, or if
6793e0e116SJulian Elischer.Xr sendto 2
6893e0e116SJulian Elischeris used with a destination IP address of
6993e0e116SJulian Elischer.Dv INADDR_ANY ,
7093e0e116SJulian Elischerthen the packet is treated as if it were outgoing, i.e., destined
71b5e7e999SRuslan Ermilovfor a non-local address.
72b5e7e999SRuslan ErmilovOtherwise, the packet is assumed to be
7393e0e116SJulian Elischerincoming and full packet routing is done.
7493e0e116SJulian Elischer.Pp
7593e0e116SJulian ElischerIn the latter case, the
76436c7212SJulian ElischerIP address specified must match the address of some local interface,
77436c7212SJulian Elischeror an interface name
786d249eeeSSheldon Hearnmust be found after the IP address.
796d249eeeSSheldon HearnIf an interface name is found,
80436c7212SJulian Elischerthat interface will be used and the value of the IP address will be
81436c7212SJulian Elischerignored (other than the fact that it is not
8294ba280cSRuslan Ermilov.Dv INADDR_ANY ) .
83b5e7e999SRuslan ErmilovThis is to indicate on which interface the packet
84b5e7e999SRuslan Ermilov.Dq arrived .
8593e0e116SJulian Elischer.Pp
8693e0e116SJulian ElischerNormally, packets read as incoming should be written as incoming;
87b5e7e999SRuslan Ermilovsimilarly for outgoing packets.
88b5e7e999SRuslan ErmilovWhen reading and then writing back
8993e0e116SJulian Elischerpackets, passing the same socket address supplied by
9093e0e116SJulian Elischer.Xr recvfrom 2
9193e0e116SJulian Elischerunmodified to
9293e0e116SJulian Elischer.Xr sendto 2
939355ecfcSJulian Elischersimplifies things (see below).
949355ecfcSJulian Elischer.Pp
959355ecfcSJulian ElischerThe port part of the socket address passed to the
969355ecfcSJulian Elischer.Xr sendto 2
976d249eeeSSheldon Hearncontains a tag that should be meaningful to the diversion module.
986d249eeeSSheldon HearnIn the
999355ecfcSJulian Elischercase of
100f38ca148SRuslan Ermilov.Xr ipfw 8
101f38ca148SRuslan Ermilovthe tag is interpreted as the rule number
1029355ecfcSJulian Elischer.Em after which
1039355ecfcSJulian Elischerrule processing should restart.
10493e0e116SJulian Elischer.Sh LOOP AVOIDANCE
105d7ec3e91SRuslan ErmilovPackets written into a divert socket
106c4d9468eSRuslan Ermilov(using
107c4d9468eSRuslan Ermilov.Xr sendto 2 )
108d7ec3e91SRuslan Ermilovre-enter the packet filter at the rule number
1099355ecfcSJulian Elischerfollowing the tag given in the port part of the socket address, which
1109355ecfcSJulian Elischeris usually already set at the rule number that caused the diversion
1115203edcdSRuslan Ermilov(not the next rule if there are several at the same number).
1125203edcdSRuslan ErmilovIf the 'tag'
1139355ecfcSJulian Elischeris altered to indicate an alternative re-entry point, care should be taken
1149355ecfcSJulian Elischerto avoid loops, where the same packet is diverted more than once at the
1159355ecfcSJulian Elischersame rule.
11693e0e116SJulian Elischer.Sh DETAILS
11793e0e116SJulian ElischerTo enable divert sockets, your kernel must be compiled with the option
11824fc79b0SAndre Oppermann.Dv IPDIVERT
11924fc79b0SAndre Oppermannor you have to load the
12024fc79b0SAndre Oppermann.Dv IPDIVERT
12124fc79b0SAndre Oppermannmodule.
12224fc79b0SAndre Oppermann.Pp
12324fc79b0SAndre OppermannYou can load the
12424fc79b0SAndre Oppermann.Dv IPDIVERT
12524fc79b0SAndre Oppermannmodule at runtime by issuing the following command:
12624fc79b0SAndre Oppermann.Bd -literal -offset indent
12724fc79b0SAndre Oppermannkldload ipdivert
12824fc79b0SAndre Oppermann.Ed
12993e0e116SJulian Elischer.Pp
13093e0e116SJulian ElischerIf a packet is diverted but no socket is bound to the
13193e0e116SJulian Elischerport, or if
13293e0e116SJulian Elischer.Dv IPDIVERT
13324fc79b0SAndre Oppermannis not enabled or loaded in the kernel, the packet is dropped.
13493e0e116SJulian Elischer.Pp
13593e0e116SJulian ElischerIncoming packet fragments which get diverted are fully reassembled
13693e0e116SJulian Elischerbefore delivery; the diversion of any one fragment causes the entire
13793e0e116SJulian Elischerpacket to get diverted.
13893e0e116SJulian ElischerIf different fragments divert to different ports,
13993e0e116SJulian Elischerthen which port ultimately gets chosen is unpredictable.
14093e0e116SJulian Elischer.Pp
14113177659SAndre OppermannNote that packets arriving on the divert socket by the
14213177659SAndre Oppermann.Xr ipfw 8
14313177659SAndre Oppermann.Cm tee
14413177659SAndre Oppermannaction are delivered as-is and packet fragments do not get reassembled
14513177659SAndre Oppermannin this case.
14613177659SAndre Oppermann.Pp
14704f36f75SBrian SomersPackets are received and sent unchanged, except that
148dd121c1eSArchie Cobbspackets read as outgoing have invalid IP header checksums, and
14904f36f75SBrian Somerspackets written as outgoing have their IP header checksums overwritten
15093e0e116SJulian Elischerwith the correct value.
15193e0e116SJulian ElischerPackets written as incoming and having incorrect checksums will be dropped.
15293e0e116SJulian ElischerOtherwise, all header fields are unchanged (and therefore in network order).
15393e0e116SJulian Elischer.Pp
15404f36f75SBrian SomersBinding to port numbers less than 1024 requires super-user access, as does
15504f36f75SBrian Somerscreating a socket of type SOCK_RAW.
15693e0e116SJulian Elischer.Sh ERRORS
15793e0e116SJulian ElischerWriting to a divert socket can return these errors, along with
15893e0e116SJulian Elischerthe usual errors possible when writing raw packets:
15993e0e116SJulian Elischer.Bl -tag -width Er
16093e0e116SJulian Elischer.It Bq Er EINVAL
16193e0e116SJulian ElischerThe packet had an invalid header, or the IP options in the packet
16293e0e116SJulian Elischerand the socket options set were incompatible.
16393e0e116SJulian Elischer.It Bq Er EADDRNOTAVAIL
16493e0e116SJulian ElischerThe destination address contained an IP address not equal to
16593e0e116SJulian Elischer.Dv INADDR_ANY
16693e0e116SJulian Elischerthat was not associated with any interface.
16793e0e116SJulian Elischer.El
16893e0e116SJulian Elischer.Sh SEE ALSO
16993e0e116SJulian Elischer.Xr bind 2 ,
1700b992c1dSWolfram Schneider.Xr recvfrom 2 ,
171aab5e1b6SMike Pritchard.Xr sendto 2 ,
1720b992c1dSWolfram Schneider.Xr socket 2 ,
1730b992c1dSWolfram Schneider.Xr ipfw 8
17493e0e116SJulian Elischer.Sh BUGS
17593e0e116SJulian ElischerThis is an attempt to provide a clean way for user mode processes
17693e0e116SJulian Elischerto implement various IP tricks like address translation, but it
17793e0e116SJulian Elischercould be cleaner, and it's too dependent on
17893e0e116SJulian Elischer.Xr ipfw 8 .
17993e0e116SJulian Elischer.Pp
18093e0e116SJulian ElischerIt's questionable whether incoming fragments should be reassembled
1816d249eeeSSheldon Hearnbefore being diverted.
1826d249eeeSSheldon HearnFor example, if only some fragments of a
18393e0e116SJulian Elischerpacket destined for another machine don't get routed through the
1846d249eeeSSheldon Hearnlocal machine, the packet is lost.
1856d249eeeSSheldon HearnThis should probably be
18693e0e116SJulian Elischera settable socket option in any case.
187aaf1f16eSPhilippe Charnier.Sh AUTHORS
188eddc45e7SJeroen Ruigrok van der Werven.An Archie Cobbs Aq archie@FreeBSD.org ,
189aaf1f16eSPhilippe CharnierWhistle Communications Corp.
190