xref: /freebsd/sbin/ipfw/ipfw.8 (revision 1dfcff294e44d4b45813288ef4095c36abb22f0e)
1.\"
2.\" $FreeBSD$
3.\"
4.Dd June 4, 2022
5.Dt IPFW 8
6.Os
7.Sh NAME
8.Nm ipfw , dnctl
9.Nd User interface for firewall, traffic shaper, packet scheduler,
10in-kernel NAT.
11.Sh SYNOPSIS
12.Ss FIREWALL CONFIGURATION
13.Nm
14.Op Fl cq
15.Cm add
16.Ar rule
17.Nm
18.Op Fl acdefnNStT
19.Op Cm set Ar N
20.Brq Cm list | show
21.Op Ar rule | first-last ...
22.Nm
23.Op Fl f | q
24.Op Cm set Ar N
25.Cm flush
26.Nm
27.Op Fl q
28.Op Cm set Ar N
29.Brq Cm delete | zero | resetlog
30.Op Ar number ...
31.Pp
32.Nm
33.Cm set Oo Cm disable Ar number ... Oc Op Cm enable Ar number ...
34.Nm
35.Cm set move
36.Op Cm rule
37.Ar number Cm to Ar number
38.Nm
39.Cm set swap Ar number number
40.Nm
41.Cm set show
42.Ss SYSCTL SHORTCUTS
43.Nm
44.Cm enable
45.Brq Cm firewall | altq | one_pass | debug | verbose | dyn_keepalive
46.Nm
47.Cm disable
48.Brq Cm firewall | altq | one_pass | debug | verbose | dyn_keepalive
49.Ss LOOKUP TABLES
50.Nm
51.Oo Cm set Ar N Oc Cm table Ar name Cm create Ar create-options
52.Nm
53.Oo Cm set Ar N Oc Cm table
54.Brq Ar name | all
55.Cm destroy
56.Nm
57.Oo Cm set Ar N Oc Cm table Ar name Cm modify Ar modify-options
58.Nm
59.Oo Cm set Ar N Oc Cm table Ar name Cm swap Ar name
60.Nm
61.Oo Cm set Ar N Oc Cm table Ar name Cm add Ar table-key Op Ar value
62.Nm
63.Oo Cm set Ar N Oc Cm table Ar name Cm add Op Ar table-key Ar value ...
64.Nm
65.Oo Cm set Ar N Oc Cm table Ar name Cm atomic add Op Ar table-key Ar value ...
66.Nm
67.Oo Cm set Ar N Oc Cm table Ar name Cm delete Op Ar table-key ...
68.Nm
69.Oo Cm set Ar N Oc Cm table Ar name Cm lookup Ar addr
70.Nm
71.Oo Cm set Ar N Oc Cm table Ar name Cm lock
72.Nm
73.Oo Cm set Ar N Oc Cm table Ar name Cm unlock
74.Nm
75.Oo Cm set Ar N Oc Cm table
76.Brq Ar name | all
77.Cm list
78.Nm
79.Oo Cm set Ar N Oc Cm table
80.Brq Ar name | all
81.Cm info
82.Nm
83.Oo Cm set Ar N Oc Cm table
84.Brq Ar name | all
85.Cm detail
86.Nm
87.Oo Cm set Ar N Oc Cm table
88.Brq Ar name | all
89.Cm flush
90.Ss DUMMYNET CONFIGURATION (TRAFFIC SHAPER AND PACKET SCHEDULER)
91.Nm dnctl
92.Brq Cm pipe | queue | sched
93.Ar number
94.Cm config
95.Ar config-options
96.Nm dnctl
97.Op Fl s Op Ar field
98.Brq Cm pipe | queue | sched
99.Brq Cm delete | list | show
100.Op Ar number ...
101.Ss IN-KERNEL NAT
102.Nm
103.Op Fl q
104.Cm nat
105.Ar number
106.Cm config
107.Ar config-options
108.Nm
109.Cm nat
110.Ar number
111.Cm show
112.Brq Cm config | log
113.Ss STATEFUL IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
114.Nm
115.Oo Cm set Ar N Oc Cm nat64lsn Ar name Cm create Ar create-options
116.Nm
117.Oo Cm set Ar N Oc Cm nat64lsn Ar name Cm config Ar config-options
118.Nm
119.Oo Cm set Ar N Oc Cm nat64lsn
120.Brq Ar name | all
121.Brq Cm list | show
122.Op Cm states
123.Nm
124.Oo Cm set Ar N Oc Cm nat64lsn
125.Brq Ar name | all
126.Cm destroy
127.Nm
128.Oo Cm set Ar N Oc Cm nat64lsn Ar name Cm stats Op Cm reset
129.Ss STATELESS IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
130.Nm
131.Oo Cm set Ar N Oc Cm nat64stl Ar name Cm create Ar create-options
132.Nm
133.Oo Cm set Ar N Oc Cm nat64stl Ar name Cm config Ar config-options
134.Nm
135.Oo Cm set Ar N Oc Cm nat64stl
136.Brq Ar name | all
137.Brq Cm list | show
138.Nm
139.Oo Cm set Ar N Oc Cm nat64stl
140.Brq Ar name | all
141.Cm destroy
142.Nm
143.Oo Cm set Ar N Oc Cm nat64stl Ar name Cm stats Op Cm reset
144.Ss XLAT464 CLAT IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
145.Nm
146.Oo Cm set Ar N Oc Cm nat64clat Ar name Cm create Ar create-options
147.Nm
148.Oo Cm set Ar N Oc Cm nat64clat Ar name Cm config Ar config-options
149.Nm
150.Oo Cm set Ar N Oc Cm nat64clat
151.Brq Ar name | all
152.Brq Cm list | show
153.Nm
154.Oo Cm set Ar N Oc Cm nat64clat
155.Brq Ar name | all
156.Cm destroy
157.Nm
158.Oo Cm set Ar N Oc Cm nat64clat Ar name Cm stats Op Cm reset
159.Ss IPv6-to-IPv6 NETWORK PREFIX TRANSLATION
160.Nm
161.Oo Cm set Ar N Oc Cm nptv6 Ar name Cm create Ar create-options
162.Nm
163.Oo Cm set Ar N Oc Cm nptv6
164.Brq Ar name | all
165.Brq Cm list | show
166.Nm
167.Oo Cm set Ar N Oc Cm nptv6
168.Brq Ar name | all
169.Cm destroy
170.Nm
171.Oo Cm set Ar N Oc Cm nptv6 Ar name Cm stats Op Cm reset
172.Ss INTERNAL DIAGNOSTICS
173.Nm
174.Cm internal iflist
175.Nm
176.Cm internal talist
177.Nm
178.Cm internal vlist
179.Ss LIST OF RULES AND PREPROCESSING
180.Nm
181.Op Fl cfnNqS
182.Oo
183.Fl p Ar preproc
184.Oo
185.Ar preproc-flags
186.Oc
187.Oc
188.Ar pathname
189.Sh DESCRIPTION
190The
191.Nm
192utility is the user interface for controlling the
193.Xr ipfw 4
194firewall, the
195.Xr dummynet 4
196traffic shaper/packet scheduler, and the
197in-kernel NAT services.
198.Pp
199A firewall configuration, or
200.Em ruleset ,
201is made of a list of
202.Em rules
203numbered from 1 to 65535.
204Packets are passed to the firewall
205from a number of different places in the protocol stack
206(depending on the source and destination of the packet,
207it is possible for the firewall to be
208invoked multiple times on the same packet).
209The packet passed to the firewall is compared
210against each of the rules in the
211.Em ruleset ,
212in rule-number order
213(multiple rules with the same number are permitted, in which case
214they are processed in order of insertion).
215When a match is found, the action corresponding to the
216matching rule is performed.
217.Pp
218Depending on the action and certain system settings, packets
219can be reinjected into the firewall at some rule after the
220matching one for further processing.
221.Pp
222A ruleset always includes a
223.Em default
224rule (numbered 65535) which cannot be modified or deleted,
225and matches all packets.
226The action associated with the
227.Em default
228rule can be either
229.Cm deny
230or
231.Cm allow
232depending on how the kernel is configured.
233.Pp
234If the ruleset includes one or more rules with the
235.Cm keep-state ,
236.Cm record-state ,
237.Cm limit
238or
239.Cm set-limit
240option,
241the firewall will have a
242.Em stateful
243behaviour, i.e., upon a match it will create
244.Em dynamic rules ,
245i.e., rules that match packets with the same 5-tuple
246(protocol, source and destination addresses and ports)
247as the packet which caused their creation.
248Dynamic rules, which have a limited lifetime, are checked
249at the first occurrence of a
250.Cm check-state ,
251.Cm keep-state
252or
253.Cm limit
254rule, and are typically used to open the firewall on-demand to
255legitimate traffic only.
256Please note, that
257.Cm keep-state
258and
259.Cm limit
260imply implicit
261.Cm check-state
262for all packets (not only these matched by the rule) but
263.Cm record-state
264and
265.Cm set-limit
266have no implicit
267.Cm check-state .
268See the
269.Sx STATEFUL FIREWALL
270and
271.Sx EXAMPLES
272Sections below for more information on the stateful behaviour of
273.Nm .
274.Pp
275All rules (including dynamic ones) have a few associated counters:
276a packet count, a byte count, a log count and a timestamp
277indicating the time of the last match.
278Counters can be displayed or reset with
279.Nm
280commands.
281.Pp
282Each rule belongs to one of 32 different
283.Em sets
284, and there are
285.Nm
286commands to atomically manipulate sets, such as enable,
287disable, swap sets, move all rules in a set to another
288one, delete all rules in a set.
289These can be useful to
290install temporary configurations, or to test them.
291See Section
292.Sx SETS OF RULES
293for more information on
294.Em sets .
295.Pp
296Rules can be added with the
297.Cm add
298command; deleted individually or in groups with the
299.Cm delete
300command, and globally (except those in set 31) with the
301.Cm flush
302command; displayed, optionally with the content of the
303counters, using the
304.Cm show
305and
306.Cm list
307commands.
308Finally, counters can be reset with the
309.Cm zero
310and
311.Cm resetlog
312commands.
313.Ss COMMAND OPTIONS
314The following general options are available when invoking
315.Nm :
316.Bl -tag -width indent
317.It Fl a
318Show counter values when listing rules.
319The
320.Cm show
321command implies this option.
322.It Fl b
323Only show the action and the comment, not the body of a rule.
324Implies
325.Fl c .
326.It Fl c
327When entering or showing rules, print them in compact form,
328i.e., omitting the "ip from any to any" string
329when this does not carry any additional information.
330.It Fl d
331When listing, show dynamic rules in addition to static ones.
332.It Fl D
333When listing, show only dynamic states.
334When deleting, delete only dynamic states.
335.It Fl f
336Run without prompting for confirmation for commands that can cause problems if misused,
337i.e.,
338.Cm flush .
339If there is no tty associated with the process, this is implied.
340The
341.Cm delete
342command with this flag ignores possible errors,
343i.e., nonexistent rule number.
344And for batched commands execution continues with the next command.
345.It Fl i
346When listing a table (see the
347.Sx LOOKUP TABLES
348section below for more information on lookup tables), format values
349as IP addresses.
350By default, values are shown as integers.
351.It Fl n
352Only check syntax of the command strings, without actually passing
353them to the kernel.
354.It Fl N
355Try to resolve addresses and service names in output.
356.It Fl q
357Be quiet when executing the
358.Cm add ,
359.Cm nat ,
360.Cm zero ,
361.Cm resetlog
362or
363.Cm flush
364commands;
365(implies
366.Fl f ) .
367This is useful when updating rulesets by executing multiple
368.Nm
369commands in a script
370(e.g.,
371.Ql sh\ /etc/rc.firewall ) ,
372or by processing a file with many
373.Nm
374rules across a remote login session.
375It also stops a table add or delete
376from failing if the entry already exists or is not present.
377.Pp
378The reason why this option may be important is that
379for some of these actions,
380.Nm
381may print a message; if the action results in blocking the
382traffic to the remote client,
383the remote login session will be closed
384and the rest of the ruleset will not be processed.
385Access to the console would then be required to recover.
386.It Fl S
387When listing rules, show the
388.Em set
389each rule belongs to.
390If this flag is not specified, disabled rules will not be
391listed.
392.It Fl s Op Ar field
393When listing pipes, sort according to one of the four
394counters (total or current packets or bytes).
395.It Fl t
396When listing, show last match timestamp converted with
397.Fn ctime .
398.It Fl T
399When listing, show last match timestamp as seconds from the epoch.
400This form can be more convenient for postprocessing by scripts.
401.El
402.Ss LIST OF RULES AND PREPROCESSING
403To ease configuration, rules can be put into a file which is
404processed using
405.Nm
406as shown in the last synopsis line.
407An absolute
408.Ar pathname
409must be used.
410The file will be read line by line and applied as arguments to the
411.Nm
412utility.
413.Pp
414Optionally, a preprocessor can be specified using
415.Fl p Ar preproc
416where
417.Ar pathname
418is to be piped through.
419Useful preprocessors include
420.Xr cpp 1
421and
422.Xr m4 1 .
423If
424.Ar preproc
425does not start with a slash
426.Pq Ql /
427as its first character, the usual
428.Ev PATH
429name search is performed.
430Care should be taken with this in environments where not all
431file systems are mounted (yet) by the time
432.Nm
433is being run (e.g.\& when they are mounted over NFS).
434Once
435.Fl p
436has been specified, any additional arguments are passed on to the preprocessor
437for interpretation.
438This allows for flexible configuration files (like conditionalizing
439them on the local hostname) and the use of macros to centralize
440frequently required arguments like IP addresses.
441.Ss TRAFFIC SHAPER CONFIGURATION
442The
443.Nm dnctl
444.Cm pipe , queue
445and
446.Cm sched
447commands are used to configure the traffic shaper and packet scheduler.
448See the
449.Sx TRAFFIC SHAPER (DUMMYNET) CONFIGURATION
450Section below for details.
451.Pp
452If the world and the kernel get out of sync the
453.Nm
454ABI may break, preventing you from being able to add any rules.
455This can adversely affect the booting process.
456You can use
457.Nm
458.Cm disable
459.Cm firewall
460to temporarily disable the firewall to regain access to the network,
461allowing you to fix the problem.
462.Sh PACKET FLOW
463A packet is checked against the active ruleset in multiple places
464in the protocol stack, under control of several sysctl variables.
465These places and variables are shown below, and it is important to
466have this picture in mind in order to design a correct ruleset.
467.Bd -literal -offset indent
468       ^    to upper layers    V
469       |                       |
470       +----------->-----------+
471       ^                       V
472 [ip(6)_input]           [ip(6)_output]     net.inet(6).ip(6).fw.enable=1
473       |                       |
474       ^                       V
475 [ether_demux]        [ether_output_frame]  net.link.ether.ipfw=1
476       |                       |
477       +-->--[bdg_forward]-->--+            net.link.bridge.ipfw=1
478       ^                       V
479       |      to devices       |
480.Ed
481.Pp
482The number of
483times the same packet goes through the firewall can
484vary between 0 and 4 depending on packet source and
485destination, and system configuration.
486.Pp
487Note that as packets flow through the stack, headers can be
488stripped or added to it, and so they may or may not be available
489for inspection.
490E.g., incoming packets will include the MAC header when
491.Nm
492is invoked from
493.Cm ether_demux() ,
494but the same packets will have the MAC header stripped off when
495.Nm
496is invoked from
497.Cm ip_input()
498or
499.Cm ip6_input() .
500.Pp
501Also note that each packet is always checked against the complete ruleset,
502irrespective of the place where the check occurs, or the source of the packet.
503If a rule contains some match patterns or actions which are not valid
504for the place of invocation (e.g.\& trying to match a MAC header within
505.Cm ip_input
506or
507.Cm ip6_input ),
508the match pattern will not match, but a
509.Cm not
510operator in front of such patterns
511.Em will
512cause the pattern to
513.Em always
514match on those packets.
515It is thus the responsibility of
516the programmer, if necessary, to write a suitable ruleset to
517differentiate among the possible places.
518.Cm skipto
519rules can be useful here, as an example:
520.Bd -literal -offset indent
521# packets from ether_demux or bdg_forward
522ipfw add 10 skipto 1000 all from any to any layer2 in
523# packets from ip_input
524ipfw add 10 skipto 2000 all from any to any not layer2 in
525# packets from ip_output
526ipfw add 10 skipto 3000 all from any to any not layer2 out
527# packets from ether_output_frame
528ipfw add 10 skipto 4000 all from any to any layer2 out
529.Ed
530.Pp
531(yes, at the moment there is no way to differentiate between
532ether_demux and bdg_forward).
533.Pp
534Also note that only actions
535.Cm allow ,
536.Cm deny ,
537.Cm netgraph ,
538.Cm ngtee
539and related to
540.Cm dummynet
541are processed for
542.Cm layer2
543frames and all other actions act as if they were
544.Cm allow
545for such frames.
546Full set of actions is supported for IP packets without
547.Cm layer2
548headers only.
549For example,
550.Cm divert
551action does not divert
552.Cm layer2
553frames.
554.Sh SYNTAX
555In general, each keyword or argument must be provided as
556a separate command line argument, with no leading or trailing
557spaces.
558Keywords are case-sensitive, whereas arguments may
559or may not be case-sensitive depending on their nature
560(e.g.\& uid's are, hostnames are not).
561.Pp
562Some arguments (e.g., port or address lists) are comma-separated
563lists of values.
564In this case, spaces after commas ',' are allowed to make
565the line more readable.
566You can also put the entire
567command (including flags) into a single argument.
568E.g., the following forms are equivalent:
569.Bd -literal -offset indent
570ipfw -q add deny src-ip 10.0.0.0/24,127.0.0.1/8
571ipfw -q add deny src-ip 10.0.0.0/24, 127.0.0.1/8
572ipfw "-q add deny src-ip 10.0.0.0/24, 127.0.0.1/8"
573.Ed
574.Sh RULE FORMAT
575The format of firewall rules is the following:
576.Bd -ragged -offset indent
577.Bk -words
578.Op Ar rule_number
579.Op Cm set Ar set_number
580.Op Cm prob Ar match_probability
581.Ar action
582.Op Cm log Op Cm logamount Ar number
583.Op Cm altq Ar queue
584.Oo
585.Bro Cm tag | untag
586.Brc Ar number
587.Oc
588.Ar body
589.Ek
590.Ed
591.Pp
592where the body of the rule specifies which information is used
593for filtering packets, among the following:
594.Pp
595.Bl -tag -width "Source and dest. addresses and ports" -offset XXX -compact
596.It Layer2 header fields
597When available
598.It IPv4 and IPv6 Protocol
599SCTP, TCP, UDP, ICMP, etc.
600.It Source and dest. addresses and ports
601.It Direction
602See Section
603.Sx PACKET FLOW
604.It Transmit and receive interface
605By name or address
606.It Misc. IP header fields
607Version, type of service, datagram length, identification,
608fragmentation flags,
609Time To Live
610.It IP options
611.It IPv6 Extension headers
612Fragmentation, Hop-by-Hop options,
613Routing Headers, Source routing rthdr0, Mobile IPv6 rthdr2, IPSec options.
614.It IPv6 Flow-ID
615.It Misc. TCP header fields
616TCP flags (SYN, FIN, ACK, RST, etc.),
617sequence number, acknowledgment number,
618window
619.It TCP options
620.It ICMP types
621for ICMP packets
622.It ICMP6 types
623for ICMP6 packets
624.It User/group ID
625When the packet can be associated with a local socket.
626.It Divert status
627Whether a packet came from a divert socket (e.g.,
628.Xr natd 8 ) .
629.It Fib annotation state
630Whether a packet has been tagged for using a specific FIB (routing table)
631in future forwarding decisions.
632.El
633.Pp
634Note that some of the above information, e.g.\& source MAC or IP addresses and
635TCP/UDP ports, can be easily spoofed, so filtering on those fields
636alone might not guarantee the desired results.
637.Bl -tag -width indent
638.It Ar rule_number
639Each rule is associated with a
640.Ar rule_number
641in the range 1..65535, with the latter reserved for the
642.Em default
643rule.
644Rules are checked sequentially by rule number.
645Multiple rules can have the same number, in which case they are
646checked (and listed) according to the order in which they have
647been added.
648If a rule is entered without specifying a number, the kernel will
649assign one in such a way that the rule becomes the last one
650before the
651.Em default
652rule.
653Automatic rule numbers are assigned by incrementing the last
654non-default rule number by the value of the sysctl variable
655.Ar net.inet.ip.fw.autoinc_step
656which defaults to 100.
657If this is not possible (e.g.\& because we would go beyond the
658maximum allowed rule number), the number of the last
659non-default value is used instead.
660.It Cm set Ar set_number
661Each rule is associated with a
662.Ar set_number
663in the range 0..31.
664Sets can be individually disabled and enabled, so this parameter
665is of fundamental importance for atomic ruleset manipulation.
666It can be also used to simplify deletion of groups of rules.
667If a rule is entered without specifying a set number,
668set 0 will be used.
669.br
670Set 31 is special in that it cannot be disabled,
671and rules in set 31 are not deleted by the
672.Nm ipfw flush
673command (but you can delete them with the
674.Nm ipfw delete set 31
675command).
676Set 31 is also used for the
677.Em default
678rule.
679.It Cm prob Ar match_probability
680A match is only declared with the specified probability
681(floating point number between 0 and 1).
682This can be useful for a number of applications such as
683random packet drop or
684(in conjunction with
685.Nm dummynet )
686to simulate the effect of multiple paths leading to out-of-order
687packet delivery.
688.Pp
689Note: this condition is checked before any other condition, including
690ones such as
691.Cm keep-state
692or
693.Cm check-state
694which might have
695side effects.
696.It Cm log Op Cm logamount Ar number
697Packets matching a rule with the
698.Cm log
699keyword will be made available for logging in two ways:
700if the sysctl variable
701.Va net.inet.ip.fw.verbose
702is set to 0 (default), one can use
703.Xr bpf 4
704attached to the
705.Li ipfw0
706pseudo interface.
707This pseudo interface can be created manually after a system
708boot by using the following command:
709.Bd -literal -offset indent
710# ifconfig ipfw0 create
711.Ed
712.Pp
713Or, automatically at boot time by adding the following
714line to the
715.Xr rc.conf 5
716file:
717.Bd -literal -offset indent
718firewall_logif="YES"
719.Ed
720.Pp
721There is zero overhead when no
722.Xr bpf 4
723is attached to the pseudo interface.
724.Pp
725If
726.Va net.inet.ip.fw.verbose
727is set to 1, packets will be logged to
728.Xr syslogd 8
729with a
730.Dv LOG_SECURITY
731facility up to a maximum of
732.Cm logamount
733packets.
734If no
735.Cm logamount
736is specified, the limit is taken from the sysctl variable
737.Va net.inet.ip.fw.verbose_limit .
738In both cases, a value of 0 means unlimited logging.
739.Pp
740Once the limit is reached, logging can be re-enabled by
741clearing the logging counter or the packet counter for that entry, see the
742.Cm resetlog
743command.
744.Pp
745Note: logging is done after all other packet matching conditions
746have been successfully verified, and before performing the final
747action (accept, deny, etc.) on the packet.
748.It Cm tag Ar number
749When a packet matches a rule with the
750.Cm tag
751keyword, the numeric tag for the given
752.Ar number
753in the range 1..65534 will be attached to the packet.
754The tag acts as an internal marker (it is not sent out over
755the wire) that can be used to identify these packets later on.
756This can be used, for example, to provide trust between interfaces
757and to start doing policy-based filtering.
758A packet can have multiple tags at the same time.
759Tags are "sticky", meaning once a tag is applied to a packet by a
760matching rule it exists until explicit removal.
761Tags are kept with the packet everywhere within the kernel, but are
762lost when packet leaves the kernel, for example, on transmitting
763packet out to the network or sending packet to a
764.Xr divert 4
765socket.
766.Pp
767To check for previously applied tags, use the
768.Cm tagged
769rule option.
770To delete previously applied tag, use the
771.Cm untag
772keyword.
773.Pp
774Note: since tags are kept with the packet everywhere in kernelspace,
775they can be set and unset anywhere in the kernel network subsystem
776(using the
777.Xr mbuf_tags 9
778facility), not only by means of the
779.Xr ipfw 4
780.Cm tag
781and
782.Cm untag
783keywords.
784For example, there can be a specialized
785.Xr netgraph 4
786node doing traffic analyzing and tagging for later inspecting
787in firewall.
788.It Cm untag Ar number
789When a packet matches a rule with the
790.Cm untag
791keyword, the tag with the number
792.Ar number
793is searched among the tags attached to this packet and,
794if found, removed from it.
795Other tags bound to packet, if present, are left untouched.
796.It Cm altq Ar queue
797When a packet matches a rule with the
798.Cm altq
799keyword, the ALTQ identifier for the given
800.Ar queue
801(see
802.Xr altq 4 )
803will be attached.
804Note that this ALTQ tag is only meaningful for packets going "out" of IPFW,
805and not being rejected or going to divert sockets.
806Note that if there is insufficient memory at the time the packet is
807processed, it will not be tagged, so it is wise to make your ALTQ
808"default" queue policy account for this.
809If multiple
810.Cm altq
811rules match a single packet, only the first one adds the ALTQ classification
812tag.
813In doing so, traffic may be shaped by using
814.Cm count Cm altq Ar queue
815rules for classification early in the ruleset, then later applying
816the filtering decision.
817For example,
818.Cm check-state
819and
820.Cm keep-state
821rules may come later and provide the actual filtering decisions in
822addition to the fallback ALTQ tag.
823.Pp
824You must run
825.Xr pfctl 8
826to set up the queues before IPFW will be able to look them up by name,
827and if the ALTQ disciplines are rearranged, the rules in containing the
828queue identifiers in the kernel will likely have gone stale and need
829to be reloaded.
830Stale queue identifiers will probably result in misclassification.
831.Pp
832All system ALTQ processing can be turned on or off via
833.Nm
834.Cm enable Ar altq
835and
836.Nm
837.Cm disable Ar altq .
838The usage of
839.Va net.inet.ip.fw.one_pass
840is irrelevant to ALTQ traffic shaping, as the actual rule action is followed
841always after adding an ALTQ tag.
842.El
843.Ss RULE ACTIONS
844A rule can be associated with one of the following actions, which
845will be executed when the packet matches the body of the rule.
846.Bl -tag -width indent
847.It Cm allow | accept | pass | permit
848Allow packets that match rule.
849The search terminates.
850.It Cm check-state Op Ar :flowname | Cm :any
851Checks the packet against the dynamic ruleset.
852If a match is found, execute the action associated with
853the rule which generated this dynamic rule, otherwise
854move to the next rule.
855.br
856.Cm Check-state
857rules do not have a body.
858If no
859.Cm check-state
860rule is found, the dynamic ruleset is checked at the first
861.Cm keep-state
862or
863.Cm limit
864rule.
865The
866.Ar :flowname
867is symbolic name assigned to dynamic rule by
868.Cm keep-state
869opcode.
870The special flowname
871.Cm :any
872can be used to ignore states flowname when matching.
873The
874.Cm :default
875keyword is special name used for compatibility with old rulesets.
876.It Cm count
877Update counters for all packets that match rule.
878The search continues with the next rule.
879.It Cm deny | drop
880Discard packets that match this rule.
881The search terminates.
882.It Cm divert Ar port
883Divert packets that match this rule to the
884.Xr divert 4
885socket bound to port
886.Ar port .
887The search terminates.
888.It Cm fwd | forward Ar ipaddr | tablearg Ns Op , Ns Ar port
889Change the next-hop on matching packets to
890.Ar ipaddr ,
891which can be an IP address or a host name.
892The next hop can also be supplied by the last table
893looked up for the packet by using the
894.Cm tablearg
895keyword instead of an explicit address.
896The search terminates if this rule matches.
897.Pp
898If
899.Ar ipaddr
900is a local address, then matching packets will be forwarded to
901.Ar port
902(or the port number in the packet if one is not specified in the rule)
903on the local machine.
904.br
905If
906.Ar ipaddr
907is not a local address, then the port number
908(if specified) is ignored, and the packet will be
909forwarded to the remote address, using the route as found in
910the local routing table for that IP.
911.br
912A
913.Ar fwd
914rule will not match layer2 packets (those received
915on ether_input, ether_output, or bridged).
916.br
917The
918.Cm fwd
919action does not change the contents of the packet at all.
920In particular, the destination address remains unmodified, so
921packets forwarded to another system will usually be rejected by that system
922unless there is a matching rule on that system to capture them.
923For packets forwarded locally,
924the local address of the socket will be
925set to the original destination address of the packet.
926This makes the
927.Xr netstat 1
928entry look rather weird but is intended for
929use with transparent proxy servers.
930.It Cm nat Ar nat_nr | global | tablearg
931Pass packet to a
932nat instance
933(for network address translation, address redirect, etc.):
934see the
935.Sx NETWORK ADDRESS TRANSLATION (NAT)
936Section for further information.
937.It Cm nat64lsn Ar name
938Pass packet to a stateful NAT64 instance (for IPv6/IPv4 network address and
939protocol translation): see the
940.Sx IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
941Section for further information.
942.It Cm nat64stl Ar name
943Pass packet to a stateless NAT64 instance (for IPv6/IPv4 network address and
944protocol translation): see the
945.Sx IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
946Section for further information.
947.It Cm nat64clat Ar name
948Pass packet to a CLAT NAT64 instance (for client-side IPv6/IPv4 network address and
949protocol translation): see the
950.Sx IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
951Section for further information.
952.It Cm nptv6 Ar name
953Pass packet to a NPTv6 instance (for IPv6-to-IPv6 network prefix translation):
954see the
955.Sx IPv6-to-IPv6 NETWORK PREFIX TRANSLATION (NPTv6)
956Section for further information.
957.It Cm pipe Ar pipe_nr
958Pass packet to a
959.Nm dummynet
960.Dq pipe
961(for bandwidth limitation, delay, etc.).
962See the
963.Sx TRAFFIC SHAPER (DUMMYNET) CONFIGURATION
964Section for further information.
965The search terminates; however, on exit from the pipe and if
966the
967.Xr sysctl 8
968variable
969.Va net.inet.ip.fw.one_pass
970is not set, the packet is passed again to the firewall code
971starting from the next rule.
972.It Cm queue Ar queue_nr
973Pass packet to a
974.Nm dummynet
975.Dq queue
976(for bandwidth limitation using WF2Q+).
977.It Cm reject
978(Deprecated).
979Synonym for
980.Cm unreach host .
981.It Cm reset
982Discard packets that match this rule, and if the
983packet is a TCP packet, try to send a TCP reset (RST) notice.
984The search terminates.
985.It Cm reset6
986Discard packets that match this rule, and if the
987packet is a TCP packet, try to send a TCP reset (RST) notice.
988The search terminates.
989.It Cm skipto Ar number | tablearg
990Skip all subsequent rules numbered less than
991.Ar number .
992The search continues with the first rule numbered
993.Ar number
994or higher.
995It is possible to use the
996.Cm tablearg
997keyword with a skipto for a
998.Em computed
999skipto.
1000Skipto may work either in O(log(N)) or in O(1) depending
1001on amount of memory and/or sysctl variables.
1002See the
1003.Sx SYSCTL VARIABLES
1004section for more details.
1005.It Cm call Ar number | tablearg
1006The current rule number is saved in the internal stack and
1007ruleset processing continues with the first rule numbered
1008.Ar number
1009or higher.
1010If later a rule with the
1011.Cm return
1012action is encountered, the processing returns to the first rule
1013with number of this
1014.Cm call
1015rule plus one or higher
1016(the same behaviour as with packets returning from
1017.Xr divert 4
1018socket after a
1019.Cm divert
1020action).
1021This could be used to make somewhat like an assembly language
1022.Dq subroutine
1023calls to rules with common checks for different interfaces, etc.
1024.Pp
1025Rule with any number could be called, not just forward jumps as with
1026.Cm skipto .
1027So, to prevent endless loops in case of mistakes, both
1028.Cm call
1029and
1030.Cm return
1031actions don't do any jumps and simply go to the next rule if memory
1032cannot be allocated or stack overflowed/underflowed.
1033.Pp
1034Internally stack for rule numbers is implemented using
1035.Xr mbuf_tags 9
1036facility and currently has size of 16 entries.
1037As mbuf tags are lost when packet leaves the kernel,
1038.Cm divert
1039should not be used in subroutines to avoid endless loops
1040and other undesired effects.
1041.It Cm return
1042Takes rule number saved to internal stack by the last
1043.Cm call
1044action and returns ruleset processing to the first rule
1045with number greater than number of corresponding
1046.Cm call
1047rule.
1048See description of the
1049.Cm call
1050action for more details.
1051.Pp
1052Note that
1053.Cm return
1054rules usually end a
1055.Dq subroutine
1056and thus are unconditional, but
1057.Nm
1058command-line utility currently requires every action except
1059.Cm check-state
1060to have body.
1061While it is sometimes useful to return only on some packets,
1062usually you want to print just
1063.Dq return
1064for readability.
1065A workaround for this is to use new syntax and
1066.Fl c
1067switch:
1068.Bd -literal -offset indent
1069# Add a rule without actual body
1070ipfw add 2999 return via any
1071
1072# List rules without "from any to any" part
1073ipfw -c list
1074.Ed
1075.Pp
1076This cosmetic annoyance may be fixed in future releases.
1077.It Cm tee Ar port
1078Send a copy of packets matching this rule to the
1079.Xr divert 4
1080socket bound to port
1081.Ar port .
1082The search continues with the next rule.
1083.It Cm unreach Ar code
1084Discard packets that match this rule, and try to send an ICMP
1085unreachable notice with code
1086.Ar code ,
1087where
1088.Ar code
1089is a number from 0 to 255, or one of these aliases:
1090.Cm net , host , protocol , port ,
1091.Cm needfrag , srcfail , net-unknown , host-unknown ,
1092.Cm isolated , net-prohib , host-prohib , tosnet ,
1093.Cm toshost , filter-prohib , host-precedence
1094or
1095.Cm precedence-cutoff .
1096The search terminates.
1097.It Cm unreach6 Ar code
1098Discard packets that match this rule, and try to send an ICMPv6
1099unreachable notice with code
1100.Ar code ,
1101where
1102.Ar code
1103is a number from 0, 1, 3 or 4, or one of these aliases:
1104.Cm no-route, admin-prohib, address
1105or
1106.Cm port .
1107The search terminates.
1108.It Cm netgraph Ar cookie
1109Divert packet into netgraph with given
1110.Ar cookie .
1111The search terminates.
1112If packet is later returned from netgraph it is either
1113accepted or continues with the next rule, depending on
1114.Va net.inet.ip.fw.one_pass
1115sysctl variable.
1116.It Cm ngtee Ar cookie
1117A copy of packet is diverted into netgraph, original
1118packet continues with the next rule.
1119See
1120.Xr ng_ipfw 4
1121for more information on
1122.Cm netgraph
1123and
1124.Cm ngtee
1125actions.
1126.It Cm setfib Ar fibnum | tablearg
1127The packet is tagged so as to use the FIB (routing table)
1128.Ar fibnum
1129in any subsequent forwarding decisions.
1130In the current implementation, this is limited to the values 0 through 15, see
1131.Xr setfib 2 .
1132Processing continues at the next rule.
1133It is possible to use the
1134.Cm tablearg
1135keyword with setfib.
1136If the tablearg value is not within the compiled range of fibs,
1137the packet's fib is set to 0.
1138.It Cm setdscp Ar DSCP | number | tablearg
1139Set specified DiffServ codepoint for an IPv4/IPv6 packet.
1140Processing continues at the next rule.
1141Supported values are:
1142.Pp
1143.Cm cs0
1144.Pq Dv 000000 ,
1145.Cm cs1
1146.Pq Dv 001000 ,
1147.Cm cs2
1148.Pq Dv 010000 ,
1149.Cm cs3
1150.Pq Dv 011000 ,
1151.Cm cs4
1152.Pq Dv 100000 ,
1153.Cm cs5
1154.Pq Dv 101000 ,
1155.Cm cs6
1156.Pq Dv 110000 ,
1157.Cm cs7
1158.Pq Dv 111000 ,
1159.Cm af11
1160.Pq Dv 001010 ,
1161.Cm af12
1162.Pq Dv 001100 ,
1163.Cm af13
1164.Pq Dv 001110 ,
1165.Cm af21
1166.Pq Dv 010010 ,
1167.Cm af22
1168.Pq Dv 010100 ,
1169.Cm af23
1170.Pq Dv 010110 ,
1171.Cm af31
1172.Pq Dv 011010 ,
1173.Cm af32
1174.Pq Dv 011100 ,
1175.Cm af33
1176.Pq Dv 011110 ,
1177.Cm af41
1178.Pq Dv 100010 ,
1179.Cm af42
1180.Pq Dv 100100 ,
1181.Cm af43
1182.Pq Dv 100110 ,
1183.Cm ef
1184.Pq Dv 101110 ,
1185.Cm be
1186.Pq Dv 000000 .
1187Additionally, DSCP value can be specified by number (0..63).
1188It is also possible to use the
1189.Cm tablearg
1190keyword with setdscp.
1191If the tablearg value is not within the 0..63 range, lower 6 bits of supplied
1192value are used.
1193.It Cm tcp-setmss Ar mss
1194Set the Maximum Segment Size (MSS) in the TCP segment to value
1195.Ar mss .
1196The kernel module
1197.Cm ipfw_pmod
1198should be loaded or kernel should have
1199.Cm options IPFIREWALL_PMOD
1200to be able use this action.
1201This command does not change a packet if original MSS value is lower than
1202specified value.
1203Both TCP over IPv4 and over IPv6 are supported.
1204Regardless of matched a packet or not by the
1205.Cm tcp-setmss
1206rule, the search continues with the next rule.
1207.It Cm reass
1208Queue and reassemble IPv4 fragments.
1209If the packet is not fragmented, counters are updated and
1210processing continues with the next rule.
1211If the packet is the last logical fragment, the packet is reassembled and, if
1212.Va net.inet.ip.fw.one_pass
1213is set to 0, processing continues with the next rule.
1214Otherwise, the packet is allowed to pass and the search terminates.
1215If the packet is a fragment in the middle of a logical group of fragments,
1216it is consumed and
1217processing stops immediately.
1218.Pp
1219Fragment handling can be tuned via
1220.Va net.inet.ip.maxfragpackets
1221and
1222.Va net.inet.ip.maxfragsperpacket
1223which limit, respectively, the maximum number of processable
1224fragments (default: 800) and
1225the maximum number of fragments per packet (default: 16).
1226.Pp
1227NOTA BENE: since fragments do not contain port numbers,
1228they should be avoided with the
1229.Nm reass
1230rule.
1231Alternatively, direction-based (like
1232.Nm in
1233/
1234.Nm out
1235) and source-based (like
1236.Nm via
1237) match patterns can be used to select fragments.
1238.Pp
1239Usually a simple rule like:
1240.Bd -literal -offset indent
1241# reassemble incoming fragments
1242ipfw add reass all from any to any in
1243.Ed
1244.Pp
1245is all you need at the beginning of your ruleset.
1246.It Cm abort
1247Discard packets that match this rule, and if the packet is an SCTP packet,
1248try to send an SCTP packet containing an ABORT chunk.
1249The search terminates.
1250.It Cm abort6
1251Discard packets that match this rule, and if the packet is an SCTP packet,
1252try to send an SCTP packet containing an ABORT chunk.
1253The search terminates.
1254.El
1255.Ss RULE BODY
1256The body of a rule contains zero or more patterns (such as
1257specific source and destination addresses or ports,
1258protocol options, incoming or outgoing interfaces, etc.)
1259that the packet must match in order to be recognised.
1260In general, the patterns are connected by (implicit)
1261.Cm and
1262operators -- i.e., all must match in order for the
1263rule to match.
1264Individual patterns can be prefixed by the
1265.Cm not
1266operator to reverse the result of the match, as in
1267.Pp
1268.Dl "ipfw add 100 allow ip from not 1.2.3.4 to any"
1269.Pp
1270Additionally, sets of alternative match patterns
1271.Pq Em or-blocks
1272can be constructed by putting the patterns in
1273lists enclosed between parentheses ( ) or braces { }, and
1274using the
1275.Cm or
1276operator as follows:
1277.Pp
1278.Dl "ipfw add 100 allow ip from { x or not y or z } to any"
1279.Pp
1280Only one level of parentheses is allowed.
1281Beware that most shells have special meanings for parentheses
1282or braces, so it is advisable to put a backslash \\ in front of them
1283to prevent such interpretations.
1284.Pp
1285The body of a rule must in general include a source and destination
1286address specifier.
1287The keyword
1288.Ar any
1289can be used in various places to specify that the content of
1290a required field is irrelevant.
1291.Pp
1292The rule body has the following format:
1293.Bd -ragged -offset indent
1294.Op Ar proto Cm from Ar src Cm to Ar dst
1295.Op Ar options
1296.Ed
1297.Pp
1298The first part (proto from src to dst) is for backward
1299compatibility with earlier versions of
1300.Fx .
1301In modern
1302.Fx
1303any match pattern (including MAC headers, IP protocols,
1304addresses and ports) can be specified in the
1305.Ar options
1306section.
1307.Pp
1308Rule fields have the following meaning:
1309.Bl -tag -width indent
1310.It Ar proto : protocol | Cm { Ar protocol Cm or ... }
1311.It Ar protocol : Oo Cm not Oc Ar protocol-name | protocol-number
1312An IP protocol specified by number or name
1313(for a complete list see
1314.Pa /etc/protocols ) ,
1315or one of the following keywords:
1316.Bl -tag -width indent
1317.It Cm ip4 | ipv4
1318Matches IPv4 packets.
1319.It Cm ip6 | ipv6
1320Matches IPv6 packets.
1321.It Cm ip | all
1322Matches any packet.
1323.El
1324.Pp
1325The
1326.Cm ipv6
1327in
1328.Cm proto
1329option will be treated as inner protocol.
1330And, the
1331.Cm ipv4
1332is not available in
1333.Cm proto
1334option.
1335.Pp
1336The
1337.Cm { Ar protocol Cm or ... }
1338format (an
1339.Em or-block )
1340is provided for convenience only but its use is deprecated.
1341.It Ar src No and Ar dst : Bro Cm addr | Cm { Ar addr Cm or ... } Brc Op Oo Cm not Oc Ar ports
1342An address (or a list, see below)
1343optionally followed by
1344.Ar ports
1345specifiers.
1346.Pp
1347The second format
1348.Em ( or-block
1349with multiple addresses) is provided for convenience only and
1350its use is discouraged.
1351.It Ar addr : Oo Cm not Oc Bro
1352.Cm any | me | me6 |
1353.Cm table Ns Pq Ar name Ns Op , Ns Ar value
1354.Ar | addr-list | addr-set
1355.Brc
1356.Bl -tag -width indent
1357.It Cm any
1358Matches any IP address.
1359.It Cm me
1360Matches any IP address configured on an interface in the system.
1361.It Cm me6
1362Matches any IPv6 address configured on an interface in the system.
1363The address list is evaluated at the time the packet is
1364analysed.
1365.It Cm table Ns Pq Ar name Ns Op , Ns Ar value
1366Matches any IPv4 or IPv6 address for which an entry exists in the lookup table
1367.Ar number .
1368If an optional 32-bit unsigned
1369.Ar value
1370is also specified, an entry will match only if it has this value.
1371See the
1372.Sx LOOKUP TABLES
1373section below for more information on lookup tables.
1374.El
1375.It Ar addr-list : ip-addr Ns Op Ns , Ns Ar addr-list
1376.It Ar ip-addr :
1377A host or subnet address specified in one of the following ways:
1378.Bl -tag -width indent
1379.It Ar numeric-ip | hostname
1380Matches a single IPv4 address, specified as dotted-quad or a hostname.
1381Hostnames are resolved at the time the rule is added to the firewall list.
1382.It Ar addr Ns / Ns Ar masklen
1383Matches all addresses with base
1384.Ar addr
1385(specified as an IP address, a network number, or a hostname)
1386and mask width of
1387.Cm masklen
1388bits.
1389As an example, 1.2.3.4/25 or 1.2.3.0/25 will match
1390all IP numbers from 1.2.3.0 to 1.2.3.127 .
1391.It Ar addr Ns : Ns Ar mask
1392Matches all addresses with base
1393.Ar addr
1394(specified as an IP address, a network number, or a hostname)
1395and the mask of
1396.Ar mask ,
1397specified as a dotted quad.
1398As an example, 1.2.3.4:255.0.255.0 or 1.0.3.0:255.0.255.0 will match
13991.*.3.*.
1400This form is advised only for non-contiguous
1401masks.
1402It is better to resort to the
1403.Ar addr Ns / Ns Ar masklen
1404format for contiguous masks, which is more compact and less
1405error-prone.
1406.El
1407.It Ar addr-set : addr Ns Oo Ns / Ns Ar masklen Oc Ns Cm { Ns Ar list Ns Cm }
1408.It Ar list : Bro Ar num | num-num Brc Ns Op Ns , Ns Ar list
1409Matches all addresses with base address
1410.Ar addr
1411(specified as an IP address, a network number, or a hostname)
1412and whose last byte is in the list between braces { } .
1413Note that there must be no spaces between braces and
1414numbers (spaces after commas are allowed).
1415Elements of the list can be specified as single entries
1416or ranges.
1417The
1418.Ar masklen
1419field is used to limit the size of the set of addresses,
1420and can have any value between 24 and 32.
1421If not specified,
1422it will be assumed as 24.
1423.br
1424This format is particularly useful to handle sparse address sets
1425within a single rule.
1426Because the matching occurs using a
1427bitmask, it takes constant time and dramatically reduces
1428the complexity of rulesets.
1429.br
1430As an example, an address specified as 1.2.3.4/24{128,35-55,89}
1431or 1.2.3.0/24{128,35-55,89}
1432will match the following IP addresses:
1433.br
14341.2.3.128, 1.2.3.35 to 1.2.3.55, 1.2.3.89 .
1435.It Ar addr6-list : ip6-addr Ns Op Ns , Ns Ar addr6-list
1436.It Ar ip6-addr :
1437A host or subnet specified one of the following ways:
1438.Bl -tag -width indent
1439.It Ar numeric-ip | hostname
1440Matches a single IPv6 address as allowed by
1441.Xr inet_pton 3
1442or a hostname.
1443Hostnames are resolved at the time the rule is added to the firewall
1444list.
1445.It Ar addr Ns / Ns Ar masklen
1446Matches all IPv6 addresses with base
1447.Ar addr
1448(specified as allowed by
1449.Xr inet_pton 3
1450or a hostname)
1451and mask width of
1452.Cm masklen
1453bits.
1454.It Ar addr Ns / Ns Ar mask
1455Matches all IPv6 addresses with base
1456.Ar addr
1457(specified as allowed by
1458.Xr inet_pton 3
1459or a hostname)
1460and the mask of
1461.Ar mask ,
1462specified as allowed by
1463.Xr inet_pton 3 .
1464As an example, fe::640:0:0/ffff::ffff:ffff:0:0 will match
1465fe:*:*:*:0:640:*:*.
1466This form is advised only for non-contiguous
1467masks.
1468It is better to resort to the
1469.Ar addr Ns / Ns Ar masklen
1470format for contiguous masks, which is more compact and less
1471error-prone.
1472.El
1473.Pp
1474No support for sets of IPv6 addresses is provided because IPv6 addresses
1475are typically random past the initial prefix.
1476.It Ar ports : Bro Ar port | port Ns \&- Ns Ar port Ns Brc Ns Op , Ns Ar ports
1477For protocols which support port numbers (such as SCTP, TCP and UDP), optional
1478.Cm ports
1479may be specified as one or more ports or port ranges, separated
1480by commas but no spaces, and an optional
1481.Cm not
1482operator.
1483The
1484.Ql \&-
1485notation specifies a range of ports (including boundaries).
1486.Pp
1487Service names (from
1488.Pa /etc/services )
1489may be used instead of numeric port values.
1490The length of the port list is limited to 30 ports or ranges,
1491though one can specify larger ranges by using an
1492.Em or-block
1493in the
1494.Cm options
1495section of the rule.
1496.Pp
1497A backslash
1498.Pq Ql \e
1499can be used to escape the dash
1500.Pq Ql -
1501character in a service name (from a shell, the backslash must be
1502typed twice to avoid the shell itself interpreting it as an escape
1503character).
1504.Pp
1505.Dl "ipfw add count tcp from any ftp\e\e-data-ftp to any"
1506.Pp
1507Fragmented packets which have a non-zero offset (i.e., not the first
1508fragment) will never match a rule which has one or more port
1509specifications.
1510See the
1511.Cm frag
1512option for details on matching fragmented packets.
1513.El
1514.Ss RULE OPTIONS (MATCH PATTERNS)
1515Additional match patterns can be used within
1516rules.
1517Zero or more of these so-called
1518.Em options
1519can be present in a rule, optionally prefixed by the
1520.Cm not
1521operand, and possibly grouped into
1522.Em or-blocks .
1523.Pp
1524The following match patterns can be used (listed in alphabetical order):
1525.Bl -tag -width indent
1526.It Cm // this is a comment .
1527Inserts the specified text as a comment in the rule.
1528Everything following // is considered as a comment and stored in the rule.
1529You can have comment-only rules, which are listed as having a
1530.Cm count
1531action followed by the comment.
1532.It Cm bridged
1533Alias for
1534.Cm layer2 .
1535.It Cm defer-immediate-action | defer-action
1536A rule with this option will not perform normal action
1537upon a match.
1538This option is intended to be used with
1539.Cm record-state
1540or
1541.Cm keep-state
1542as the dynamic rule, created but ignored on match, will work
1543as intended.
1544Rules with both
1545.Cm record-state
1546and
1547.Cm defer-immediate-action
1548create a dynamic rule and continue with the next rule without actually
1549performing the action part of this rule.
1550When the rule is later activated via the state table, the action is
1551performed as usual.
1552.It Cm diverted
1553Matches only packets generated by a divert socket.
1554.It Cm diverted-loopback
1555Matches only packets coming from a divert socket back into the IP stack
1556input for delivery.
1557.It Cm diverted-output
1558Matches only packets going from a divert socket back outward to the IP
1559stack output for delivery.
1560.It Cm dst-ip Ar ip-address
1561Matches IPv4 packets whose destination IP is one of the address(es)
1562specified as argument.
1563.It Bro Cm dst-ip6 | dst-ipv6 Brc Ar ip6-address
1564Matches IPv6 packets whose destination IP is one of the address(es)
1565specified as argument.
1566.It Cm dst-port Ar ports
1567Matches IP packets whose destination port is one of the port(s)
1568specified as argument.
1569.It Cm established
1570Matches TCP packets that have the RST or ACK bits set.
1571.It Cm ext6hdr Ar header
1572Matches IPv6 packets containing the extended header given by
1573.Ar header .
1574Supported headers are:
1575.Pp
1576Fragment,
1577.Pq Cm frag ,
1578Hop-to-hop options
1579.Pq Cm hopopt ,
1580any type of Routing Header
1581.Pq Cm route ,
1582Source routing Routing Header Type 0
1583.Pq Cm rthdr0 ,
1584Mobile IPv6 Routing Header Type 2
1585.Pq Cm rthdr2 ,
1586Destination options
1587.Pq Cm dstopt ,
1588IPSec authentication headers
1589.Pq Cm ah ,
1590and IPsec encapsulated security payload headers
1591.Pq Cm esp .
1592.It Cm fib Ar fibnum
1593Matches a packet that has been tagged to use
1594the given FIB (routing table) number.
1595.It Cm flow Ar table Ns Pq Ar name Ns Op , Ns Ar value
1596Search for the flow entry in lookup table
1597.Ar name .
1598If not found, the match fails.
1599Otherwise, the match succeeds and
1600.Cm tablearg
1601is set to the value extracted from the table.
1602.Pp
1603This option can be useful to quickly dispatch traffic based on
1604certain packet fields.
1605See the
1606.Sx LOOKUP TABLES
1607section below for more information on lookup tables.
1608.It Cm flow-id Ar labels
1609Matches IPv6 packets containing any of the flow labels given in
1610.Ar labels .
1611.Ar labels
1612is a comma separated list of numeric flow labels.
1613.It Cm dst-mac Ar table Ns Pq Ar name Ns Op , Ns Ar value
1614Search for the destination MAC address entry in lookup table
1615.Ar name .
1616If not found, the match fails.
1617Otherwise, the match succeeds and
1618.Cm tablearg
1619is set to the value extracted from the table.
1620.It Cm src-mac Ar table Ns Pq Ar name Ns Op , Ns Ar value
1621Search for the source MAC address entry in lookup table
1622.Ar name .
1623If not found, the match fails.
1624Otherwise, the match succeeds and
1625.Cm tablearg
1626is set to the value extracted from the table.
1627.It Cm frag Ar spec
1628Matches IPv4 packets whose
1629.Cm ip_off
1630field contains the comma separated list of IPv4 fragmentation
1631options specified in
1632.Ar spec .
1633The recognized options are:
1634.Cm df
1635.Pq Dv don't fragment ,
1636.Cm mf
1637.Pq Dv more fragments ,
1638.Cm rf
1639.Pq Dv reserved fragment bit
1640.Cm offset
1641.Pq Dv non-zero fragment offset .
1642The absence of a particular options may be denoted
1643with a
1644.Ql \&! .
1645.Pp
1646Empty list of options defaults to matching on non-zero fragment offset.
1647Such rule would match all not the first fragment datagrams,
1648both IPv4 and IPv6.
1649This is a backward compatibility with older rulesets.
1650.It Cm gid Ar group
1651Matches all TCP or UDP packets sent by or received for a
1652.Ar group .
1653A
1654.Ar group
1655may be specified by name or number.
1656.It Cm jail Ar jail
1657Matches all TCP or UDP packets sent by or received for the
1658jail whose ID or name is
1659.Ar jail .
1660.It Cm icmptypes Ar types
1661Matches ICMP packets whose ICMP type is in the list
1662.Ar types .
1663The list may be specified as any combination of
1664individual types (numeric) separated by commas.
1665.Em Ranges are not allowed .
1666The supported ICMP types are:
1667.Pp
1668echo reply
1669.Pq Cm 0 ,
1670destination unreachable
1671.Pq Cm 3 ,
1672source quench
1673.Pq Cm 4 ,
1674redirect
1675.Pq Cm 5 ,
1676echo request
1677.Pq Cm 8 ,
1678router advertisement
1679.Pq Cm 9 ,
1680router solicitation
1681.Pq Cm 10 ,
1682time-to-live exceeded
1683.Pq Cm 11 ,
1684IP header bad
1685.Pq Cm 12 ,
1686timestamp request
1687.Pq Cm 13 ,
1688timestamp reply
1689.Pq Cm 14 ,
1690information request
1691.Pq Cm 15 ,
1692information reply
1693.Pq Cm 16 ,
1694address mask request
1695.Pq Cm 17
1696and address mask reply
1697.Pq Cm 18 .
1698.It Cm icmp6types Ar types
1699Matches ICMP6 packets whose ICMP6 type is in the list of
1700.Ar types .
1701The list may be specified as any combination of
1702individual types (numeric) separated by commas.
1703.Em Ranges are not allowed .
1704.It Cm in | out
1705Matches incoming or outgoing packets, respectively.
1706.Cm in
1707and
1708.Cm out
1709are mutually exclusive (in fact,
1710.Cm out
1711is implemented as
1712.Cm not in Ns No ).
1713.It Cm ipid Ar id-list
1714Matches IPv4 packets whose
1715.Cm ip_id
1716field has value included in
1717.Ar id-list ,
1718which is either a single value or a list of values or ranges
1719specified in the same way as
1720.Ar ports .
1721.It Cm iplen Ar len-list
1722Matches IP packets whose total length, including header and data, is
1723in the set
1724.Ar len-list ,
1725which is either a single value or a list of values or ranges
1726specified in the same way as
1727.Ar ports .
1728.It Cm ipoptions Ar spec
1729Matches packets whose IPv4 header contains the comma separated list of
1730options specified in
1731.Ar spec .
1732The supported IP options are:
1733.Pp
1734.Cm ssrr
1735(strict source route),
1736.Cm lsrr
1737(loose source route),
1738.Cm rr
1739(record packet route) and
1740.Cm ts
1741(timestamp).
1742The absence of a particular option may be denoted
1743with a
1744.Ql \&! .
1745.It Cm ipprecedence Ar precedence
1746Matches IPv4 packets whose precedence field is equal to
1747.Ar precedence .
1748.It Cm ipsec
1749Matches packets that have IPSEC history associated with them
1750(i.e., the packet comes encapsulated in IPSEC, the kernel
1751has IPSEC support, and can correctly decapsulate it).
1752.Pp
1753Note that specifying
1754.Cm ipsec
1755is different from specifying
1756.Cm proto Ar ipsec
1757as the latter will only look at the specific IP protocol field,
1758irrespective of IPSEC kernel support and the validity of the IPSEC data.
1759.Pp
1760Further note that this flag is silently ignored in kernels without
1761IPSEC support.
1762It does not affect rule processing when given and the
1763rules are handled as if with no
1764.Cm ipsec
1765flag.
1766.It Cm iptos Ar spec
1767Matches IPv4 packets whose
1768.Cm tos
1769field contains the comma separated list of
1770service types specified in
1771.Ar spec .
1772The supported IP types of service are:
1773.Pp
1774.Cm lowdelay
1775.Pq Dv IPTOS_LOWDELAY ,
1776.Cm throughput
1777.Pq Dv IPTOS_THROUGHPUT ,
1778.Cm reliability
1779.Pq Dv IPTOS_RELIABILITY ,
1780.Cm mincost
1781.Pq Dv IPTOS_MINCOST ,
1782.Cm congestion
1783.Pq Dv IPTOS_ECN_CE .
1784The absence of a particular type may be denoted
1785with a
1786.Ql \&! .
1787.It Cm dscp spec Ns Op , Ns Ar spec
1788Matches IPv4/IPv6 packets whose
1789.Cm DS
1790field value is contained in
1791.Ar spec
1792mask.
1793Multiple values can be specified via
1794the comma separated list.
1795Value can be one of keywords used in
1796.Cm setdscp
1797action or exact number.
1798.It Cm ipttl Ar ttl-list
1799Matches IPv4 packets whose time to live is included in
1800.Ar ttl-list ,
1801which is either a single value or a list of values or ranges
1802specified in the same way as
1803.Ar ports .
1804.It Cm ipversion Ar ver
1805Matches IP packets whose IP version field is
1806.Ar ver .
1807.It Cm keep-state Op Ar :flowname
1808Upon a match, the firewall will create a dynamic rule, whose
1809default behaviour is to match bidirectional traffic between
1810source and destination IP/port using the same protocol.
1811The rule has a limited lifetime (controlled by a set of
1812.Xr sysctl 8
1813variables), and the lifetime is refreshed every time a matching
1814packet is found.
1815The
1816.Ar :flowname
1817is used to assign additional to addresses, ports and protocol parameter
1818to dynamic rule.
1819It can be used for more accurate matching by
1820.Cm check-state
1821rule.
1822The
1823.Cm :default
1824keyword is special name used for compatibility with old rulesets.
1825.It Cm layer2
1826Matches only layer2 packets, i.e., those passed to
1827.Nm
1828from
1829.Fn ether_demux
1830and
1831.Fn ether_output_frame .
1832.It Cm limit Bro Cm src-addr | src-port | dst-addr | dst-port Brc Ar N Op Ar :flowname
1833The firewall will only allow
1834.Ar N
1835connections with the same
1836set of parameters as specified in the rule.
1837One or more
1838of source and destination addresses and ports can be
1839specified.
1840.It Cm lookup Bro Cm dst-ip | dst-port | dst-mac | src-ip | src-port | src-mac | uid | jail Brc Ar name
1841Search an entry in lookup table
1842.Ar name
1843that matches the field specified as argument.
1844If not found, the match fails.
1845Otherwise, the match succeeds and
1846.Cm tablearg
1847is set to the value extracted from the table.
1848.Pp
1849This option can be useful to quickly dispatch traffic based on
1850certain packet fields.
1851See the
1852.Sx LOOKUP TABLES
1853section below for more information on lookup tables.
1854.It Cm { MAC | mac } Ar dst-mac src-mac
1855Match packets with a given
1856.Ar dst-mac
1857and
1858.Ar src-mac
1859addresses, specified as the
1860.Cm any
1861keyword (matching any MAC address), or six groups of hex digits
1862separated by colons,
1863and optionally followed by a mask indicating the significant bits.
1864The mask may be specified using either of the following methods:
1865.Bl -enum -width indent
1866.It
1867A slash
1868.Pq /
1869followed by the number of significant bits.
1870For example, an address with 33 significant bits could be specified as:
1871.Pp
1872.Dl "MAC 10:20:30:40:50:60/33 any"
1873.It
1874An ampersand
1875.Pq &
1876followed by a bitmask specified as six groups of hex digits separated
1877by colons.
1878For example, an address in which the last 16 bits are significant could
1879be specified as:
1880.Pp
1881.Dl "MAC 10:20:30:40:50:60&00:00:00:00:ff:ff any"
1882.Pp
1883Note that the ampersand character has a special meaning in many shells
1884and should generally be escaped.
1885.El
1886Note that the order of MAC addresses (destination first,
1887source second) is
1888the same as on the wire, but the opposite of the one used for
1889IP addresses.
1890.It Cm mac-type Ar mac-type
1891Matches packets whose Ethernet Type field
1892corresponds to one of those specified as argument.
1893.Ar mac-type
1894is specified in the same way as
1895.Cm port numbers
1896(i.e., one or more comma-separated single values or ranges).
1897You can use symbolic names for known values such as
1898.Em vlan , ipv4, ipv6 .
1899Values can be entered as decimal or hexadecimal (if prefixed by 0x),
1900and they are always printed as hexadecimal (unless the
1901.Cm -N
1902option is used, in which case symbolic resolution will be attempted).
1903.It Cm proto Ar protocol
1904Matches packets with the corresponding IP protocol.
1905.It Cm record-state
1906Upon a match, the firewall will create a dynamic rule as if
1907.Cm keep-state
1908was specified.
1909However, this option doesn't imply an implicit
1910.Cm check-state
1911in contrast to
1912.Cm keep-state .
1913.It Cm recv | xmit | via Brq Ar ifX | Ar if Ns Cm * | Ar table Ns Po Ar name Ns Oo , Ns Ar value Oc Pc | Ar ipno | Ar any
1914Matches packets received, transmitted or going through,
1915respectively, the interface specified by exact name
1916.Po Ar ifX Pc ,
1917by device name
1918.Po Ar if* Pc ,
1919by IP address, or through some interface.
1920Table
1921.Ar name
1922may be used to match interface by its kernel ifindex.
1923See the
1924.Sx LOOKUP TABLES
1925section below for more information on lookup tables.
1926.Pp
1927The
1928.Cm via
1929keyword causes the interface to always be checked.
1930If
1931.Cm recv
1932or
1933.Cm xmit
1934is used instead of
1935.Cm via ,
1936then only the receive or transmit interface (respectively)
1937is checked.
1938By specifying both, it is possible to match packets based on
1939both receive and transmit interface, e.g.:
1940.Pp
1941.Dl "ipfw add deny ip from any to any out recv ed0 xmit ed1"
1942.Pp
1943The
1944.Cm recv
1945interface can be tested on either incoming or outgoing packets,
1946while the
1947.Cm xmit
1948interface can only be tested on outgoing packets.
1949So
1950.Cm out
1951is required (and
1952.Cm in
1953is invalid) whenever
1954.Cm xmit
1955is used.
1956.Pp
1957A packet might not have a receive or transmit interface: packets
1958originating from the local host have no receive interface,
1959while packets destined for the local host have no transmit
1960interface.
1961.It Cm set-limit Bro Cm src-addr | src-port | dst-addr | dst-port Brc Ar N
1962Works like
1963.Cm limit
1964but does not have an implicit
1965.Cm check-state
1966attached to it.
1967.It Cm setup
1968Matches TCP packets that have the SYN bit set but no ACK bit.
1969This is the short form of
1970.Dq Li tcpflags\ syn,!ack .
1971.It Cm sockarg
1972Matches packets that are associated to a local socket and
1973for which the SO_USER_COOKIE socket option has been set
1974to a non-zero value.
1975As a side effect, the value of the
1976option is made available as
1977.Cm tablearg
1978value, which in turn can be used as
1979.Cm skipto
1980or
1981.Cm pipe
1982number.
1983.It Cm src-ip Ar ip-address
1984Matches IPv4 packets whose source IP is one of the address(es)
1985specified as an argument.
1986.It Cm src-ip6 Ar ip6-address
1987Matches IPv6 packets whose source IP is one of the address(es)
1988specified as an argument.
1989.It Cm src-port Ar ports
1990Matches IP packets whose source port is one of the port(s)
1991specified as argument.
1992.It Cm tagged Ar tag-list
1993Matches packets whose tags are included in
1994.Ar tag-list ,
1995which is either a single value or a list of values or ranges
1996specified in the same way as
1997.Ar ports .
1998Tags can be applied to the packet using
1999.Cm tag
2000rule action parameter (see it's description for details on tags).
2001.It Cm tcpack Ar ack
2002TCP packets only.
2003Match if the TCP header acknowledgment number field is set to
2004.Ar ack .
2005.It Cm tcpdatalen Ar tcpdatalen-list
2006Matches TCP packets whose length of TCP data is
2007.Ar tcpdatalen-list ,
2008which is either a single value or a list of values or ranges
2009specified in the same way as
2010.Ar ports .
2011.It Cm tcpflags Ar spec
2012TCP packets only.
2013Match if the TCP header contains the comma separated list of
2014flags specified in
2015.Ar spec .
2016The supported TCP flags are:
2017.Pp
2018.Cm fin ,
2019.Cm syn ,
2020.Cm rst ,
2021.Cm psh ,
2022.Cm ack
2023and
2024.Cm urg .
2025The absence of a particular flag may be denoted
2026with a
2027.Ql \&! .
2028A rule which contains a
2029.Cm tcpflags
2030specification can never match a fragmented packet which has
2031a non-zero offset.
2032See the
2033.Cm frag
2034option for details on matching fragmented packets.
2035.It Cm tcpmss Ar tcpmss-list
2036Matches TCP packets whose MSS (maximum segment size) value is set to
2037.Ar tcpmss-list ,
2038which is either a single value or a list of values or ranges
2039specified in the same way as
2040.Ar ports .
2041.It Cm tcpseq Ar seq
2042TCP packets only.
2043Match if the TCP header sequence number field is set to
2044.Ar seq .
2045.It Cm tcpwin Ar tcpwin-list
2046Matches TCP packets whose  header window field is set to
2047.Ar tcpwin-list ,
2048which is either a single value or a list of values or ranges
2049specified in the same way as
2050.Ar ports .
2051.It Cm tcpoptions Ar spec
2052TCP packets only.
2053Match if the TCP header contains the comma separated list of
2054options specified in
2055.Ar spec .
2056The supported TCP options are:
2057.Pp
2058.Cm mss
2059(maximum segment size),
2060.Cm window
2061(tcp window advertisement),
2062.Cm sack
2063(selective ack),
2064.Cm ts
2065(rfc1323 timestamp) and
2066.Cm cc
2067(rfc1644 t/tcp connection count).
2068The absence of a particular option may be denoted
2069with a
2070.Ql \&! .
2071.It Cm uid Ar user
2072Match all TCP or UDP packets sent by or received for a
2073.Ar user .
2074A
2075.Ar user
2076may be matched by name or identification number.
2077.It Cm verrevpath
2078For incoming packets,
2079a routing table lookup is done on the packet's source address.
2080If the interface on which the packet entered the system matches the
2081outgoing interface for the route,
2082the packet matches.
2083If the interfaces do not match up,
2084the packet does not match.
2085All outgoing packets or packets with no incoming interface match.
2086.Pp
2087The name and functionality of the option is intentionally similar to
2088the Cisco IOS command:
2089.Pp
2090.Dl ip verify unicast reverse-path
2091.Pp
2092This option can be used to make anti-spoofing rules to reject all
2093packets with source addresses not from this interface.
2094See also the option
2095.Cm antispoof .
2096.It Cm versrcreach
2097For incoming packets,
2098a routing table lookup is done on the packet's source address.
2099If a route to the source address exists, but not the default route
2100or a blackhole/reject route, the packet matches.
2101Otherwise, the packet does not match.
2102All outgoing packets match.
2103.Pp
2104The name and functionality of the option is intentionally similar to
2105the Cisco IOS command:
2106.Pp
2107.Dl ip verify unicast source reachable-via any
2108.Pp
2109This option can be used to make anti-spoofing rules to reject all
2110packets whose source address is unreachable.
2111.It Cm antispoof
2112For incoming packets, the packet's source address is checked if it
2113belongs to a directly connected network.
2114If the network is directly connected, then the interface the packet
2115came on in is compared to the interface the network is connected to.
2116When incoming interface and directly connected interface are not the
2117same, the packet does not match.
2118Otherwise, the packet does match.
2119All outgoing packets match.
2120.Pp
2121This option can be used to make anti-spoofing rules to reject all
2122packets that pretend to be from a directly connected network but do
2123not come in through that interface.
2124This option is similar to but more restricted than
2125.Cm verrevpath
2126because it engages only on packets with source addresses of directly
2127connected networks instead of all source addresses.
2128.El
2129.Sh LOOKUP TABLES
2130Lookup tables are useful to handle large sparse sets of
2131addresses or other search keys (e.g., ports, jail IDs, interface names).
2132In the rest of this section we will use the term ``key''.
2133Table name needs to match the following spec:
2134.Ar table-name .
2135Tables with the same name can be created in different
2136.Ar sets .
2137However, rule links to the tables in
2138.Ar set 0
2139by default.
2140This behavior can be controlled by
2141.Va net.inet.ip.fw.tables_sets
2142variable.
2143See the
2144.Sx SETS OF RULES
2145section for more information.
2146There may be up to 65535 different lookup tables.
2147.Pp
2148The following table types are supported:
2149.Bl -tag -width indent
2150.It Ar table-type : Ar addr | iface | number | flow | mac
2151.It Ar table-key : Ar addr Ns Oo / Ns Ar masklen Oc | iface-name | number | flow-spec
2152.It Ar flow-spec : Ar flow-field Ns Op , Ns Ar flow-spec
2153.It Ar flow-field : src-ip | proto | src-port | dst-ip | dst-port
2154.It Cm addr
2155Matches IPv4 or IPv6 address.
2156Each entry is represented by an
2157.Ar addr Ns Op / Ns Ar masklen
2158and will match all addresses with base
2159.Ar addr
2160(specified as an IPv4/IPv6 address, or a hostname) and mask width of
2161.Ar masklen
2162bits.
2163If
2164.Ar masklen
2165is not specified, it defaults to 32 for IPv4 and 128 for IPv6.
2166When looking up an IP address in a table, the most specific
2167entry will match.
2168.It Cm iface
2169Matches interface names.
2170Each entry is represented by string treated as interface name.
2171Wildcards are not supported.
2172.It Cm number
2173Matches protocol ports, uids/gids or jail IDs.
2174Each entry is represented by 32-bit unsigned integer.
2175Ranges are not supported.
2176.It Cm flow
2177Matches packet fields specified by
2178.Ar flow
2179type suboptions with table entries.
2180.It Cm mac
2181Matches MAC address.
2182Each entry is represented by an
2183.Ar addr Ns Op / Ns Ar masklen
2184and will match all addresses with base
2185.Ar addr
2186and mask width of
2187.Ar masklen
2188bits.
2189If
2190.Ar masklen
2191is not specified, it defaults to 48.
2192When looking up an MAC address in a table, the most specific
2193entry will match.
2194.El
2195.Pp
2196Tables require explicit creation via
2197.Cm create
2198before use.
2199.Pp
2200The following creation options are supported:
2201.Bl -tag -width indent
2202.It Ar create-options : Ar create-option | create-options
2203.It Ar create-option : Cm type Ar table-type | Cm valtype Ar value-mask | Cm algo Ar algo-desc |
2204.Cm limit Ar number | Cm locked | Cm missing | Cm or-flush
2205.It Cm type
2206Table key type.
2207.It Cm valtype
2208Table value mask.
2209.It Cm algo
2210Table algorithm to use (see below).
2211.It Cm limit
2212Maximum number of items that may be inserted into table.
2213.It Cm locked
2214Restrict any table modifications.
2215.It Cm missing
2216Do not fail if table already exists and has exactly same options as new one.
2217.It Cm or-flush
2218Flush existing table with same name instead of returning error.
2219Implies
2220.Cm missing
2221so existing table must be compatible with new one.
2222.El
2223.Pp
2224Some of these options may be modified later via
2225.Cm modify
2226keyword.
2227The following options can be changed:
2228.Bl -tag -width indent
2229.It Ar modify-options : Ar modify-option | modify-options
2230.It Ar modify-option : Cm limit Ar number
2231.It Cm limit
2232Alter maximum number of items that may be inserted into table.
2233.El
2234.Pp
2235Additionally, table can be locked or unlocked using
2236.Cm lock
2237or
2238.Cm unlock
2239commands.
2240.Pp
2241Tables of the same
2242.Ar type
2243can be swapped with each other using
2244.Cm swap Ar name
2245command.
2246Swap may fail if tables limits are set and data exchange
2247would result in limits hit.
2248Operation is performed atomically.
2249.Pp
2250One or more entries can be added to a table at once using
2251.Cm add
2252command.
2253Addition of all items are performed atomically.
2254By default, error in addition of one entry does not influence
2255addition of other entries.
2256However, non-zero error code is returned in that case.
2257Special
2258.Cm atomic
2259keyword may be specified before
2260.Cm add
2261to indicate all-or-none add request.
2262.Pp
2263One or more entries can be removed from a table at once using
2264.Cm delete
2265command.
2266By default, error in removal of one entry does not influence
2267removing of other entries.
2268However, non-zero error code is returned in that case.
2269.Pp
2270It may be possible to check what entry will be found on particular
2271.Ar table-key
2272using
2273.Cm lookup
2274.Ar table-key
2275command.
2276This functionality is optional and may be unsupported in some algorithms.
2277.Pp
2278The following operations can be performed on
2279.Ar one
2280or
2281.Cm all
2282tables:
2283.Bl -tag -width indent
2284.It Cm list
2285List all entries.
2286.It Cm flush
2287Removes all entries.
2288.It Cm info
2289Shows generic table information.
2290.It Cm detail
2291Shows generic table information and algo-specific data.
2292.El
2293.Pp
2294The following lookup algorithms are supported:
2295.Bl -tag -width indent
2296.It Ar algo-desc : algo-name | "algo-name algo-data"
2297.It Ar algo-name : Ar addr: radix | addr: hash | iface: array | number: array | flow: hash | mac: radix
2298.It Cm addr: radix
2299Separate Radix trees for IPv4 and IPv6, the same way as the routing table (see
2300.Xr route 4 ) .
2301Default choice for
2302.Ar addr
2303type.
2304.It Cm addr:hash
2305Separate auto-growing hashes for IPv4 and IPv6.
2306Accepts entries with the same mask length specified initially via
2307.Cm "addr:hash masks=/v4,/v6"
2308algorithm creation options.
2309Assume /32 and /128 masks by default.
2310Search removes host bits (according to mask) from supplied address and checks
2311resulting key in appropriate hash.
2312Mostly optimized for /64 and byte-ranged IPv6 masks.
2313.It Cm iface:array
2314Array storing sorted indexes for entries which are presented in the system.
2315Optimized for very fast lookup.
2316.It Cm number:array
2317Array storing sorted u32 numbers.
2318.It Cm flow:hash
2319Auto-growing hash storing flow entries.
2320Search calculates hash on required packet fields and searches for matching
2321entries in selected bucket.
2322.It Cm mac: radix
2323Radix tree for MAC address
2324.El
2325.Pp
2326The
2327.Cm tablearg
2328feature provides the ability to use a value, looked up in the table, as
2329the argument for a rule action, action parameter or rule option.
2330This can significantly reduce number of rules in some configurations.
2331If two tables are used in a rule, the result of the second (destination)
2332is used.
2333.Pp
2334Each record may hold one or more values according to
2335.Ar value-mask .
2336This mask is set on table creation via
2337.Cm valtype
2338option.
2339The following value types are supported:
2340.Bl -tag -width indent
2341.It Ar value-mask : Ar value-type Ns Op , Ns Ar value-mask
2342.It Ar value-type : Ar skipto | pipe | fib | nat | dscp | tag | divert |
2343.Ar netgraph | limit | ipv4
2344.It Cm skipto
2345rule number to jump to.
2346.It Cm pipe
2347Pipe number to use.
2348.It Cm fib
2349fib number to match/set.
2350.It Cm nat
2351nat number to jump to.
2352.It Cm dscp
2353dscp value to match/set.
2354.It Cm tag
2355tag number to match/set.
2356.It Cm divert
2357port number to divert traffic to.
2358.It Cm netgraph
2359hook number to move packet to.
2360.It Cm limit
2361maximum number of connections.
2362.It Cm ipv4
2363IPv4 nexthop to fwd packets to.
2364.It Cm ipv6
2365IPv6 nexthop to fwd packets to.
2366.El
2367.Pp
2368The
2369.Cm tablearg
2370argument can be used with the following actions:
2371.Cm nat, pipe, queue, divert, tee, netgraph, ngtee, fwd, skipto, setfib ,
2372action parameters:
2373.Cm tag, untag ,
2374rule options:
2375.Cm limit, tagged .
2376.Pp
2377When used with the
2378.Cm skipto
2379action, the user should be aware that the code will walk the ruleset
2380up to a rule equal to, or past, the given number.
2381.Pp
2382See the
2383.Sx EXAMPLES
2384Section for example usage of tables and the tablearg keyword.
2385.Sh SETS OF RULES
2386Each rule or table belongs to one of 32 different
2387.Em sets
2388, numbered 0 to 31.
2389Set 31 is reserved for the default rule.
2390.Pp
2391By default, rules or tables are put in set 0, unless you use the
2392.Cm set N
2393attribute when adding a new rule or table.
2394Sets can be individually and atomically enabled or disabled,
2395so this mechanism permits an easy way to store multiple configurations
2396of the firewall and quickly (and atomically) switch between them.
2397.Pp
2398By default, tables from set 0 are referenced when adding rule with
2399table opcodes regardless of rule set.
2400This behavior can be changed by setting
2401.Va net.inet.ip.fw.tables_sets
2402variable to 1.
2403Rule's set will then be used for table references.
2404.Pp
2405The command to enable/disable sets is
2406.Bd -ragged -offset indent
2407.Nm
2408.Cm set Oo Cm disable Ar number ... Oc Op Cm enable Ar number ...
2409.Ed
2410.Pp
2411where multiple
2412.Cm enable
2413or
2414.Cm disable
2415sections can be specified.
2416Command execution is atomic on all the sets specified in the command.
2417By default, all sets are enabled.
2418.Pp
2419When you disable a set, its rules behave as if they do not exist
2420in the firewall configuration, with only one exception:
2421.Bd -ragged -offset indent
2422dynamic rules created from a rule before it had been disabled
2423will still be active until they expire.
2424In order to delete
2425dynamic rules you have to explicitly delete the parent rule
2426which generated them.
2427.Ed
2428.Pp
2429The set number of rules can be changed with the command
2430.Bd -ragged -offset indent
2431.Nm
2432.Cm set move
2433.Brq Cm rule Ar rule-number | old-set
2434.Cm to Ar new-set
2435.Ed
2436.Pp
2437Also, you can atomically swap two rulesets with the command
2438.Bd -ragged -offset indent
2439.Nm
2440.Cm set swap Ar first-set second-set
2441.Ed
2442.Pp
2443See the
2444.Sx EXAMPLES
2445Section on some possible uses of sets of rules.
2446.Sh STATEFUL FIREWALL
2447Stateful operation is a way for the firewall to dynamically
2448create rules for specific flows when packets that
2449match a given pattern are detected.
2450Support for stateful
2451operation comes through the
2452.Cm check-state , keep-state , record-state , limit
2453and
2454.Cm set-limit
2455options of
2456.Nm rules .
2457.Pp
2458Dynamic rules are created when a packet matches a
2459.Cm keep-state ,
2460.Cm record-state ,
2461.Cm limit
2462or
2463.Cm set-limit
2464rule, causing the creation of a
2465.Em dynamic
2466rule which will match all and only packets with
2467a given
2468.Em protocol
2469between a
2470.Em src-ip/src-port dst-ip/dst-port
2471pair of addresses
2472.Em ( src
2473and
2474.Em dst
2475are used here only to denote the initial match addresses, but they
2476are completely equivalent afterwards).
2477Rules created by
2478.Cm keep-state
2479option also have a
2480.Ar :flowname
2481taken from it.
2482This name is used in matching together with addresses, ports and protocol.
2483Dynamic rules will be checked at the first
2484.Cm check-state, keep-state
2485or
2486.Cm limit
2487occurrence, and the action performed upon a match will be the same
2488as in the parent rule.
2489.Pp
2490Note that no additional attributes other than protocol and IP addresses
2491and ports and :flowname are checked on dynamic rules.
2492.Pp
2493The typical use of dynamic rules is to keep a closed firewall configuration,
2494but let the first TCP SYN packet from the inside network install a
2495dynamic rule for the flow so that packets belonging to that session
2496will be allowed through the firewall:
2497.Pp
2498.Dl "ipfw add check-state :OUTBOUND"
2499.Dl "ipfw add allow tcp from my-subnet to any setup keep-state :OUTBOUND"
2500.Dl "ipfw add deny tcp from any to any"
2501.Pp
2502A similar approach can be used for UDP, where an UDP packet coming
2503from the inside will install a dynamic rule to let the response through
2504the firewall:
2505.Pp
2506.Dl "ipfw add check-state :OUTBOUND"
2507.Dl "ipfw add allow udp from my-subnet to any keep-state :OUTBOUND"
2508.Dl "ipfw add deny udp from any to any"
2509.Pp
2510Dynamic rules expire after some time, which depends on the status
2511of the flow and the setting of some
2512.Cm sysctl
2513variables.
2514See Section
2515.Sx SYSCTL VARIABLES
2516for more details.
2517For TCP sessions, dynamic rules can be instructed to periodically
2518send keepalive packets to refresh the state of the rule when it is
2519about to expire.
2520.Pp
2521See Section
2522.Sx EXAMPLES
2523for more examples on how to use dynamic rules.
2524.Sh TRAFFIC SHAPER (DUMMYNET) CONFIGURATION
2525.Nm
2526is also the user interface for the
2527.Nm dummynet
2528traffic shaper, packet scheduler and network emulator, a subsystem that
2529can artificially queue, delay or drop packets
2530emulating the behaviour of certain network links
2531or queueing systems.
2532.Pp
2533.Nm dummynet
2534operates by first using the firewall to select packets
2535using any match pattern that can be used in
2536.Nm
2537rules.
2538Matching packets are then passed to either of two
2539different objects, which implement the traffic regulation:
2540.Bl -hang -offset XXXX
2541.It Em pipe
2542A
2543.Em pipe
2544emulates a
2545.Em link
2546with given bandwidth and propagation delay,
2547driven by a FIFO scheduler and a single queue with programmable
2548queue size and packet loss rate.
2549Packets are appended to the queue as they come out from
2550.Nm ipfw ,
2551and then transferred in FIFO order to the link at the desired rate.
2552.It Em queue
2553A
2554.Em queue
2555is an abstraction used to implement packet scheduling
2556using one of several packet scheduling algorithms.
2557Packets sent to a
2558.Em queue
2559are first grouped into flows according to a mask on the 5-tuple.
2560Flows are then passed to the scheduler associated to the
2561.Em queue ,
2562and each flow uses scheduling parameters (weight and others)
2563as configured in the
2564.Em queue
2565itself.
2566A scheduler in turn is connected to an emulated link,
2567and arbitrates the link's bandwidth among backlogged flows according to
2568weights and to the features of the scheduling algorithm in use.
2569.El
2570.Pp
2571In practice,
2572.Em pipes
2573can be used to set hard limits to the bandwidth that a flow can use, whereas
2574.Em queues
2575can be used to determine how different flows share the available bandwidth.
2576.Pp
2577A graphical representation of the binding of queues,
2578flows, schedulers and links is below.
2579.Bd -literal -offset indent
2580                 (flow_mask|sched_mask)  sched_mask
2581         +---------+   weight Wx  +-------------+
2582         |         |->-[flow]-->--|             |-+
2583    -->--| QUEUE x |   ...        |             | |
2584         |         |->-[flow]-->--| SCHEDuler N | |
2585         +---------+              |             | |
2586             ...                  |             +--[LINK N]-->--
2587         +---------+   weight Wy  |             | +--[LINK N]-->--
2588         |         |->-[flow]-->--|             | |
2589    -->--| QUEUE y |   ...        |             | |
2590         |         |->-[flow]-->--|             | |
2591         +---------+              +-------------+ |
2592                                    +-------------+
2593.Ed
2594It is important to understand the role of the SCHED_MASK
2595and FLOW_MASK, which are configured through the commands
2596.Dl "ipfw sched N config mask SCHED_MASK ..."
2597and
2598.Dl "ipfw queue X config mask FLOW_MASK ..." .
2599.Pp
2600The SCHED_MASK is used to assign flows to one or more
2601scheduler instances, one for each
2602value of the packet's 5-tuple after applying SCHED_MASK.
2603As an example, using ``src-ip 0xffffff00'' creates one instance
2604for each /24 destination subnet.
2605.Pp
2606The FLOW_MASK, together with the SCHED_MASK, is used to split
2607packets into flows.
2608As an example, using
2609``src-ip 0x000000ff''
2610together with the previous SCHED_MASK makes a flow for
2611each individual source address.
2612In turn, flows for each /24
2613subnet will be sent to the same scheduler instance.
2614.Pp
2615The above diagram holds even for the
2616.Em pipe
2617case, with the only restriction that a
2618.Em pipe
2619only supports a SCHED_MASK, and forces the use of a FIFO
2620scheduler (these are for backward compatibility reasons;
2621in fact, internally, a
2622.Nm dummynet's
2623pipe is implemented exactly as above).
2624.Pp
2625There are two modes of
2626.Nm dummynet
2627operation:
2628.Dq normal
2629and
2630.Dq fast .
2631The
2632.Dq normal
2633mode tries to emulate a real link: the
2634.Nm dummynet
2635scheduler ensures that the packet will not leave the pipe faster than it
2636would on the real link with a given bandwidth.
2637The
2638.Dq fast
2639mode allows certain packets to bypass the
2640.Nm dummynet
2641scheduler (if packet flow does not exceed pipe's bandwidth).
2642This is the reason why the
2643.Dq fast
2644mode requires less CPU cycles per packet (on average) and packet latency
2645can be significantly lower in comparison to a real link with the same
2646bandwidth.
2647The default mode is
2648.Dq normal .
2649The
2650.Dq fast
2651mode can be enabled by setting the
2652.Va net.inet.ip.dummynet.io_fast
2653.Xr sysctl 8
2654variable to a non-zero value.
2655.Ss PIPE, QUEUE AND SCHEDULER CONFIGURATION
2656The
2657.Em pipe ,
2658.Em queue
2659and
2660.Em scheduler
2661configuration commands are the following:
2662.Bd -ragged -offset indent
2663.Cm pipe Ar number Cm config Ar pipe-configuration
2664.Pp
2665.Cm queue Ar number Cm config Ar queue-configuration
2666.Pp
2667.Cm sched Ar number Cm config Ar sched-configuration
2668.Ed
2669.Pp
2670The following parameters can be configured for a pipe:
2671.Pp
2672.Bl -tag -width indent -compact
2673.It Cm bw Ar bandwidth | device
2674Bandwidth, measured in
2675.Sm off
2676.Op Cm K | M | G
2677.Brq Cm bit/s | Byte/s .
2678.Sm on
2679.Pp
2680A value of 0 (default) means unlimited bandwidth.
2681The unit must immediately follow the number, as in
2682.Pp
2683.Dl "dnctl pipe 1 config bw 300Kbit/s"
2684.Pp
2685If a device name is specified instead of a numeric value, as in
2686.Pp
2687.Dl "dnctl pipe 1 config bw tun0"
2688.Pp
2689then the transmit clock is supplied by the specified device.
2690At the moment only the
2691.Xr tun 4
2692device supports this
2693functionality, for use in conjunction with
2694.Xr ppp 8 .
2695.Pp
2696.It Cm delay Ar ms-delay
2697Propagation delay, measured in milliseconds.
2698The value is rounded to the next multiple of the clock tick
2699(typically 10ms, but it is a good practice to run kernels
2700with
2701.Dq "options HZ=1000"
2702to reduce
2703the granularity to 1ms or less).
2704The default value is 0, meaning no delay.
2705.Pp
2706.It Cm burst Ar size
2707If the data to be sent exceeds the pipe's bandwidth limit
2708(and the pipe was previously idle), up to
2709.Ar size
2710bytes of data are allowed to bypass the
2711.Nm dummynet
2712scheduler, and will be sent as fast as the physical link allows.
2713Any additional data will be transmitted at the rate specified
2714by the
2715.Nm pipe
2716bandwidth.
2717The burst size depends on how long the pipe has been idle;
2718the effective burst size is calculated as follows:
2719MAX(
2720.Ar size
2721,
2722.Nm bw
2723* pipe_idle_time).
2724.Pp
2725.It Cm profile Ar filename
2726A file specifying the additional overhead incurred in the transmission
2727of a packet on the link.
2728.Pp
2729Some link types introduce extra delays in the transmission
2730of a packet, e.g., because of MAC level framing, contention on
2731the use of the channel, MAC level retransmissions and so on.
2732From our point of view, the channel is effectively unavailable
2733for this extra time, which is constant or variable depending
2734on the link type.
2735Additionally, packets may be dropped after this
2736time (e.g., on a wireless link after too many retransmissions).
2737We can model the additional delay with an empirical curve
2738that represents its distribution.
2739.Bd -literal -offset indent
2740      cumulative probability
2741      1.0 ^
2742          |
2743      L   +-- loss-level          x
2744          |                 ******
2745          |                *
2746          |           *****
2747          |          *
2748          |        **
2749          |       *
2750          +-------*------------------->
2751                      delay
2752.Ed
2753The empirical curve may have both vertical and horizontal lines.
2754Vertical lines represent constant delay for a range of
2755probabilities.
2756Horizontal lines correspond to a discontinuity in the delay
2757distribution: the pipe will use the largest delay for a
2758given probability.
2759.Pp
2760The file format is the following, with whitespace acting as
2761a separator and '#' indicating the beginning a comment:
2762.Bl -tag -width indent
2763.It Cm name Ar identifier
2764optional name (listed by "dnctl pipe show")
2765to identify the delay distribution;
2766.It Cm bw Ar value
2767the bandwidth used for the pipe.
2768If not specified here, it must be present
2769explicitly as a configuration parameter for the pipe;
2770.It Cm loss-level Ar L
2771the probability above which packets are lost.
2772(0.0 <= L <= 1.0, default 1.0 i.e., no loss);
2773.It Cm samples Ar N
2774the number of samples used in the internal
2775representation of the curve (2..1024; default 100);
2776.It Cm "delay prob" | "prob delay"
2777One of these two lines is mandatory and defines
2778the format of the following lines with data points.
2779.It Ar XXX Ar YYY
27802 or more lines representing points in the curve,
2781with either delay or probability first, according
2782to the chosen format.
2783The unit for delay is milliseconds.
2784Data points do not need to be sorted.
2785Also, the number of actual lines can be different
2786from the value of the "samples" parameter:
2787.Nm
2788utility will sort and interpolate
2789the curve as needed.
2790.El
2791.Pp
2792Example of a profile file:
2793.Bd -literal -offset indent
2794name    bla_bla_bla
2795samples 100
2796loss-level    0.86
2797prob    delay
27980       200	# minimum overhead is 200ms
27990.5     200
28000.5     300
28010.8     1000
28020.9     1300
28031       1300
2804#configuration file end
2805.Ed
2806.El
2807.Pp
2808The following parameters can be configured for a queue:
2809.Pp
2810.Bl -tag -width indent -compact
2811.It Cm pipe Ar pipe_nr
2812Connects a queue to the specified pipe.
2813Multiple queues (with the same or different weights) can be connected to
2814the same pipe, which specifies the aggregate rate for the set of queues.
2815.Pp
2816.It Cm weight Ar weight
2817Specifies the weight to be used for flows matching this queue.
2818The weight must be in the range 1..100, and defaults to 1.
2819.El
2820.Pp
2821The following case-insensitive parameters can be configured for a
2822scheduler:
2823.Pp
2824.Bl -tag -width indent -compact
2825.It Cm type Ar {fifo | wf2q+ | rr | qfq | fq_codel | fq_pie}
2826specifies the scheduling algorithm to use.
2827.Bl -tag -width indent -compact
2828.It Cm fifo
2829is just a FIFO scheduler (which means that all packets
2830are stored in the same queue as they arrive to the scheduler).
2831FIFO has O(1) per-packet time complexity, with very low
2832constants (estimate 60-80ns on a 2GHz desktop machine)
2833but gives no service guarantees.
2834.It Cm wf2q+
2835implements the WF2Q+ algorithm, which is a Weighted Fair Queueing
2836algorithm which permits flows to share bandwidth according to
2837their weights.
2838Note that weights are not priorities; even a flow
2839with a minuscule weight will never starve.
2840WF2Q+ has O(log N) per-packet processing cost, where N is the number
2841of flows, and is the default algorithm used by previous versions
2842dummynet's queues.
2843.It Cm rr
2844implements the Deficit Round Robin algorithm, which has O(1) processing
2845costs (roughly, 100-150ns per packet)
2846and permits bandwidth allocation according to weights, but
2847with poor service guarantees.
2848.It Cm qfq
2849implements the QFQ algorithm, which is a very fast variant of
2850WF2Q+, with similar service guarantees and O(1) processing
2851costs (roughly, 200-250ns per packet).
2852.It Cm fq_codel
2853implements the FQ-CoDel (FlowQueue-CoDel) scheduler/AQM algorithm, which
2854uses a modified Deficit Round Robin scheduler to manage two lists of sub-queues
2855(old sub-queues and new sub-queues) for providing brief periods of priority to
2856lightweight or short burst flows.
2857By default, the total number of sub-queues is 1024.
2858FQ-CoDel's internal, dynamically
2859created sub-queues are controlled by separate instances of CoDel AQM.
2860.It Cm fq_pie
2861implements the FQ-PIE (FlowQueue-PIE) scheduler/AQM algorithm, which similar to
2862.Cm fq_codel
2863but uses per sub-queue PIE AQM instance to control the queue delay.
2864.El
2865.Pp
2866.Cm fq_codel
2867inherits AQM parameters and options from
2868.Cm codel
2869(see below), and
2870.Cm fq_pie
2871inherits AQM parameters and options from
2872.Cm pie
2873(see below).
2874Additionally, both of
2875.Cm fq_codel
2876and
2877.Cm fq_pie
2878have shared scheduler parameters which are:
2879.Bl -tag -width indent
2880.It Cm quantum
2881.Ar m
2882specifies the quantum (credit) of the scheduler.
2883.Ar m
2884is the number of bytes a queue can serve before being moved to the tail
2885of old queues list.
2886The default is 1514 bytes, and the maximum acceptable value
2887is 9000 bytes.
2888.It Cm limit
2889.Ar m
2890specifies the hard size limit (in unit of packets) of all queues managed by an
2891instance of the scheduler.
2892The default value of
2893.Ar m
2894is 10240 packets, and the maximum acceptable value is 20480 packets.
2895.It Cm flows
2896.Ar m
2897specifies the total number of flow queues (sub-queues) that fq_*
2898creates and manages.
2899By default, 1024 sub-queues are created when an instance
2900of the fq_{codel/pie} scheduler is created.
2901The maximum acceptable value is
290265536.
2903.El
2904.Pp
2905Note that any token after
2906.Cm fq_codel
2907or
2908.Cm fq_pie
2909is considered a parameter for fq_{codel/pie}.
2910So, ensure all scheduler
2911configuration options not related to fq_{codel/pie} are written before
2912.Cm fq_codel/fq_pie
2913tokens.
2914.El
2915.Pp
2916In addition to the type, all parameters allowed for a pipe can also
2917be specified for a scheduler.
2918.Pp
2919Finally, the following parameters can be configured for both
2920pipes and queues:
2921.Pp
2922.Bl -tag -width XXXX -compact
2923.It Cm buckets Ar hash-table-size
2924Specifies the size of the hash table used for storing the
2925various queues.
2926Default value is 64 controlled by the
2927.Xr sysctl 8
2928variable
2929.Va net.inet.ip.dummynet.hash_size ,
2930allowed range is 16 to 65536.
2931.Pp
2932.It Cm mask Ar mask-specifier
2933Packets sent to a given pipe or queue by an
2934.Nm
2935rule can be further classified into multiple flows, each of which is then
2936sent to a different
2937.Em dynamic
2938pipe or queue.
2939A flow identifier is constructed by masking the IP addresses,
2940ports and protocol types as specified with the
2941.Cm mask
2942options in the configuration of the pipe or queue.
2943For each different flow identifier, a new pipe or queue is created
2944with the same parameters as the original object, and matching packets
2945are sent to it.
2946.Pp
2947Thus, when
2948.Em dynamic pipes
2949are used, each flow will get the same bandwidth as defined by the pipe,
2950whereas when
2951.Em dynamic queues
2952are used, each flow will share the parent's pipe bandwidth evenly
2953with other flows generated by the same queue (note that other queues
2954with different weights might be connected to the same pipe).
2955.br
2956Available mask specifiers are a combination of one or more of the following:
2957.Pp
2958.Cm dst-ip Ar mask ,
2959.Cm dst-ip6 Ar mask ,
2960.Cm src-ip Ar mask ,
2961.Cm src-ip6 Ar mask ,
2962.Cm dst-port Ar mask ,
2963.Cm src-port Ar mask ,
2964.Cm flow-id Ar mask ,
2965.Cm proto Ar mask
2966or
2967.Cm all ,
2968.Pp
2969where the latter means all bits in all fields are significant.
2970.Pp
2971.It Cm noerror
2972When a packet is dropped by a
2973.Nm dummynet
2974queue or pipe, the error
2975is normally reported to the caller routine in the kernel, in the
2976same way as it happens when a device queue fills up.
2977Setting this
2978option reports the packet as successfully delivered, which can be
2979needed for some experimental setups where you want to simulate
2980loss or congestion at a remote router.
2981.Pp
2982.It Cm plr Ar packet-loss-rate
2983Packet loss rate.
2984Argument
2985.Ar packet-loss-rate
2986is a floating-point number between 0 and 1, with 0 meaning no
2987loss, 1 meaning 100% loss.
2988The loss rate is internally represented on 31 bits.
2989.Pp
2990.It Cm queue Brq Ar slots | size Ns Cm Kbytes
2991Queue size, in
2992.Ar slots
2993or
2994.Cm KBytes .
2995Default value is 50 slots, which
2996is the typical queue size for Ethernet devices.
2997Note that for slow speed links you should keep the queue
2998size short or your traffic might be affected by a significant
2999queueing delay.
3000E.g., 50 max-sized Ethernet packets (1500 bytes) mean 600Kbit
3001or 20s of queue on a 30Kbit/s pipe.
3002Even worse effects can result if you get packets from an
3003interface with a much larger MTU, e.g.\& the loopback interface
3004with its 16KB packets.
3005The
3006.Xr sysctl 8
3007variables
3008.Em net.inet.ip.dummynet.pipe_byte_limit
3009and
3010.Em net.inet.ip.dummynet.pipe_slot_limit
3011control the maximum lengths that can be specified.
3012.Pp
3013.It Cm red | gred Ar w_q Ns / Ns Ar min_th Ns / Ns Ar max_th Ns / Ns Ar max_p
3014[ecn]
3015Make use of the RED (Random Early Detection) queue management algorithm.
3016.Ar w_q
3017and
3018.Ar max_p
3019are floating
3020point numbers between 0 and 1 (inclusive), while
3021.Ar min_th
3022and
3023.Ar max_th
3024are integer numbers specifying thresholds for queue management
3025(thresholds are computed in bytes if the queue has been defined
3026in bytes, in slots otherwise).
3027The two parameters can also be of the same value if needed.
3028The
3029.Nm dummynet
3030also supports the gentle RED variant (gred) and ECN (Explicit Congestion
3031Notification) as optional.
3032Three
3033.Xr sysctl 8
3034variables can be used to control the RED behaviour:
3035.Bl -tag -width indent
3036.It Va net.inet.ip.dummynet.red_lookup_depth
3037specifies the accuracy in computing the average queue
3038when the link is idle (defaults to 256, must be greater than zero)
3039.It Va net.inet.ip.dummynet.red_avg_pkt_size
3040specifies the expected average packet size (defaults to 512, must be
3041greater than zero)
3042.It Va net.inet.ip.dummynet.red_max_pkt_size
3043specifies the expected maximum packet size, only used when queue
3044thresholds are in bytes (defaults to 1500, must be greater than zero).
3045.El
3046.Pp
3047.It Cm codel Oo Cm target Ar time Oc Oo Cm interval Ar time Oc Oo Cm ecn |
3048.Cm noecn Oc
3049Make use of the CoDel (Controlled-Delay) queue management algorithm.
3050.Ar time
3051is interpreted as milliseconds by default but seconds (s), milliseconds (ms) or
3052microseconds (us) can be specified instead.
3053CoDel drops or marks (ECN) packets
3054depending on packet sojourn time in the queue.
3055.Cm target
3056.Ar time
3057(5ms by default) is the minimum acceptable persistent queue delay that CoDel
3058allows.
3059CoDel does not drop packets directly after packets sojourn time becomes
3060higher than
3061.Cm target
3062.Ar time
3063but waits for
3064.Cm interval
3065.Ar time
3066(100ms default) before dropping.
3067.Cm interval
3068.Ar time
3069should be set to maximum RTT for all expected connections.
3070.Cm ecn
3071enables (disabled by default) packet marking (instead of dropping) for
3072ECN-enabled TCP flows when queue delay becomes high.
3073.Pp
3074Note that any token after
3075.Cm codel
3076is considered a parameter for CoDel.
3077So, ensure all pipe/queue
3078configuration options are written before
3079.Cm codel
3080token.
3081.Pp
3082The
3083.Xr sysctl 8
3084variables
3085.Va net.inet.ip.dummynet.codel.target
3086and
3087.Va net.inet.ip.dummynet.codel.interval
3088can be used to set CoDel default parameters.
3089.Pp
3090.It Cm pie Oo Cm target Ar time Oc Oo Cm tupdate Ar time Oc Oo
3091.Cm alpha Ar n Oc Oo Cm beta Ar n Oc Oo Cm max_burst Ar time Oc Oo
3092.Cm max_ecnth Ar n Oc Oo Cm ecn | Cm noecn Oc Oo Cm capdrop |
3093.Cm nocapdrop Oc Oo Cm drand | Cm nodrand Oc Oo Cm onoff
3094.Oc Oo Cm dre | Cm ts Oc
3095Make use of the PIE (Proportional Integral controller Enhanced) queue management
3096algorithm.
3097PIE drops or marks packets depending on a calculated drop probability during
3098en-queue process, with the aim of achieving high throughput while keeping queue
3099delay low.
3100At regular time intervals of
3101.Cm tupdate
3102.Ar time
3103(15ms by default) a background process (re)calculates the probability based on queue delay
3104deviations from
3105.Cm target
3106.Ar time
3107(15ms by default) and queue delay trends.
3108PIE approximates current queue
3109delay by using a departure rate estimation method, or (optionally) by using a
3110packet timestamp method similar to CoDel.
3111.Ar time
3112is interpreted as milliseconds by default but seconds (s), milliseconds (ms) or
3113microseconds (us) can be specified instead.
3114The other PIE parameters and options are as follows:
3115.Bl -tag -width indent
3116.It Cm alpha Ar n
3117.Ar n
3118is a floating point number between 0 and 7 which specifies the weight of queue
3119delay deviations that is used in drop probability calculation.
31200.125 is the default.
3121.It Cm beta Ar n
3122.Ar n
3123is a floating point number between 0 and 7 which specifies is the weight of queue
3124delay trend that is used in drop probability calculation.
31251.25 is the default.
3126.It Cm max_burst Ar time
3127The maximum period of time that PIE does not drop/mark packets.
3128150ms is the
3129default and 10s is the maximum value.
3130.It Cm max_ecnth Ar n
3131Even when ECN is enabled, PIE drops packets instead of marking them when drop
3132probability becomes higher than ECN probability threshold
3133.Cm max_ecnth Ar n
3134, the default is 0.1 (i.e 10%) and 1 is the maximum value.
3135.It Cm ecn | noecn
3136enable or disable ECN marking for ECN-enabled TCP flows.
3137Disabled by default.
3138.It Cm capdrop | nocapdrop
3139enable or disable cap drop adjustment.
3140Cap drop adjustment is enabled by default.
3141.It Cm drand | nodrand
3142enable or disable drop probability de-randomisation.
3143De-randomisation eliminates
3144the problem of dropping packets too close or too far.
3145De-randomisation is enabled by default.
3146.It Cm onoff
3147enable turning PIE on and off depending on queue load.
3148If this option is enabled,
3149PIE turns on when over 1/3 of queue becomes full.
3150This option is disabled by
3151default.
3152.It Cm dre | ts
3153Calculate queue delay using departure rate estimation
3154.Cm dre
3155or timestamps
3156.Cm ts .
3157.Cm dre
3158is used by default.
3159.El
3160.Pp
3161Note that any token after
3162.Cm pie
3163is considered a parameter for PIE.
3164So ensure all pipe/queue
3165the configuration options are written before
3166.Cm pie
3167token.
3168.Xr sysctl 8
3169variables can be used to control the
3170.Cm pie
3171default parameters.
3172See the
3173.Sx SYSCTL VARIABLES
3174section for more details.
3175.El
3176.Pp
3177When used with IPv6 data,
3178.Nm dummynet
3179currently has several limitations.
3180Information necessary to route link-local packets to an
3181interface is not available after processing by
3182.Nm dummynet
3183so those packets are dropped in the output path.
3184Care should be taken to ensure that link-local packets are not passed to
3185.Nm dummynet .
3186.Sh CHECKLIST
3187Here are some important points to consider when designing your
3188rules:
3189.Bl -bullet
3190.It
3191Remember that you filter both packets going
3192.Cm in
3193and
3194.Cm out .
3195Most connections need packets going in both directions.
3196.It
3197Remember to test very carefully.
3198It is a good idea to be near the console when doing this.
3199If you cannot be near the console,
3200use an auto-recovery script such as the one in
3201.Pa /usr/share/examples/ipfw/change_rules.sh .
3202.It
3203Do not forget the loopback interface.
3204.El
3205.Sh FINE POINTS
3206.Bl -bullet
3207.It
3208There are circumstances where fragmented datagrams are unconditionally
3209dropped.
3210TCP packets are dropped if they do not contain at least 20 bytes of
3211TCP header, UDP packets are dropped if they do not contain a full 8
3212byte UDP header, and ICMP packets are dropped if they do not contain
32134 bytes of ICMP header, enough to specify the ICMP type, code, and
3214checksum.
3215These packets are simply logged as
3216.Dq pullup failed
3217since there may not be enough good data in the packet to produce a
3218meaningful log entry.
3219.It
3220Another type of packet is unconditionally dropped, a TCP packet with a
3221fragment offset of one.
3222This is a valid packet, but it only has one use, to try
3223to circumvent firewalls.
3224When logging is enabled, these packets are
3225reported as being dropped by rule -1.
3226.It
3227If you are logged in over a network, loading the
3228.Xr kld 4
3229version of
3230.Nm
3231is probably not as straightforward as you would think.
3232The following command line is recommended:
3233.Bd -literal -offset indent
3234kldload ipfw && \e
3235ipfw add 32000 allow ip from any to any
3236.Ed
3237.Pp
3238Along the same lines, doing an
3239.Bd -literal -offset indent
3240ipfw flush
3241.Ed
3242.Pp
3243in similar surroundings is also a bad idea.
3244.It
3245The
3246.Nm
3247filter list may not be modified if the system security level
3248is set to 3 or higher
3249(see
3250.Xr init 8
3251for information on system security levels).
3252.El
3253.Sh PACKET DIVERSION
3254A
3255.Xr divert 4
3256socket bound to the specified port will receive all packets
3257diverted to that port.
3258If no socket is bound to the destination port, or if the divert module is
3259not loaded, or if the kernel was not compiled with divert socket support,
3260the packets are dropped.
3261.Sh NETWORK ADDRESS TRANSLATION (NAT)
3262.Nm
3263support in-kernel NAT using the kernel version of
3264.Xr libalias 3 .
3265The kernel module
3266.Cm ipfw_nat
3267should be loaded or kernel should have
3268.Cm options IPFIREWALL_NAT
3269to be able use NAT.
3270.Pp
3271The nat configuration command is the following:
3272.Bd -ragged -offset indent
3273.Bk -words
3274.Cm nat
3275.Ar nat_number
3276.Cm config
3277.Ar nat-configuration
3278.Ek
3279.Ed
3280.Pp
3281The following parameters can be configured:
3282.Bl -tag -width indent
3283.It Cm ip Ar ip_address
3284Define an ip address to use for aliasing.
3285.It Cm if Ar nic
3286Use ip address of NIC for aliasing, dynamically changing
3287it if NIC's ip address changes.
3288.It Cm log
3289Enable logging on this nat instance.
3290.It Cm deny_in
3291Deny any incoming connection from outside world.
3292.It Cm same_ports
3293Try to leave the alias port numbers unchanged from
3294the actual local port numbers.
3295.It Cm unreg_only
3296Traffic on the local network not originating from a RFC 1918
3297unregistered address spaces will be ignored.
3298.It Cm unreg_cgn
3299Like unreg_only, but includes the RFC 6598 (Carrier Grade NAT)
3300address range.
3301.It Cm reset
3302Reset table of the packet aliasing engine on address change.
3303.It Cm reverse
3304Reverse the way libalias handles aliasing.
3305.It Cm proxy_only
3306Obey transparent proxy rules only, packet aliasing is not performed.
3307.It Cm skip_global
3308Skip instance in case of global state lookup (see below).
3309.It Cm port_range Ar lower-upper
3310Set the aliasing ports between the ranges given. Upper port has to be greater
3311than lower.
3312.El
3313.Pp
3314Some special values can be supplied instead of
3315.Va nat_number
3316in nat rule actions:
3317.Bl -tag -width indent
3318.It Cm global
3319Looks up translation state in all configured nat instances.
3320If an entry is found, packet is aliased according to that entry.
3321If no entry was found in any of the instances, packet is passed unchanged,
3322and no new entry will be created.
3323See section
3324.Sx MULTIPLE INSTANCES
3325in
3326.Xr natd 8
3327for more information.
3328.It Cm tablearg
3329Uses argument supplied in lookup table.
3330See
3331.Sx LOOKUP TABLES
3332section below for more information on lookup tables.
3333.El
3334.Pp
3335To let the packet continue after being (de)aliased, set the sysctl variable
3336.Va net.inet.ip.fw.one_pass
3337to 0.
3338For more information about aliasing modes, refer to
3339.Xr libalias 3 .
3340See Section
3341.Sx EXAMPLES
3342for some examples of nat usage.
3343.Ss REDIRECT AND LSNAT SUPPORT IN IPFW
3344Redirect and LSNAT support follow closely the syntax used in
3345.Xr natd 8 .
3346See Section
3347.Sx EXAMPLES
3348for some examples on how to do redirect and lsnat.
3349.Ss SCTP NAT SUPPORT
3350SCTP nat can be configured in a similar manner to TCP through the
3351.Nm
3352command line tool.
3353The main difference is that
3354.Nm sctp nat
3355does not do port translation.
3356Since the local and global side ports will be the same,
3357there is no need to specify both.
3358Ports are redirected as follows:
3359.Bd -ragged -offset indent
3360.Bk -words
3361.Cm nat
3362.Ar nat_number
3363.Cm config if
3364.Ar nic
3365.Cm redirect_port sctp
3366.Ar ip_address [,addr_list] {[port | port-port] [,ports]}
3367.Ek
3368.Ed
3369.Pp
3370Most
3371.Nm sctp nat
3372configuration can be done in real-time through the
3373.Xr sysctl 8
3374interface.
3375All may be changed dynamically, though the hash_table size will only
3376change for new
3377.Nm nat
3378instances.
3379See
3380.Sx SYSCTL VARIABLES
3381for more info.
3382.Sh IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
3383.Ss Stateful translation
3384.Nm
3385supports in-kernel IPv6/IPv4 network address and protocol translation.
3386Stateful NAT64 translation allows IPv6-only clients to contact IPv4 servers
3387using unicast TCP, UDP or ICMP protocols.
3388One or more IPv4 addresses assigned to a stateful NAT64 translator are shared
3389among several IPv6-only clients.
3390When stateful NAT64 is used in conjunction with DNS64, no changes are usually
3391required in the IPv6 client or the IPv4 server.
3392The kernel module
3393.Cm ipfw_nat64
3394should be loaded or kernel should have
3395.Cm options IPFIREWALL_NAT64
3396to be able use stateful NAT64 translator.
3397.Pp
3398Stateful NAT64 uses a bunch of memory for several types of objects.
3399When IPv6 client initiates connection, NAT64 translator creates a host entry
3400in the states table.
3401Each host entry uses preallocated IPv4 alias entry.
3402Each alias entry has a number of ports group entries allocated on demand.
3403Ports group entries contains connection state entries.
3404There are several options to control limits and lifetime for these objects.
3405.Pp
3406NAT64 translator follows RFC7915 when does ICMPv6/ICMP translation,
3407unsupported message types will be silently dropped.
3408IPv6 needs several ICMPv6 message types to be explicitly allowed for correct
3409operation.
3410Make sure that ND6 neighbor solicitation (ICMPv6 type 135) and neighbor
3411advertisement (ICMPv6 type 136) messages will not be handled by translation
3412rules.
3413.Pp
3414After translation NAT64 translator by default sends packets through
3415corresponding netisr queue.
3416Thus translator host should be configured as IPv4 and IPv6 router.
3417Also this means, that a packet is handled by firewall twice.
3418First time an original packet is handled and consumed by translator,
3419and then it is handled again as translated packet.
3420This behavior can be changed by sysctl variable
3421.Va net.inet.ip.fw.nat64_direct_output .
3422Also translated packet can be tagged using
3423.Cm tag
3424rule action, and then matched by
3425.Cm tagged
3426opcode to avoid loops and extra overhead.
3427.Pp
3428The stateful NAT64 configuration command is the following:
3429.Bd -ragged -offset indent
3430.Bk -words
3431.Cm nat64lsn
3432.Ar name
3433.Cm create
3434.Ar create-options
3435.Ek
3436.Ed
3437.Pp
3438The following parameters can be configured:
3439.Bl -tag -width indent
3440.It Cm prefix4 Ar ipv4_prefix/plen
3441The IPv4 prefix with mask defines the pool of IPv4 addresses used as
3442source address after translation.
3443Stateful NAT64 module translates IPv6 source address of client to one
3444IPv4 address from this pool.
3445Note that incoming IPv4 packets that don't have corresponding state entry
3446in the states table will be dropped by translator.
3447Make sure that translation rules handle packets, destined to configured prefix.
3448.It Cm prefix6 Ar ipv6_prefix/length
3449The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3450to represent IPv4 addresses.
3451This IPv6 prefix should be configured in DNS64.
3452The translator implementation follows RFC6052, that restricts the length of
3453prefixes to one of following: 32, 40, 48, 56, 64, or 96.
3454The Well-Known IPv6 Prefix 64:ff9b:: must be 96 bits long.
3455The special
3456.Ar ::/length
3457prefix can be used to handle several IPv6 prefixes with one NAT64 instance.
3458The NAT64 instance will determine a destination IPv4 address from prefix
3459.Ar length .
3460.It Cm states_chunks Ar number
3461The number of states chunks in single ports group.
3462Each ports group by default can keep 64 state entries in single chunk.
3463The above value affects the maximum number of states that can be associated with single IPv4 alias address and port.
3464The value must be power of 2, and up to 128.
3465.It Cm host_del_age Ar seconds
3466The number of seconds until the host entry for a IPv6 client will be deleted
3467and all its resources will be released due to inactivity.
3468Default value is
3469.Ar 3600 .
3470.It Cm pg_del_age Ar seconds
3471The number of seconds until a ports group with unused state entries will
3472be released.
3473Default value is
3474.Ar 900 .
3475.It Cm tcp_syn_age Ar seconds
3476The number of seconds while a state entry for TCP connection with only SYN
3477sent will be kept.
3478If TCP connection establishing will not be finished,
3479state entry will be deleted.
3480Default value is
3481.Ar 10 .
3482.It Cm tcp_est_age Ar seconds
3483The number of seconds while a state entry for established TCP connection
3484will be kept.
3485Default value is
3486.Ar 7200 .
3487.It Cm tcp_close_age Ar seconds
3488The number of seconds while a state entry for closed TCP connection
3489will be kept.
3490Keeping state entries for closed connections is needed, because IPv4 servers
3491typically keep closed connections in a TIME_WAIT state for a several minutes.
3492Since translator's IPv4 addresses are shared among all IPv6 clients,
3493new connections from the same addresses and ports may be rejected by server,
3494because these connections are still in a TIME_WAIT state.
3495Keeping them in translator's state table protects from such rejects.
3496Default value is
3497.Ar 180 .
3498.It Cm udp_age Ar seconds
3499The number of seconds while translator keeps state entry in a waiting for
3500reply to the sent UDP datagram.
3501Default value is
3502.Ar 120 .
3503.It Cm icmp_age Ar seconds
3504The number of seconds while translator keeps state entry in a waiting for
3505reply to the sent ICMP message.
3506Default value is
3507.Ar 60 .
3508.It Cm log
3509Turn on logging of all handled packets via BPF through
3510.Ar ipfwlog0
3511interface.
3512.Ar ipfwlog0
3513is a pseudo interface and can be created after a boot manually with
3514.Cm ifconfig
3515command.
3516Note that it has different purpose than
3517.Ar ipfw0
3518interface.
3519Translators sends to BPF an additional information with each packet.
3520With
3521.Cm tcpdump
3522you are able to see each handled packet before and after translation.
3523.It Cm -log
3524Turn off logging of all handled packets via BPF.
3525.It Cm allow_private
3526Turn on processing private IPv4 addresses.
3527By default IPv6 packets with destinations mapped to private address ranges
3528defined by RFC1918 are not processed.
3529.It Cm -allow_private
3530Turn off private address handling in
3531.Nm nat64
3532instance.
3533.El
3534.Pp
3535To inspect a states table of stateful NAT64 the following command can be used:
3536.Bd -ragged -offset indent
3537.Bk -words
3538.Cm nat64lsn
3539.Ar name
3540.Cm show Cm states
3541.Ek
3542.Ed
3543.Pp
3544Stateless NAT64 translator doesn't use a states table for translation
3545and converts IPv4 addresses to IPv6 and vice versa solely based on the
3546mappings taken from configured lookup tables.
3547Since a states table doesn't used by stateless translator,
3548it can be configured to pass IPv4 clients to IPv6-only servers.
3549.Pp
3550The stateless NAT64 configuration command is the following:
3551.Bd -ragged -offset indent
3552.Bk -words
3553.Cm nat64stl
3554.Ar name
3555.Cm create
3556.Ar create-options
3557.Ek
3558.Ed
3559.Pp
3560The following parameters can be configured:
3561.Bl -tag -width indent
3562.It Cm prefix6 Ar ipv6_prefix/length
3563The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3564to represent IPv4 addresses.
3565This IPv6 prefix should be configured in DNS64.
3566.It Cm table4 Ar table46
3567The lookup table
3568.Ar table46
3569contains mapping how IPv4 addresses should be translated to IPv6 addresses.
3570.It Cm table6 Ar table64
3571The lookup table
3572.Ar table64
3573contains mapping how IPv6 addresses should be translated to IPv4 addresses.
3574.It Cm log
3575Turn on logging of all handled packets via BPF through
3576.Ar ipfwlog0
3577interface.
3578.It Cm -log
3579Turn off logging of all handled packets via BPF.
3580.It Cm allow_private
3581Turn on processing private IPv4 addresses.
3582By default IPv6 packets with destinations mapped to private address ranges
3583defined by RFC1918 are not processed.
3584.It Cm -allow_private
3585Turn off private address handling in
3586.Nm nat64
3587instance.
3588.El
3589.Pp
3590Note that the behavior of stateless translator with respect to not matched
3591packets differs from stateful translator.
3592If corresponding addresses was not found in the lookup tables, the packet
3593will not be dropped and the search continues.
3594.Ss XLAT464 CLAT translation
3595XLAT464 CLAT NAT64 translator implements client-side stateless translation as
3596defined in RFC6877 and is very similar to statless NAT64 translator
3597explained above.
3598Instead of lookup tables it uses one-to-one mapping between IPv4 and IPv6
3599addresses using configured prefixes.
3600This mode can be used as a replacement of DNS64 service for applications
3601that are not using it (e.g. VoIP) allowing them to access IPv4-only Internet
3602over IPv6-only networks with help of remote NAT64 translator.
3603.Pp
3604The CLAT NAT64 configuration command is the following:
3605.Bd -ragged -offset indent
3606.Bk -words
3607.Cm nat64clat
3608.Ar name
3609.Cm create
3610.Ar create-options
3611.Ek
3612.Ed
3613.Pp
3614The following parameters can be configured:
3615.Bl -tag -width indent
3616.It Cm clat_prefix Ar ipv6_prefix/length
3617The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3618to represent source IPv4 addresses.
3619.It Cm plat_prefix Ar ipv6_prefix/length
3620The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3621to represent destination IPv4 addresses.
3622This IPv6 prefix should be configured on a remote NAT64 translator.
3623.It Cm log
3624Turn on logging of all handled packets via BPF through
3625.Ar ipfwlog0
3626interface.
3627.It Cm -log
3628Turn off logging of all handled packets via BPF.
3629.It Cm allow_private
3630Turn on processing private IPv4 addresses.
3631By default
3632.Nm nat64clat
3633instance will not process IPv4 packets with destination address from private
3634ranges as defined in RFC1918.
3635.It Cm -allow_private
3636Turn off private address handling in
3637.Nm nat64clat
3638instance.
3639.El
3640.Pp
3641Note that the behavior of CLAT translator with respect to not matched
3642packets differs from stateful translator.
3643If corresponding addresses were not matched against prefixes configured,
3644the packet will not be dropped and the search continues.
3645.Sh IPv6-to-IPv6 NETWORK PREFIX TRANSLATION (NPTv6)
3646.Nm
3647supports in-kernel IPv6-to-IPv6 network prefix translation as described
3648in RFC6296.
3649The kernel module
3650.Cm ipfw_nptv6
3651should be loaded or kernel should has
3652.Cm options IPFIREWALL_NPTV6
3653to be able use NPTv6 translator.
3654.Pp
3655The NPTv6 configuration command is the following:
3656.Bd -ragged -offset indent
3657.Bk -words
3658.Cm nptv6
3659.Ar name
3660.Cm create
3661.Ar create-options
3662.Ek
3663.Ed
3664.Pp
3665The following parameters can be configured:
3666.Bl -tag -width indent
3667.It Cm int_prefix Ar ipv6_prefix
3668IPv6 prefix used in internal network.
3669NPTv6 module translates source address when it matches this prefix.
3670.It Cm ext_prefix Ar ipv6_prefix
3671IPv6 prefix used in external network.
3672NPTv6 module translates destination address when it matches this prefix.
3673.It Cm ext_if Ar nic
3674The NPTv6 module will use first global IPv6 address from interface
3675.Ar nic
3676as external prefix.
3677It can be useful when IPv6 prefix of external network is dynamically obtained.
3678.Cm ext_prefix
3679and
3680.Cm ext_if
3681options are mutually exclusive.
3682.It Cm prefixlen Ar length
3683The length of specified IPv6 prefixes.
3684It must be in range from 8 to 64.
3685.El
3686.Pp
3687Note that the prefix translation rules are silently ignored when IPv6 packet
3688forwarding is disabled.
3689To enable the packet forwarding, set the sysctl variable
3690.Va net.inet6.ip6.forwarding
3691to 1.
3692.Pp
3693To let the packet continue after being translated, set the sysctl variable
3694.Va net.inet.ip.fw.one_pass
3695to 0.
3696.Sh LOADER TUNABLES
3697Tunables can be set in
3698.Xr loader 8
3699prompt,
3700.Xr loader.conf 5
3701or
3702.Xr kenv 1
3703before ipfw module gets loaded.
3704.Bl -tag -width indent
3705.It Va net.inet.ip.fw.default_to_accept : No 0
3706Defines ipfw last rule behavior.
3707This value overrides
3708.Cd "options IPFW_DEFAULT_TO_(ACCEPT|DENY)"
3709from kernel configuration file.
3710.It Va net.inet.ip.fw.tables_max : No 128
3711Defines number of tables available in ipfw.
3712Number cannot exceed 65534.
3713.El
3714.Sh SYSCTL VARIABLES
3715A set of
3716.Xr sysctl 8
3717variables controls the behaviour of the firewall and
3718associated modules
3719.Pq Nm dummynet , bridge , sctp nat .
3720These are shown below together with their default value
3721(but always check with the
3722.Xr sysctl 8
3723command what value is actually in use) and meaning:
3724.Bl -tag -width indent
3725.It Va net.inet.ip.alias.sctp.accept_global_ootb_addip : No 0
3726Defines how the
3727.Nm nat
3728responds to receipt of global OOTB ASCONF-AddIP:
3729.Bl -tag -width indent
3730.It Cm 0
3731No response (unless a partially matching association exists -
3732ports and vtags match but global address does not)
3733.It Cm 1
3734.Nm nat
3735will accept and process all OOTB global AddIP messages.
3736.El
3737.Pp
3738Option 1 should never be selected as this forms a security risk.
3739An attacker can
3740establish multiple fake associations by sending AddIP messages.
3741.It Va net.inet.ip.alias.sctp.chunk_proc_limit : No 5
3742Defines the maximum number of chunks in an SCTP packet that will be
3743parsed for a
3744packet that matches an existing association.
3745This value is enforced to be greater or equal than
3746.Cm net.inet.ip.alias.sctp.initialising_chunk_proc_limit .
3747A high value is
3748a DoS risk yet setting too low a value may result in
3749important control chunks in
3750the packet not being located and parsed.
3751.It Va net.inet.ip.alias.sctp.error_on_ootb : No 1
3752Defines when the
3753.Nm nat
3754responds to any Out-of-the-Blue (OOTB) packets with ErrorM packets.
3755An OOTB packet is a packet that arrives with no existing association
3756registered in the
3757.Nm nat
3758and is not an INIT or ASCONF-AddIP packet:
3759.Bl -tag -width indent
3760.It Cm 0
3761ErrorM is never sent in response to OOTB packets.
3762.It Cm 1
3763ErrorM is only sent to OOTB packets received on the local side.
3764.It Cm 2
3765ErrorM is sent to the local side and on the global side ONLY if there is a
3766partial match (ports and vtags match but the source global IP does not).
3767This value is only useful if the
3768.Nm nat
3769is tracking global IP addresses.
3770.It Cm 3
3771ErrorM is sent in response to all OOTB packets on both
3772the local and global side
3773(DoS risk).
3774.El
3775.Pp
3776At the moment the default is 0, since the ErrorM packet is not yet
3777supported by most SCTP stacks.
3778When it is supported, and if not tracking
3779global addresses, we recommend setting this value to 1 to allow
3780multi-homed local hosts to function with the
3781.Nm nat .
3782To track global addresses, we recommend setting this value to 2 to
3783allow global hosts to be informed when they need to (re)send an
3784ASCONF-AddIP.
3785Value 3 should never be chosen (except for debugging) as the
3786.Nm nat
3787will respond to all OOTB global packets (a DoS risk).
3788.It Va net.inet.ip.alias.sctp.hashtable_size : No 2003
3789Size of hash tables used for
3790.Nm nat
3791lookups (100 < prime_number > 1000001).
3792This value sets the
3793.Nm hash table
3794size for any future created
3795.Nm nat
3796instance and therefore must be set prior to creating a
3797.Nm nat
3798instance.
3799The table sizes may be changed to suit specific needs.
3800If there will be few
3801concurrent associations, and memory is scarce, you may make these smaller.
3802If there will be many thousands (or millions) of concurrent associations, you
3803should make these larger.
3804A prime number is best for the table size.
3805The sysctl
3806update function will adjust your input value to the next highest prime number.
3807.It Va net.inet.ip.alias.sctp.holddown_time : No 0
3808Hold association in table for this many seconds after receiving a
3809SHUTDOWN-COMPLETE.
3810This allows endpoints to correct shutdown gracefully if a
3811shutdown_complete is lost and retransmissions are required.
3812.It Va net.inet.ip.alias.sctp.init_timer : No 15
3813Timeout value while waiting for (INIT-ACK|AddIP-ACK).
3814This value cannot be 0.
3815.It Va net.inet.ip.alias.sctp.initialising_chunk_proc_limit : No 2
3816Defines the maximum number of chunks in an SCTP packet that will be parsed when
3817no existing association exists that matches that packet.
3818Ideally this packet
3819will only be an INIT or ASCONF-AddIP packet.
3820A higher value may become a DoS
3821risk as malformed packets can consume processing resources.
3822.It Va net.inet.ip.alias.sctp.param_proc_limit : No 25
3823Defines the maximum number of parameters within a chunk that will be
3824parsed in a
3825packet.
3826As for other similar sysctl variables, larger values pose a DoS risk.
3827.It Va net.inet.ip.alias.sctp.log_level : No 0
3828Level of detail in the system log messages (0 \- minimal, 1 \- event,
38292 \- info, 3 \- detail, 4 \- debug, 5 \- max debug).
3830May be a good
3831option in high loss environments.
3832.It Va net.inet.ip.alias.sctp.shutdown_time : No 15
3833Timeout value while waiting for SHUTDOWN-COMPLETE.
3834This value cannot be 0.
3835.It Va net.inet.ip.alias.sctp.track_global_addresses : No 0
3836Enables/disables global IP address tracking within the
3837.Nm nat
3838and places an
3839upper limit on the number of addresses tracked for each association:
3840.Bl -tag -width indent
3841.It Cm 0
3842Global tracking is disabled
3843.It Cm >1
3844Enables tracking, the maximum number of addresses tracked for each
3845association is limited to this value
3846.El
3847.Pp
3848This variable is fully dynamic, the new value will be adopted for all newly
3849arriving associations, existing associations are treated
3850as they were previously.
3851Global tracking will decrease the number of collisions within the
3852.Nm nat
3853at a cost
3854of increased processing load, memory usage, complexity, and possible
3855.Nm nat
3856state
3857problems in complex networks with multiple
3858.Nm nats .
3859We recommend not tracking
3860global IP addresses, this will still result in a fully functional
3861.Nm nat .
3862.It Va net.inet.ip.alias.sctp.up_timer : No 300
3863Timeout value to keep an association up with no traffic.
3864This value cannot be 0.
3865.It Va net.inet.ip.dummynet.codel.interval : No 100000
3866Default
3867.Cm codel
3868AQM interval in microseconds.
3869The value must be in the range 1..5000000.
3870.It Va net.inet.ip.dummynet.codel.target : No 5000
3871Default
3872.Cm codel
3873AQM target delay time in microseconds (the minimum acceptable persistent queue
3874delay).
3875The value must be in the range 1..5000000.
3876.It Va net.inet.ip.dummynet.expire : No 1
3877Lazily delete dynamic pipes/queue once they have no pending traffic.
3878You can disable this by setting the variable to 0, in which case
3879the pipes/queues will only be deleted when the threshold is reached.
3880.It Va net.inet.ip.dummynet.fqcodel.flows : No 1024
3881Defines the default total number of flow queues (sub-queues) that
3882.Cm fq_codel
3883creates and manages.
3884The value must be in the range 1..65536.
3885.It Va net.inet.ip.dummynet.fqcodel.interval : No 100000
3886Default
3887.Cm fq_codel
3888scheduler/AQM interval in microseconds.
3889The value must be in the range 1..5000000.
3890.It Va net.inet.ip.dummynet.fqcodel.limit : No 10240
3891The default hard size limit (in unit of packet) of all queues managed by an
3892instance of the
3893.Cm fq_codel
3894scheduler.
3895The value must be in the range 1..20480.
3896.It Va net.inet.ip.dummynet.fqcodel.quantum : No 1514
3897The default quantum (credit) of the
3898.Cm fq_codel
3899in unit of byte.
3900The value must be in the range 1..9000.
3901.It Va net.inet.ip.dummynet.fqcodel.target : No 5000
3902Default
3903.Cm fq_codel
3904scheduler/AQM target delay time in microseconds (the minimum acceptable
3905persistent queue delay).
3906The value must be in the range 1..5000000.
3907.It Va net.inet.ip.dummynet.fqpie.alpha : No 125
3908The default
3909.Ar alpha
3910parameter (scaled by 1000) for
3911.Cm fq_pie
3912scheduler/AQM.
3913The value must be in the range 1..7000.
3914.It Va net.inet.ip.dummynet.fqpie.beta : No 1250
3915The default
3916.Ar beta
3917parameter (scaled by 1000) for
3918.Cm fq_pie
3919scheduler/AQM.
3920The value must be in the range 1..7000.
3921.It Va net.inet.ip.dummynet.fqpie.flows : No 1024
3922Defines the default total number of flow queues (sub-queues) that
3923.Cm fq_pie
3924creates and manages.
3925The value must be in the range 1..65536.
3926.It Va net.inet.ip.dummynet.fqpie.limit : No 10240
3927The default hard size limit (in unit of packet) of all queues managed by an
3928instance of the
3929.Cm fq_pie
3930scheduler.
3931The value must be in the range 1..20480.
3932.It Va net.inet.ip.dummynet.fqpie.max_burst : No 150000
3933The default maximum period of microseconds that
3934.Cm fq_pie
3935scheduler/AQM does not drop/mark packets.
3936The value must be in the range 1..10000000.
3937.It Va net.inet.ip.dummynet.fqpie.max_ecnth : No 99
3938The default maximum ECN probability threshold (scaled by 1000) for
3939.Cm fq_pie
3940scheduler/AQM.
3941The value must be in the range 1..7000.
3942.It Va net.inet.ip.dummynet.fqpie.quantum : No 1514
3943The default quantum (credit) of the
3944.Cm fq_pie
3945in unit of byte.
3946The value must be in the range 1..9000.
3947.It Va net.inet.ip.dummynet.fqpie.target : No 15000
3948The default
3949.Cm target
3950delay of the
3951.Cm fq_pie
3952in unit of microsecond.
3953The value must be in the range 1..5000000.
3954.It Va net.inet.ip.dummynet.fqpie.tupdate : No 15000
3955The default
3956.Cm tupdate
3957of the
3958.Cm fq_pie
3959in unit of microsecond.
3960The value must be in the range 1..5000000.
3961.It Va net.inet.ip.dummynet.hash_size : No 64
3962Default size of the hash table used for dynamic pipes/queues.
3963This value is used when no
3964.Cm buckets
3965option is specified when configuring a pipe/queue.
3966.It Va net.inet.ip.dummynet.io_fast : No 0
3967If set to a non-zero value,
3968the
3969.Dq fast
3970mode of
3971.Nm dummynet
3972operation (see above) is enabled.
3973.It Va net.inet.ip.dummynet.io_pkt
3974Number of packets passed to
3975.Nm dummynet .
3976.It Va net.inet.ip.dummynet.io_pkt_drop
3977Number of packets dropped by
3978.Nm dummynet .
3979.It Va net.inet.ip.dummynet.io_pkt_fast
3980Number of packets bypassed by the
3981.Nm dummynet
3982scheduler.
3983.It Va net.inet.ip.dummynet.max_chain_len : No 16
3984Target value for the maximum number of pipes/queues in a hash bucket.
3985The product
3986.Cm max_chain_len*hash_size
3987is used to determine the threshold over which empty pipes/queues
3988will be expired even when
3989.Cm net.inet.ip.dummynet.expire=0 .
3990.It Va net.inet.ip.dummynet.red_lookup_depth : No 256
3991.It Va net.inet.ip.dummynet.red_avg_pkt_size : No 512
3992.It Va net.inet.ip.dummynet.red_max_pkt_size : No 1500
3993Parameters used in the computations of the drop probability
3994for the RED algorithm.
3995.It Va net.inet.ip.dummynet.pie.alpha : No 125
3996The default
3997.Ar alpha
3998parameter (scaled by 1000) for
3999.Cm pie
4000AQM.
4001The value must be in the range 1..7000.
4002.It Va net.inet.ip.dummynet.pie.beta : No 1250
4003The default
4004.Ar beta
4005parameter (scaled by 1000) for
4006.Cm pie
4007AQM.
4008The value must be in the range 1..7000.
4009.It Va net.inet.ip.dummynet.pie.max_burst : No 150000
4010The default maximum period of microseconds that
4011.Cm pie
4012AQM does not drop/mark packets.
4013The value must be in the range 1..10000000.
4014.It Va net.inet.ip.dummynet.pie.max_ecnth : No 99
4015The default maximum ECN probability threshold (scaled by 1000) for
4016.Cm pie
4017AQM.
4018The value must be in the range 1..7000.
4019.It Va net.inet.ip.dummynet.pie.target : No 15000
4020The default
4021.Cm target
4022delay of
4023.Cm pie
4024AQM in unit of microsecond.
4025The value must be in the range 1..5000000.
4026.It Va net.inet.ip.dummynet.pie.tupdate : No 15000
4027The default
4028.Cm tupdate
4029of
4030.Cm pie
4031AQM in unit of microsecond.
4032The value must be in the range 1..5000000.
4033.It Va net.inet.ip.dummynet.pipe_byte_limit : No 1048576
4034.It Va net.inet.ip.dummynet.pipe_slot_limit : No 100
4035The maximum queue size that can be specified in bytes or packets.
4036These limits prevent accidental exhaustion of resources such as mbufs.
4037If you raise these limits,
4038you should make sure the system is configured so that sufficient resources
4039are available.
4040.It Va net.inet.ip.fw.autoinc_step : No 100
4041Delta between rule numbers when auto-generating them.
4042The value must be in the range 1..1000.
4043.It Va net.inet.ip.fw.curr_dyn_buckets : Va net.inet.ip.fw.dyn_buckets
4044The current number of buckets in the hash table for dynamic rules
4045(readonly).
4046.It Va net.inet.ip.fw.debug : No 1
4047Controls debugging messages produced by
4048.Nm .
4049.It Va net.inet.ip.fw.default_rule : No 65535
4050The default rule number (read-only).
4051By the design of
4052.Nm , the default rule is the last one, so its number
4053can also serve as the highest number allowed for a rule.
4054.It Va net.inet.ip.fw.dyn_buckets : No 256
4055The number of buckets in the hash table for dynamic rules.
4056Must be a power of 2, up to 65536.
4057It only takes effect when all dynamic rules have expired, so you
4058are advised to use a
4059.Cm flush
4060command to make sure that the hash table is resized.
4061.It Va net.inet.ip.fw.dyn_count : No 3
4062Current number of dynamic rules
4063(read-only).
4064.It Va net.inet.ip.fw.dyn_keepalive : No 1
4065Enables generation of keepalive packets for
4066.Cm keep-state
4067rules on TCP sessions.
4068A keepalive is generated to both
4069sides of the connection every 5 seconds for the last 20
4070seconds of the lifetime of the rule.
4071.It Va net.inet.ip.fw.dyn_max : No 8192
4072Maximum number of dynamic rules.
4073When you hit this limit, no more dynamic rules can be
4074installed until old ones expire.
4075.It Va net.inet.ip.fw.dyn_ack_lifetime : No 300
4076.It Va net.inet.ip.fw.dyn_syn_lifetime : No 20
4077.It Va net.inet.ip.fw.dyn_fin_lifetime : No 1
4078.It Va net.inet.ip.fw.dyn_rst_lifetime : No 1
4079.It Va net.inet.ip.fw.dyn_udp_lifetime : No 5
4080.It Va net.inet.ip.fw.dyn_short_lifetime : No 30
4081These variables control the lifetime, in seconds, of dynamic
4082rules.
4083Upon the initial SYN exchange the lifetime is kept short,
4084then increased after both SYN have been seen, then decreased
4085again during the final FIN exchange or when a RST is received.
4086Both
4087.Em dyn_fin_lifetime
4088and
4089.Em dyn_rst_lifetime
4090must be strictly lower than 5 seconds, the period of
4091repetition of keepalives.
4092The firewall enforces that.
4093.It Va net.inet.ip.fw.dyn_keep_states : No 0
4094Keep dynamic states on rule/set deletion.
4095States are relinked to default rule (65535).
4096This can be handly for ruleset reload.
4097Turned off by default.
4098.It Va net.inet.ip.fw.enable : No 1
4099Enables the firewall.
4100Setting this variable to 0 lets you run your machine without
4101firewall even if compiled in.
4102.It Va net.inet6.ip6.fw.enable : No 1
4103provides the same functionality as above for the IPv6 case.
4104.It Va net.inet.ip.fw.one_pass : No 1
4105When set, the packet exiting from the
4106.Nm dummynet
4107pipe or from
4108.Xr ng_ipfw 4
4109node is not passed though the firewall again.
4110Otherwise, after an action, the packet is
4111reinjected into the firewall at the next rule.
4112.It Va net.inet.ip.fw.tables_max : No 128
4113Maximum number of tables.
4114.It Va net.inet.ip.fw.verbose : No 1
4115Enables verbose messages.
4116.It Va net.inet.ip.fw.verbose_limit : No 0
4117Limits the number of messages produced by a verbose firewall.
4118.It Va net.inet6.ip6.fw.deny_unknown_exthdrs : No 1
4119If enabled packets with unknown IPv6 Extension Headers will be denied.
4120.It Va net.link.ether.ipfw : No 0
4121Controls whether layer2 packets are passed to
4122.Nm .
4123Default is no.
4124.It Va net.link.bridge.ipfw : No 0
4125Controls whether bridged packets are passed to
4126.Nm .
4127Default is no.
4128.It Va net.inet.ip.fw.nat64_debug : No 0
4129Controls debugging messages produced by
4130.Nm ipfw_nat64
4131module.
4132.It Va net.inet.ip.fw.nat64_direct_output : No 0
4133Controls the output method used by
4134.Nm ipfw_nat64
4135module:
4136.Bl -tag -width indent
4137.It Cm 0
4138A packet is handled by
4139.Nm ipfw
4140twice.
4141First time an original packet is handled by
4142.Nm ipfw
4143and consumed by
4144.Nm ipfw_nat64
4145translator.
4146Then translated packet is queued via netisr to input processing again.
4147.It Cm 1
4148A packet is handled by
4149.Nm ipfw
4150only once, and after translation it will be pushed directly to outgoing
4151interface.
4152.El
4153.El
4154.Sh INTERNAL DIAGNOSTICS
4155There are some commands that may be useful to understand current state
4156of certain subsystems inside kernel module.
4157These commands provide debugging output which may change without notice.
4158.Pp
4159Currently the following commands are available as
4160.Cm internal
4161sub-options:
4162.Bl -tag -width indent
4163.It Cm iflist
4164Lists all interface which are currently tracked by
4165.Nm
4166with their in-kernel status.
4167.It Cm talist
4168List all table lookup algorithms currently available.
4169.El
4170.Sh EXAMPLES
4171There are far too many possible uses of
4172.Nm
4173so this Section will only give a small set of examples.
4174.Ss BASIC PACKET FILTERING
4175This command adds an entry which denies all tcp packets from
4176.Em cracker.evil.org
4177to the telnet port of
4178.Em wolf.tambov.su
4179from being forwarded by the host:
4180.Pp
4181.Dl "ipfw add deny tcp from cracker.evil.org to wolf.tambov.su telnet"
4182.Pp
4183This one disallows any connection from the entire cracker's
4184network to my host:
4185.Pp
4186.Dl "ipfw add deny ip from 123.45.67.0/24 to my.host.org"
4187.Pp
4188A first and efficient way to limit access (not using dynamic rules)
4189is the use of the following rules:
4190.Pp
4191.Dl "ipfw add allow tcp from any to any established"
4192.Dl "ipfw add allow tcp from net1 portlist1 to net2 portlist2 setup"
4193.Dl "ipfw add allow tcp from net3 portlist3 to net3 portlist3 setup"
4194.Dl "..."
4195.Dl "ipfw add deny tcp from any to any"
4196.Pp
4197The first rule will be a quick match for normal TCP packets,
4198but it will not match the initial SYN packet, which will be
4199matched by the
4200.Cm setup
4201rules only for selected source/destination pairs.
4202All other SYN packets will be rejected by the final
4203.Cm deny
4204rule.
4205.Pp
4206If you administer one or more subnets, you can take advantage
4207of the address sets and or-blocks and write extremely
4208compact rulesets which selectively enable services to blocks
4209of clients, as below:
4210.Pp
4211.Dl "goodguys=\*q{ 10.1.2.0/24{20,35,66,18} or 10.2.3.0/28{6,3,11} }\*q"
4212.Dl "badguys=\*q10.1.2.0/24{8,38,60}\*q"
4213.Dl ""
4214.Dl "ipfw add allow ip from ${goodguys} to any"
4215.Dl "ipfw add deny ip from ${badguys} to any"
4216.Dl "... normal policies ..."
4217.Pp
4218The
4219.Cm verrevpath
4220option could be used to do automated anti-spoofing by adding the
4221following to the top of a ruleset:
4222.Pp
4223.Dl "ipfw add deny ip from any to any not verrevpath in"
4224.Pp
4225This rule drops all incoming packets that appear to be coming to the
4226system on the wrong interface.
4227For example, a packet with a source
4228address belonging to a host on a protected internal network would be
4229dropped if it tried to enter the system from an external interface.
4230.Pp
4231The
4232.Cm antispoof
4233option could be used to do similar but more restricted anti-spoofing
4234by adding the following to the top of a ruleset:
4235.Pp
4236.Dl "ipfw add deny ip from any to any not antispoof in"
4237.Pp
4238This rule drops all incoming packets that appear to be coming from another
4239directly connected system but on the wrong interface.
4240For example, a packet with a source address of
4241.Li 192.168.0.0/24 ,
4242configured on
4243.Li fxp0 ,
4244but coming in on
4245.Li fxp1
4246would be dropped.
4247.Pp
4248The
4249.Cm setdscp
4250option could be used to (re)mark user traffic,
4251by adding the following to the appropriate place in ruleset:
4252.Pp
4253.Dl "ipfw add setdscp be ip from any to any dscp af11,af21"
4254.Ss SELECTIVE MIRRORING
4255If your network has network traffic analyzer
4256connected to your host directly via dedicated interface
4257or remotely via RSPAN vlan, you can selectively mirror
4258some Ethernet layer2 frames to the analyzer.
4259.Pp
4260First, make sure your firewall is already configured and runs.
4261Then, enable layer2 processing if not already enabled:
4262.Pp
4263.Dl "sysctl net.link.ether.ipfw=1"
4264.Pp
4265Next, load needed additional kernel modules:
4266.Pp
4267.Dl "kldload ng_ether ng_ipfw"
4268.Pp
4269Optionally, make system load these modules automatically
4270at startup:
4271.Pp
4272.Dl sysrc kld_list+="ng_ether ng_ipfw"
4273.Pp
4274Next, configure
4275.Xr ng_ipfw 4
4276kernel module to transmit mirrored copies of layer2 frames
4277out via vlan900 interface:
4278.Pp
4279.Dl "ngctl connect ipfw: vlan900: 1 lower"
4280.Pp
4281Think of "1" here as of "mirroring instance index" and vlan900 is its
4282destination.
4283You can have arbitrary number of instances.
4284Refer to
4285.Xr ng_ipfw 4
4286for details.
4287.Pp
4288At last, actually start mirroring of selected frames using "instance 1".
4289For frames incoming from em0 interface:
4290.Pp
4291.Dl "ipfw add ngtee 1 ip from any to 192.168.0.1 layer2 in recv em0"
4292.Pp
4293For frames outgoing to em0 interface:
4294.Pp
4295.Dl "ipfw add ngtee 1 ip from any to 192.168.0.1 layer2 out xmit em0"
4296.Pp
4297For both incoming and outgoing frames while flowing through em0:
4298.Pp
4299.Dl "ipfw add ngtee 1 ip from any to 192.168.0.1 layer2 via em0"
4300.Pp
4301Make sure you do not perform mirroring for already duplicated frames
4302or kernel may hang as there is no safety net.
4303.Ss DYNAMIC RULES
4304In order to protect a site from flood attacks involving fake
4305TCP packets, it is safer to use dynamic rules:
4306.Pp
4307.Dl "ipfw add check-state"
4308.Dl "ipfw add deny tcp from any to any established"
4309.Dl "ipfw add allow tcp from my-net to any setup keep-state"
4310.Pp
4311This will let the firewall install dynamic rules only for
4312those connection which start with a regular SYN packet coming
4313from the inside of our network.
4314Dynamic rules are checked when encountering the first
4315occurrence of a
4316.Cm check-state ,
4317.Cm keep-state
4318or
4319.Cm limit
4320rule.
4321A
4322.Cm check-state
4323rule should usually be placed near the beginning of the
4324ruleset to minimize the amount of work scanning the ruleset.
4325Your mileage may vary.
4326.Pp
4327For more complex scenarios with dynamic rules
4328.Cm record-state
4329and
4330.Cm defer-action
4331can be used to precisely control creation and checking of dynamic rules.
4332Example of usage of these options are provided in
4333.Sx NETWORK ADDRESS TRANSLATION (NAT)
4334Section.
4335.Pp
4336To limit the number of connections a user can open
4337you can use the following type of rules:
4338.Pp
4339.Dl "ipfw add allow tcp from my-net/24 to any setup limit src-addr 10"
4340.Dl "ipfw add allow tcp from any to me setup limit src-addr 4"
4341.Pp
4342The former (assuming it runs on a gateway) will allow each host
4343on a /24 network to open at most 10 TCP connections.
4344The latter can be placed on a server to make sure that a single
4345client does not use more than 4 simultaneous connections.
4346.Pp
4347.Em BEWARE :
4348stateful rules can be subject to denial-of-service attacks
4349by a SYN-flood which opens a huge number of dynamic rules.
4350The effects of such attacks can be partially limited by
4351acting on a set of
4352.Xr sysctl 8
4353variables which control the operation of the firewall.
4354.Pp
4355Here is a good usage of the
4356.Cm list
4357command to see accounting records and timestamp information:
4358.Pp
4359.Dl ipfw -at list
4360.Pp
4361or in short form without timestamps:
4362.Pp
4363.Dl ipfw -a list
4364.Pp
4365which is equivalent to:
4366.Pp
4367.Dl ipfw show
4368.Pp
4369Next rule diverts all incoming packets from 192.168.2.0/24
4370to divert port 5000:
4371.Pp
4372.Dl ipfw divert 5000 ip from 192.168.2.0/24 to any in
4373.Ss TRAFFIC SHAPING
4374The following rules show some of the applications of
4375.Nm
4376and
4377.Nm dummynet
4378for simulations and the like.
4379.Pp
4380This rule drops random incoming packets with a probability
4381of 5%:
4382.Pp
4383.Dl "ipfw add prob 0.05 deny ip from any to any in"
4384.Pp
4385A similar effect can be achieved making use of
4386.Nm dummynet
4387pipes:
4388.Pp
4389.Dl "dnctl add pipe 10 ip from any to any"
4390.Dl "dnctl pipe 10 config plr 0.05"
4391.Pp
4392We can use pipes to artificially limit bandwidth, e.g.\& on a
4393machine acting as a router, if we want to limit traffic from
4394local clients on 192.168.2.0/24 we do:
4395.Pp
4396.Dl "ipfw add pipe 1 ip from 192.168.2.0/24 to any out"
4397.Dl "dnctl pipe 1 config bw 300Kbit/s queue 50KBytes"
4398.Pp
4399note that we use the
4400.Cm out
4401modifier so that the rule is not used twice.
4402Remember in fact that
4403.Nm
4404rules are checked both on incoming and outgoing packets.
4405.Pp
4406Should we want to simulate a bidirectional link with bandwidth
4407limitations, the correct way is the following:
4408.Pp
4409.Dl "ipfw add pipe 1 ip from any to any out"
4410.Dl "ipfw add pipe 2 ip from any to any in"
4411.Dl "dnctl pipe 1 config bw 64Kbit/s queue 10Kbytes"
4412.Dl "dnctl pipe 2 config bw 64Kbit/s queue 10Kbytes"
4413.Pp
4414The above can be very useful, e.g.\& if you want to see how
4415your fancy Web page will look for a residential user who
4416is connected only through a slow link.
4417You should not use only one pipe for both directions, unless
4418you want to simulate a half-duplex medium (e.g.\& AppleTalk,
4419Ethernet, IRDA).
4420It is not necessary that both pipes have the same configuration,
4421so we can also simulate asymmetric links.
4422.Pp
4423Should we want to verify network performance with the RED queue
4424management algorithm:
4425.Pp
4426.Dl "ipfw add pipe 1 ip from any to any"
4427.Dl "dnctl pipe 1 config bw 500Kbit/s queue 100 red 0.002/30/80/0.1"
4428.Pp
4429Another typical application of the traffic shaper is to
4430introduce some delay in the communication.
4431This can significantly affect applications which do a lot of Remote
4432Procedure Calls, and where the round-trip-time of the
4433connection often becomes a limiting factor much more than
4434bandwidth:
4435.Pp
4436.Dl "ipfw add pipe 1 ip from any to any out"
4437.Dl "ipfw add pipe 2 ip from any to any in"
4438.Dl "dnctl pipe 1 config delay 250ms bw 1Mbit/s"
4439.Dl "dnctl pipe 2 config delay 250ms bw 1Mbit/s"
4440.Pp
4441Per-flow queueing can be useful for a variety of purposes.
4442A very simple one is counting traffic:
4443.Pp
4444.Dl "ipfw add pipe 1 tcp from any to any"
4445.Dl "ipfw add pipe 1 udp from any to any"
4446.Dl "ipfw add pipe 1 ip from any to any"
4447.Dl "dnctl pipe 1 config mask all"
4448.Pp
4449The above set of rules will create queues (and collect
4450statistics) for all traffic.
4451Because the pipes have no limitations, the only effect is
4452collecting statistics.
4453Note that we need 3 rules, not just the last one, because
4454when
4455.Nm
4456tries to match IP packets it will not consider ports, so we
4457would not see connections on separate ports as different
4458ones.
4459.Pp
4460A more sophisticated example is limiting the outbound traffic
4461on a net with per-host limits, rather than per-network limits:
4462.Pp
4463.Dl "ipfw add pipe 1 ip from 192.168.2.0/24 to any out"
4464.Dl "ipfw add pipe 2 ip from any to 192.168.2.0/24 in"
4465.Dl "dnctl pipe 1 config mask src-ip 0x000000ff bw 200Kbit/s queue 20Kbytes"
4466.Dl "dnctl pipe 2 config mask dst-ip 0x000000ff bw 200Kbit/s queue 20Kbytes"
4467.Ss LOOKUP TABLES
4468In the following example, we need to create several traffic bandwidth
4469classes and we need different hosts/networks to fall into different classes.
4470We create one pipe for each class and configure them accordingly.
4471Then we create a single table and fill it with IP subnets and addresses.
4472For each subnet/host we set the argument equal to the number of the pipe
4473that it should use.
4474Then we classify traffic using a single rule:
4475.Pp
4476.Dl "dnctl pipe 1 config bw 1000Kbyte/s"
4477.Dl "dnctl pipe 4 config bw 4000Kbyte/s"
4478.Dl "..."
4479.Dl "ipfw table T1 create type addr"
4480.Dl "ipfw table T1 add 192.168.2.0/24 1"
4481.Dl "ipfw table T1 add 192.168.0.0/27 4"
4482.Dl "ipfw table T1 add 192.168.0.2 1"
4483.Dl "..."
4484.Dl "ipfw add pipe tablearg ip from 'table(T1)' to any"
4485.Pp
4486Using the
4487.Cm fwd
4488action, the table entries may include hostnames and IP addresses.
4489.Pp
4490.Dl "ipfw table T2 create type addr valtype ipv4"
4491.Dl "ipfw table T2 add 192.168.2.0/24 10.23.2.1"
4492.Dl "ipfw table T2 add 192.168.0.0/27 router1.dmz"
4493.Dl "..."
4494.Dl "ipfw add 100 fwd tablearg ip from any to 'table(T2)'"
4495.Pp
4496In the following example per-interface firewall is created:
4497.Pp
4498.Dl "ipfw table IN create type iface valtype skipto,fib"
4499.Dl "ipfw table IN add vlan20 12000,12"
4500.Dl "ipfw table IN add vlan30 13000,13"
4501.Dl "ipfw table OUT create type iface valtype skipto"
4502.Dl "ipfw table OUT add vlan20 22000"
4503.Dl "ipfw table OUT add vlan30 23000"
4504.Dl ".."
4505.Dl "ipfw add 100 setfib tablearg ip from any to any recv 'table(IN)' in"
4506.Dl "ipfw add 200 skipto tablearg ip from any to any recv 'table(IN)' in"
4507.Dl "ipfw add 300 skipto tablearg ip from any to any xmit 'table(OUT)' out"
4508.Pp
4509The following example illustrate usage of flow tables:
4510.Pp
4511.Dl "ipfw table fl create type flow:src-ip,proto,dst-ip,dst-port"
4512.Dl "ipfw table fl add 2a02:6b8:77::88,tcp,2a02:6b8:77::99,80 11"
4513.Dl "ipfw table fl add 10.0.0.1,udp,10.0.0.2,53 12"
4514.Dl ".."
4515.Dl "ipfw add 100 allow ip from any to any flow 'table(fl,11)' recv ix0"
4516.Ss SETS OF RULES
4517To add a set of rules atomically, e.g.\& set 18:
4518.Pp
4519.Dl "ipfw set disable 18"
4520.Dl "ipfw add NN set 18 ...         # repeat as needed"
4521.Dl "ipfw set enable 18"
4522.Pp
4523To delete a set of rules atomically the command is simply:
4524.Pp
4525.Dl "ipfw delete set 18"
4526.Pp
4527To test a ruleset and disable it and regain control if something goes wrong:
4528.Pp
4529.Dl "ipfw set disable 18"
4530.Dl "ipfw add NN set 18 ...         # repeat as needed"
4531.Dl "ipfw set enable 18; echo done; sleep 30 && ipfw set disable 18"
4532.Pp
4533Here if everything goes well, you press control-C before the "sleep"
4534terminates, and your ruleset will be left active.
4535Otherwise, e.g.\& if
4536you cannot access your box, the ruleset will be disabled after
4537the sleep terminates thus restoring the previous situation.
4538.Pp
4539To show rules of the specific set:
4540.Pp
4541.Dl "ipfw set 18 show"
4542.Pp
4543To show rules of the disabled set:
4544.Pp
4545.Dl "ipfw -S set 18 show"
4546.Pp
4547To clear a specific rule counters of the specific set:
4548.Pp
4549.Dl "ipfw set 18 zero NN"
4550.Pp
4551To delete a specific rule of the specific set:
4552.Pp
4553.Dl "ipfw set 18 delete NN"
4554.Ss NAT, REDIRECT AND LSNAT
4555First redirect all the traffic to nat instance 123:
4556.Pp
4557.Dl "ipfw add nat 123 all from any to any"
4558.Pp
4559Then to configure nat instance 123 to alias all the outgoing traffic with ip
4560192.168.0.123, blocking all incoming connections, trying to keep
4561same ports on both sides, clearing aliasing table on address change
4562and keeping a log of traffic/link statistics:
4563.Pp
4564.Dl "ipfw nat 123 config ip 192.168.0.123 log deny_in reset same_ports"
4565.Pp
4566Or to change address of instance 123, aliasing table will be cleared (see
4567reset option):
4568.Pp
4569.Dl "ipfw nat 123 config ip 10.0.0.1"
4570.Pp
4571To see configuration of nat instance 123:
4572.Pp
4573.Dl "ipfw nat 123 show config"
4574.Pp
4575To show logs of all instances:
4576.Pp
4577.Dl "ipfw nat show log"
4578.Pp
4579To see configurations of all instances:
4580.Pp
4581.Dl "ipfw nat show config"
4582.Pp
4583Or a redirect rule with mixed modes could looks like:
4584.Bd -literal -offset 2n
4585ipfw nat 123 config redirect_addr 10.0.0.1 10.0.0.66
4586			 redirect_port tcp 192.168.0.1:80 500
4587			 redirect_proto udp 192.168.1.43 192.168.1.1
4588			 redirect_addr 192.168.0.10,192.168.0.11
4589			 	    10.0.0.100	# LSNAT
4590			 redirect_port tcp 192.168.0.1:80,192.168.0.10:22
4591			 	    500		# LSNAT
4592.Ed
4593.Pp
4594or it could be split in:
4595.Bd -literal -offset 2n
4596ipfw nat 1 config redirect_addr 10.0.0.1 10.0.0.66
4597ipfw nat 2 config redirect_port tcp 192.168.0.1:80 500
4598ipfw nat 3 config redirect_proto udp 192.168.1.43 192.168.1.1
4599ipfw nat 4 config redirect_addr 192.168.0.10,192.168.0.11,192.168.0.12
4600				         10.0.0.100
4601ipfw nat 5 config redirect_port tcp
4602			192.168.0.1:80,192.168.0.10:22,192.168.0.20:25 500
4603.Ed
4604.Pp
4605Sometimes you may want to mix NAT and dynamic rules.
4606It could be achieved with
4607.Cm record-state
4608and
4609.Cm defer-action
4610options.
4611Problem is, you need to create dynamic rule before NAT and check it
4612after NAT actions (or vice versa) to have consistent addresses and ports.
4613Rule with
4614.Cm keep-state
4615option will trigger activation of existing dynamic state, and action of such
4616rule will be performed as soon as rule is matched.
4617In case of NAT and
4618.Cm allow
4619rule packet need to be passed to NAT, not allowed as soon is possible.
4620.Pp
4621There is example of set of rules to achieve this.
4622Bear in mind that this is example only and it is not very useful by itself.
4623.Pp
4624On way out, after all checks place this rules:
4625.Pp
4626.Dl "ipfw add allow record-state skip-action"
4627.Dl "ipfw add nat 1"
4628.Pp
4629And on way in there should be something like this:
4630.Pp
4631.Dl "ipfw add nat 1"
4632.Dl "ipfw add check-state"
4633.Pp
4634Please note, that first rule on way out doesn't allow packet and doesn't
4635execute existing dynamic rules.
4636All it does, create new dynamic rule with
4637.Cm allow
4638action, if it is not created yet.
4639Later, this dynamic rule is used on way in by
4640.Cm check-state
4641rule.
4642.Ss CONFIGURING CODEL, PIE, FQ-CODEL and FQ-PIE AQM
4643.Cm codel
4644and
4645.Cm pie
4646AQM can be configured for
4647.Nm dummynet
4648.Cm pipe
4649or
4650.Cm queue .
4651.Pp
4652To configure a
4653.Cm pipe
4654with
4655.Cm codel
4656AQM using default configuration for traffic from 192.168.0.0/24 and 1Mbits/s
4657rate limit, we do:
4658.Pp
4659.Dl "dnctl pipe 1 config bw 1mbits/s codel"
4660.Dl "ipfw add 100 pipe 1 ip from 192.168.0.0/24 to any"
4661.Pp
4662To configure a
4663.Cm queue
4664with
4665.Cm codel
4666AQM using different configurations parameters for traffic from
4667192.168.0.0/24 and 1Mbits/s rate limit, we do:
4668.Pp
4669.Dl "dnctl pipe 1 config bw 1mbits/s"
4670.Dl "dnctl queue 1 config pipe 1 codel target 8ms interval 160ms ecn"
4671.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4672.Pp
4673To configure a
4674.Cm pipe
4675with
4676.Cm pie
4677AQM using default configuration for traffic from 192.168.0.0/24 and 1Mbits/s
4678rate limit, we do:
4679.Pp
4680.Dl "dnctl pipe 1 config bw 1mbits/s pie"
4681.Dl "ipfw add 100 pipe 1 ip from 192.168.0.0/24 to any"
4682.Pp
4683To configure a
4684.Cm queue
4685with
4686.Cm pie
4687AQM using different configuration parameters for traffic from
4688192.168.0.0/24 and 1Mbits/s rate limit, we do:
4689.Pp
4690.Dl "dnctl pipe 1 config bw 1mbits/s"
4691.Dl "dnctl queue 1 config pipe 1 pie target 20ms tupdate 30ms ecn"
4692.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4693.Pp
4694.Cm fq_codel
4695and
4696.Cm fq_pie
4697AQM can be configured for
4698.Nm dummynet
4699schedulers.
4700.Pp
4701To configure
4702.Cm fq_codel
4703scheduler using different configurations parameters for traffic from
4704192.168.0.0/24 and 1Mbits/s rate limit, we do:
4705.Pp
4706.Dl "dnctl pipe 1 config bw 1mbits/s"
4707.Dl "dnctl sched 1 config pipe 1 type fq_codel"
4708.Dl "dnctl queue 1 config sched 1"
4709.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4710.Pp
4711To change
4712.Cm fq_codel
4713default configuration for a
4714.Cm sched
4715such as disable ECN and change the
4716.Ar target
4717to 10ms, we do:
4718.Pp
4719.Dl "dnctl sched 1 config pipe 1 type fq_codel target 10ms noecn"
4720.Pp
4721Similar to
4722.Cm fq_codel ,
4723to configure
4724.Cm fq_pie
4725scheduler using different configurations parameters for traffic from
4726192.168.0.0/24 and 1Mbits/s rate limit, we do:
4727.Pp
4728.Dl "dnctl pipe 1 config bw 1mbits/s"
4729.Dl "dnctl sched 1 config pipe 1 type fq_pie"
4730.Dl "dnctl queue 1 config sched 1"
4731.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4732.Pp
4733The configurations of
4734.Cm fq_pie
4735.Cm sched
4736can be changed in a similar way as for
4737.Cm fq_codel
4738.Sh SEE ALSO
4739.Xr cpp 1 ,
4740.Xr m4 1 ,
4741.Xr altq 4 ,
4742.Xr divert 4 ,
4743.Xr dummynet 4 ,
4744.Xr if_bridge 4 ,
4745.Xr ip 4 ,
4746.Xr ipfirewall 4 ,
4747.Xr ng_ether 4 ,
4748.Xr ng_ipfw 4 ,
4749.Xr protocols 5 ,
4750.Xr services 5 ,
4751.Xr init 8 ,
4752.Xr kldload 8 ,
4753.Xr reboot 8 ,
4754.Xr sysctl 8 ,
4755.Xr syslogd 8 ,
4756.Xr sysrc 8
4757.Sh HISTORY
4758The
4759.Nm
4760utility first appeared in
4761.Fx 2.0 .
4762.Nm dummynet
4763was introduced in
4764.Fx 2.2.8 .
4765Stateful extensions were introduced in
4766.Fx 4.0 .
4767.Nm ipfw2
4768was introduced in Summer 2002.
4769.Sh AUTHORS
4770.An Ugen J. S. Antsilevich ,
4771.An Poul-Henning Kamp ,
4772.An Alex Nash ,
4773.An Archie Cobbs ,
4774.An Luigi Rizzo ,
4775.An Rasool Al-Saadi .
4776.Pp
4777.An -nosplit
4778API based upon code written by
4779.An Daniel Boulet
4780for BSDI.
4781.Pp
4782Dummynet has been introduced by Luigi Rizzo in 1997-1998.
4783.Pp
4784Some early work (1999-2000) on the
4785.Nm dummynet
4786traffic shaper supported by Akamba Corp.
4787.Pp
4788The ipfw core (ipfw2) has been completely redesigned and
4789reimplemented by Luigi Rizzo in summer 2002.
4790Further
4791actions and
4792options have been added by various developers over the years.
4793.Pp
4794.An -nosplit
4795In-kernel NAT support written by
4796.An Paolo Pisati Aq Mt piso@FreeBSD.org
4797as part of a Summer of Code 2005 project.
4798.Pp
4799SCTP
4800.Nm nat
4801support has been developed by
4802.An The Centre for Advanced Internet Architectures (CAIA) Aq http://www.caia.swin.edu.au .
4803The primary developers and maintainers are David Hayes and Jason But.
4804For further information visit:
4805.Aq http://www.caia.swin.edu.au/urp/SONATA
4806.Pp
4807Delay profiles have been developed by Alessandro Cerri and
4808Luigi Rizzo, supported by the
4809European Commission within Projects Onelab and Onelab2.
4810.Pp
4811CoDel, PIE, FQ-CoDel and FQ-PIE AQM for Dummynet have been implemented by
4812.An The Centre for Advanced Internet Architectures (CAIA)
4813in 2016, supported by The Comcast Innovation Fund.
4814The primary developer is
4815Rasool Al-Saadi.
4816.Sh BUGS
4817The syntax has grown over the years and sometimes it might be confusing.
4818Unfortunately, backward compatibility prevents cleaning up mistakes
4819made in the definition of the syntax.
4820.Pp
4821.Em !!! WARNING !!!
4822.Pp
4823Misconfiguring the firewall can put your computer in an unusable state,
4824possibly shutting down network services and requiring console access to
4825regain control of it.
4826.Pp
4827Incoming packet fragments diverted by
4828.Cm divert
4829are reassembled before delivery to the socket.
4830The action used on those packet is the one from the
4831rule which matches the first fragment of the packet.
4832.Pp
4833Packets diverted to userland, and then reinserted by a userland process
4834may lose various packet attributes.
4835The packet source interface name
4836will be preserved if it is shorter than 8 bytes and the userland process
4837saves and reuses the sockaddr_in
4838(as does
4839.Xr natd 8 ) ;
4840otherwise, it may be lost.
4841If a packet is reinserted in this manner, later rules may be incorrectly
4842applied, making the order of
4843.Cm divert
4844rules in the rule sequence very important.
4845.Pp
4846Dummynet drops all packets with IPv6 link-local addresses.
4847.Pp
4848Rules using
4849.Cm uid
4850or
4851.Cm gid
4852may not behave as expected.
4853In particular, incoming SYN packets may
4854have no uid or gid associated with them since they do not yet belong
4855to a TCP connection, and the uid/gid associated with a packet may not
4856be as expected if the associated process calls
4857.Xr setuid 2
4858or similar system calls.
4859.Pp
4860Rule syntax is subject to the command line environment and some patterns
4861may need to be escaped with the backslash character
4862or quoted appropriately.
4863.Pp
4864Due to the architecture of
4865.Xr libalias 3 ,
4866ipfw nat is not compatible with the TCP segmentation offloading (TSO).
4867Thus, to reliably nat your network traffic, please disable TSO
4868on your NICs using
4869.Xr ifconfig 8 .
4870.Pp
4871ICMP error messages are not implicitly matched by dynamic rules
4872for the respective conversations.
4873To avoid failures of network error detection and path MTU discovery,
4874ICMP error messages may need to be allowed explicitly through static
4875rules.
4876.Pp
4877Rules using
4878.Cm call
4879and
4880.Cm return
4881actions may lead to confusing behaviour if ruleset has mistakes,
4882and/or interaction with other subsystems (netgraph, dummynet, etc.) is used.
4883One possible case for this is packet leaving
4884.Nm
4885in subroutine on the input pass, while later on output encountering unpaired
4886.Cm return
4887first.
4888As the call stack is kept intact after input pass, packet will suddenly
4889return to the rule number used on input pass, not on output one.
4890Order of processing should be checked carefully to avoid such mistakes.
4891