xref: /freebsd/sbin/ipfw/ipfw.8 (revision c0a4a7bb942fd3302f0093e4353820916d3661d1)
1.\"
2.\" $FreeBSD$
3.\"
4.Dd January 25, 2023
5.Dt IPFW 8
6.Os
7.Sh NAME
8.Nm ipfw , dnctl
9.Nd User interface for firewall, traffic shaper, packet scheduler,
10in-kernel NAT.
11.Sh SYNOPSIS
12.Ss FIREWALL CONFIGURATION
13.Nm
14.Op Fl cq
15.Cm add
16.Ar rule
17.Nm
18.Op Fl acdefnNStT
19.Op Cm set Ar N
20.Brq Cm list | show
21.Op Ar rule | first-last ...
22.Nm
23.Op Fl f | q
24.Op Cm set Ar N
25.Cm flush
26.Nm
27.Op Fl q
28.Op Cm set Ar N
29.Brq Cm delete | zero | resetlog
30.Op Ar number ...
31.Pp
32.Nm
33.Cm set Oo Cm disable Ar number ... Oc Op Cm enable Ar number ...
34.Nm
35.Cm set move
36.Op Cm rule
37.Ar number Cm to Ar number
38.Nm
39.Cm set swap Ar number number
40.Nm
41.Cm set show
42.Ss SYSCTL SHORTCUTS
43.Nm
44.Cm enable
45.Brq Cm firewall | altq | one_pass | debug | verbose | dyn_keepalive
46.Nm
47.Cm disable
48.Brq Cm firewall | altq | one_pass | debug | verbose | dyn_keepalive
49.Ss LOOKUP TABLES
50.Nm
51.Oo Cm set Ar N Oc Cm table Ar name Cm create Ar create-options
52.Nm
53.Oo Cm set Ar N Oc Cm table
54.Brq Ar name | all
55.Cm destroy
56.Nm
57.Oo Cm set Ar N Oc Cm table Ar name Cm modify Ar modify-options
58.Nm
59.Oo Cm set Ar N Oc Cm table Ar name Cm swap Ar name
60.Nm
61.Oo Cm set Ar N Oc Cm table Ar name Cm add Ar table-key Op Ar value
62.Nm
63.Oo Cm set Ar N Oc Cm table Ar name Cm add Op Ar table-key Ar value ...
64.Nm
65.Oo Cm set Ar N Oc Cm table Ar name Cm atomic add Op Ar table-key Ar value ...
66.Nm
67.Oo Cm set Ar N Oc Cm table Ar name Cm delete Op Ar table-key ...
68.Nm
69.Oo Cm set Ar N Oc Cm table Ar name Cm lookup Ar addr
70.Nm
71.Oo Cm set Ar N Oc Cm table Ar name Cm lock
72.Nm
73.Oo Cm set Ar N Oc Cm table Ar name Cm unlock
74.Nm
75.Oo Cm set Ar N Oc Cm table
76.Brq Ar name | all
77.Cm list
78.Nm
79.Oo Cm set Ar N Oc Cm table
80.Brq Ar name | all
81.Cm info
82.Nm
83.Oo Cm set Ar N Oc Cm table
84.Brq Ar name | all
85.Cm detail
86.Nm
87.Oo Cm set Ar N Oc Cm table
88.Brq Ar name | all
89.Cm flush
90.Ss DUMMYNET CONFIGURATION (TRAFFIC SHAPER AND PACKET SCHEDULER)
91.Nm dnctl
92.Brq Cm pipe | queue | sched
93.Ar number
94.Cm config
95.Ar config-options
96.Nm dnctl
97.Op Fl s Op Ar field
98.Brq Cm pipe | queue | sched
99.Brq Cm delete | list | show
100.Op Ar number ...
101.Ss IN-KERNEL NAT
102.Nm
103.Op Fl q
104.Cm nat
105.Ar number
106.Cm config
107.Ar config-options
108.Nm
109.Cm nat
110.Ar number
111.Cm show
112.Brq Cm config | log
113.Ss STATEFUL IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
114.Nm
115.Oo Cm set Ar N Oc Cm nat64lsn Ar name Cm create Ar create-options
116.Nm
117.Oo Cm set Ar N Oc Cm nat64lsn Ar name Cm config Ar config-options
118.Nm
119.Oo Cm set Ar N Oc Cm nat64lsn
120.Brq Ar name | all
121.Brq Cm list | show
122.Op Cm states
123.Nm
124.Oo Cm set Ar N Oc Cm nat64lsn
125.Brq Ar name | all
126.Cm destroy
127.Nm
128.Oo Cm set Ar N Oc Cm nat64lsn Ar name Cm stats Op Cm reset
129.Ss STATELESS IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
130.Nm
131.Oo Cm set Ar N Oc Cm nat64stl Ar name Cm create Ar create-options
132.Nm
133.Oo Cm set Ar N Oc Cm nat64stl Ar name Cm config Ar config-options
134.Nm
135.Oo Cm set Ar N Oc Cm nat64stl
136.Brq Ar name | all
137.Brq Cm list | show
138.Nm
139.Oo Cm set Ar N Oc Cm nat64stl
140.Brq Ar name | all
141.Cm destroy
142.Nm
143.Oo Cm set Ar N Oc Cm nat64stl Ar name Cm stats Op Cm reset
144.Ss XLAT464 CLAT IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
145.Nm
146.Oo Cm set Ar N Oc Cm nat64clat Ar name Cm create Ar create-options
147.Nm
148.Oo Cm set Ar N Oc Cm nat64clat Ar name Cm config Ar config-options
149.Nm
150.Oo Cm set Ar N Oc Cm nat64clat
151.Brq Ar name | all
152.Brq Cm list | show
153.Nm
154.Oo Cm set Ar N Oc Cm nat64clat
155.Brq Ar name | all
156.Cm destroy
157.Nm
158.Oo Cm set Ar N Oc Cm nat64clat Ar name Cm stats Op Cm reset
159.Ss IPv6-to-IPv6 NETWORK PREFIX TRANSLATION
160.Nm
161.Oo Cm set Ar N Oc Cm nptv6 Ar name Cm create Ar create-options
162.Nm
163.Oo Cm set Ar N Oc Cm nptv6
164.Brq Ar name | all
165.Brq Cm list | show
166.Nm
167.Oo Cm set Ar N Oc Cm nptv6
168.Brq Ar name | all
169.Cm destroy
170.Nm
171.Oo Cm set Ar N Oc Cm nptv6 Ar name Cm stats Op Cm reset
172.Ss INTERNAL DIAGNOSTICS
173.Nm
174.Cm internal iflist
175.Nm
176.Cm internal talist
177.Nm
178.Cm internal vlist
179.Ss LIST OF RULES AND PREPROCESSING
180.Nm
181.Op Fl cfnNqS
182.Oo
183.Fl p Ar preproc
184.Oo
185.Ar preproc-flags
186.Oc
187.Oc
188.Ar pathname
189.Sh DESCRIPTION
190The
191.Nm
192utility is the user interface for controlling the
193.Xr ipfw 4
194firewall, the
195.Xr dummynet 4
196traffic shaper/packet scheduler, and the
197in-kernel NAT services.
198.Pp
199A firewall configuration, or
200.Em ruleset ,
201is made of a list of
202.Em rules
203numbered from 1 to 65535.
204Packets are passed to the firewall
205from a number of different places in the protocol stack
206(depending on the source and destination of the packet,
207it is possible for the firewall to be
208invoked multiple times on the same packet).
209The packet passed to the firewall is compared
210against each of the rules in the
211.Em ruleset ,
212in rule-number order
213(multiple rules with the same number are permitted, in which case
214they are processed in order of insertion).
215When a match is found, the action corresponding to the
216matching rule is performed.
217.Pp
218Depending on the action and certain system settings, packets
219can be reinjected into the firewall at some rule after the
220matching one for further processing.
221.Pp
222A ruleset always includes a
223.Em default
224rule (numbered 65535) which cannot be modified or deleted,
225and matches all packets.
226The action associated with the
227.Em default
228rule can be either
229.Cm deny
230or
231.Cm allow
232depending on how the kernel is configured.
233.Pp
234If the ruleset includes one or more rules with the
235.Cm keep-state ,
236.Cm record-state ,
237.Cm limit
238or
239.Cm set-limit
240option,
241the firewall will have a
242.Em stateful
243behaviour, i.e., upon a match it will create
244.Em dynamic rules ,
245i.e., rules that match packets with the same 5-tuple
246(protocol, source and destination addresses and ports)
247as the packet which caused their creation.
248Dynamic rules, which have a limited lifetime, are checked
249at the first occurrence of a
250.Cm check-state ,
251.Cm keep-state
252or
253.Cm limit
254rule, and are typically used to open the firewall on-demand to
255legitimate traffic only.
256Please note, that
257.Cm keep-state
258and
259.Cm limit
260imply implicit
261.Cm check-state
262for all packets (not only these matched by the rule) but
263.Cm record-state
264and
265.Cm set-limit
266have no implicit
267.Cm check-state .
268See the
269.Sx STATEFUL FIREWALL
270and
271.Sx EXAMPLES
272Sections below for more information on the stateful behaviour of
273.Nm .
274.Pp
275All rules (including dynamic ones) have a few associated counters:
276a packet count, a byte count, a log count and a timestamp
277indicating the time of the last match.
278Counters can be displayed or reset with
279.Nm
280commands.
281.Pp
282Each rule belongs to one of 32 different
283.Em sets
284, and there are
285.Nm
286commands to atomically manipulate sets, such as enable,
287disable, swap sets, move all rules in a set to another
288one, delete all rules in a set.
289These can be useful to
290install temporary configurations, or to test them.
291See Section
292.Sx SETS OF RULES
293for more information on
294.Em sets .
295.Pp
296Rules can be added with the
297.Cm add
298command; deleted individually or in groups with the
299.Cm delete
300command, and globally (except those in set 31) with the
301.Cm flush
302command; displayed, optionally with the content of the
303counters, using the
304.Cm show
305and
306.Cm list
307commands.
308Finally, counters can be reset with the
309.Cm zero
310and
311.Cm resetlog
312commands.
313.Ss COMMAND OPTIONS
314The following general options are available when invoking
315.Nm :
316.Bl -tag -width indent
317.It Fl a
318Show counter values when listing rules.
319The
320.Cm show
321command implies this option.
322.It Fl b
323Only show the action and the comment, not the body of a rule.
324Implies
325.Fl c .
326.It Fl c
327When entering or showing rules, print them in compact form,
328i.e., omitting the "ip from any to any" string
329when this does not carry any additional information.
330.It Fl d
331When listing, show dynamic rules in addition to static ones.
332.It Fl D
333When listing, show only dynamic states.
334When deleting, delete only dynamic states.
335.It Fl f
336Run without prompting for confirmation for commands that can cause problems if misused,
337i.e.,
338.Cm flush .
339If there is no tty associated with the process, this is implied.
340The
341.Cm delete
342command with this flag ignores possible errors,
343i.e., nonexistent rule number.
344And for batched commands execution continues with the next command.
345.It Fl i
346When listing a table (see the
347.Sx LOOKUP TABLES
348section below for more information on lookup tables), format values
349as IP addresses.
350By default, values are shown as integers.
351.It Fl n
352Only check syntax of the command strings, without actually passing
353them to the kernel.
354.It Fl N
355Try to resolve addresses and service names in output.
356.It Fl q
357Be quiet when executing the
358.Cm add ,
359.Cm nat ,
360.Cm zero ,
361.Cm resetlog
362or
363.Cm flush
364commands;
365(implies
366.Fl f ) .
367This is useful when updating rulesets by executing multiple
368.Nm
369commands in a script
370(e.g.,
371.Ql sh\ /etc/rc.firewall ) ,
372or by processing a file with many
373.Nm
374rules across a remote login session.
375It also stops a table add or delete
376from failing if the entry already exists or is not present.
377.Pp
378The reason why this option may be important is that
379for some of these actions,
380.Nm
381may print a message; if the action results in blocking the
382traffic to the remote client,
383the remote login session will be closed
384and the rest of the ruleset will not be processed.
385Access to the console would then be required to recover.
386.It Fl S
387When listing rules, show the
388.Em set
389each rule belongs to.
390If this flag is not specified, disabled rules will not be
391listed.
392.It Fl s Op Ar field
393When listing pipes, sort according to one of the four
394counters (total or current packets or bytes).
395.It Fl t
396When listing, show last match timestamp converted with
397.Fn ctime .
398.It Fl T
399When listing, show last match timestamp as seconds from the epoch.
400This form can be more convenient for postprocessing by scripts.
401.El
402.Ss LIST OF RULES AND PREPROCESSING
403To ease configuration, rules can be put into a file which is
404processed using
405.Nm
406as shown in the last synopsis line.
407An absolute
408.Ar pathname
409must be used.
410The file will be read line by line and applied as arguments to the
411.Nm
412utility.
413.Pp
414Optionally, a preprocessor can be specified using
415.Fl p Ar preproc
416where
417.Ar pathname
418is to be piped through.
419Useful preprocessors include
420.Xr cpp 1
421and
422.Xr m4 1 .
423If
424.Ar preproc
425does not start with a slash
426.Pq Ql /
427as its first character, the usual
428.Ev PATH
429name search is performed.
430Care should be taken with this in environments where not all
431file systems are mounted (yet) by the time
432.Nm
433is being run (e.g.\& when they are mounted over NFS).
434Once
435.Fl p
436has been specified, any additional arguments are passed on to the preprocessor
437for interpretation.
438This allows for flexible configuration files (like conditionalizing
439them on the local hostname) and the use of macros to centralize
440frequently required arguments like IP addresses.
441.Ss TRAFFIC SHAPER CONFIGURATION
442The
443.Nm dnctl
444.Cm pipe , queue
445and
446.Cm sched
447commands are used to configure the traffic shaper and packet scheduler.
448See the
449.Sx TRAFFIC SHAPER (DUMMYNET) CONFIGURATION
450Section below for details.
451.Pp
452If the world and the kernel get out of sync the
453.Nm
454ABI may break, preventing you from being able to add any rules.
455This can adversely affect the booting process.
456You can use
457.Nm
458.Cm disable
459.Cm firewall
460to temporarily disable the firewall to regain access to the network,
461allowing you to fix the problem.
462.Sh PACKET FLOW
463A packet is checked against the active ruleset in multiple places
464in the protocol stack, under control of several sysctl variables.
465These places and variables are shown below, and it is important to
466have this picture in mind in order to design a correct ruleset.
467.Bd -literal -offset indent
468       ^    to upper layers    V
469       |                       |
470       +----------->-----------+
471       ^                       V
472 [ip(6)_input]           [ip(6)_output]     net.inet(6).ip(6).fw.enable=1
473       |                       |
474       ^                       V
475 [ether_demux]        [ether_output_frame]  net.link.ether.ipfw=1
476       |                       |
477       +-->--[bdg_forward]-->--+            net.link.bridge.ipfw=1
478       ^                       V
479       |      to devices       |
480.Ed
481.Pp
482The number of
483times the same packet goes through the firewall can
484vary between 0 and 4 depending on packet source and
485destination, and system configuration.
486.Pp
487Note that as packets flow through the stack, headers can be
488stripped or added to it, and so they may or may not be available
489for inspection.
490E.g., incoming packets will include the MAC header when
491.Nm
492is invoked from
493.Cm ether_demux() ,
494but the same packets will have the MAC header stripped off when
495.Nm
496is invoked from
497.Cm ip_input()
498or
499.Cm ip6_input() .
500.Pp
501Also note that each packet is always checked against the complete ruleset,
502irrespective of the place where the check occurs, or the source of the packet.
503If a rule contains some match patterns or actions which are not valid
504for the place of invocation (e.g.\& trying to match a MAC header within
505.Cm ip_input
506or
507.Cm ip6_input ),
508the match pattern will not match, but a
509.Cm not
510operator in front of such patterns
511.Em will
512cause the pattern to
513.Em always
514match on those packets.
515It is thus the responsibility of
516the programmer, if necessary, to write a suitable ruleset to
517differentiate among the possible places.
518.Cm skipto
519rules can be useful here, as an example:
520.Bd -literal -offset indent
521# packets from ether_demux or bdg_forward
522ipfw add 10 skipto 1000 all from any to any layer2 in
523# packets from ip_input
524ipfw add 10 skipto 2000 all from any to any not layer2 in
525# packets from ip_output
526ipfw add 10 skipto 3000 all from any to any not layer2 out
527# packets from ether_output_frame
528ipfw add 10 skipto 4000 all from any to any layer2 out
529.Ed
530.Pp
531(yes, at the moment there is no way to differentiate between
532ether_demux and bdg_forward).
533.Pp
534Also note that only actions
535.Cm allow ,
536.Cm deny ,
537.Cm netgraph ,
538.Cm ngtee
539and related to
540.Cm dummynet
541are processed for
542.Cm layer2
543frames and all other actions act as if they were
544.Cm allow
545for such frames.
546Full set of actions is supported for IP packets without
547.Cm layer2
548headers only.
549For example,
550.Cm divert
551action does not divert
552.Cm layer2
553frames.
554.Sh SYNTAX
555In general, each keyword or argument must be provided as
556a separate command line argument, with no leading or trailing
557spaces.
558Keywords are case-sensitive, whereas arguments may
559or may not be case-sensitive depending on their nature
560(e.g.\& uid's are, hostnames are not).
561.Pp
562Some arguments (e.g., port or address lists) are comma-separated
563lists of values.
564In this case, spaces after commas ',' are allowed to make
565the line more readable.
566You can also put the entire
567command (including flags) into a single argument.
568E.g., the following forms are equivalent:
569.Bd -literal -offset indent
570ipfw -q add deny src-ip 10.0.0.0/24,127.0.0.1/8
571ipfw -q add deny src-ip 10.0.0.0/24, 127.0.0.1/8
572ipfw "-q add deny src-ip 10.0.0.0/24, 127.0.0.1/8"
573.Ed
574.Sh RULE FORMAT
575The format of firewall rules is the following:
576.Bd -ragged -offset indent
577.Bk -words
578.Op Ar rule_number
579.Op Cm set Ar set_number
580.Op Cm prob Ar match_probability
581.Ar action
582.Op Cm log Op Cm logamount Ar number
583.Op Cm altq Ar queue
584.Oo
585.Bro Cm tag | untag
586.Brc Ar number
587.Oc
588.Ar body
589.Ek
590.Ed
591.Pp
592where the body of the rule specifies which information is used
593for filtering packets, among the following:
594.Pp
595.Bl -tag -width "Source and dest. addresses and ports" -offset XXX -compact
596.It Layer2 header fields
597When available
598.It IPv4 and IPv6 Protocol
599SCTP, TCP, UDP, ICMP, etc.
600.It Source and dest. addresses and ports
601.It Direction
602See Section
603.Sx PACKET FLOW
604.It Transmit and receive interface
605By name or address
606.It Misc. IP header fields
607Version, type of service, datagram length, identification,
608fragmentation flags,
609Time To Live
610.It IP options
611.It IPv6 Extension headers
612Fragmentation, Hop-by-Hop options,
613Routing Headers, Source routing rthdr0, Mobile IPv6 rthdr2, IPSec options.
614.It IPv6 Flow-ID
615.It Misc. TCP header fields
616TCP flags (SYN, FIN, ACK, RST, etc.),
617sequence number, acknowledgment number,
618window
619.It TCP options
620.It ICMP types
621for ICMP packets
622.It ICMP6 types
623for ICMP6 packets
624.It User/group ID
625When the packet can be associated with a local socket.
626.It Divert status
627Whether a packet came from a divert socket (e.g.,
628.Xr natd 8 ) .
629.It Fib annotation state
630Whether a packet has been tagged for using a specific FIB (routing table)
631in future forwarding decisions.
632.El
633.Pp
634Note that some of the above information, e.g.\& source MAC or IP addresses and
635TCP/UDP ports, can be easily spoofed, so filtering on those fields
636alone might not guarantee the desired results.
637.Bl -tag -width indent
638.It Ar rule_number
639Each rule is associated with a
640.Ar rule_number
641in the range 1..65535, with the latter reserved for the
642.Em default
643rule.
644Rules are checked sequentially by rule number.
645Multiple rules can have the same number, in which case they are
646checked (and listed) according to the order in which they have
647been added.
648If a rule is entered without specifying a number, the kernel will
649assign one in such a way that the rule becomes the last one
650before the
651.Em default
652rule.
653Automatic rule numbers are assigned by incrementing the last
654non-default rule number by the value of the sysctl variable
655.Ar net.inet.ip.fw.autoinc_step
656which defaults to 100.
657If this is not possible (e.g.\& because we would go beyond the
658maximum allowed rule number), the number of the last
659non-default value is used instead.
660.It Cm set Ar set_number
661Each rule is associated with a
662.Ar set_number
663in the range 0..31.
664Sets can be individually disabled and enabled, so this parameter
665is of fundamental importance for atomic ruleset manipulation.
666It can be also used to simplify deletion of groups of rules.
667If a rule is entered without specifying a set number,
668set 0 will be used.
669.br
670Set 31 is special in that it cannot be disabled,
671and rules in set 31 are not deleted by the
672.Nm ipfw flush
673command (but you can delete them with the
674.Nm ipfw delete set 31
675command).
676Set 31 is also used for the
677.Em default
678rule.
679.It Cm prob Ar match_probability
680A match is only declared with the specified probability
681(floating point number between 0 and 1).
682This can be useful for a number of applications such as
683random packet drop or
684(in conjunction with
685.Nm dummynet )
686to simulate the effect of multiple paths leading to out-of-order
687packet delivery.
688.Pp
689Note: this condition is checked before any other condition, including
690ones such as
691.Cm keep-state
692or
693.Cm check-state
694which might have
695side effects.
696.It Cm log Op Cm logamount Ar number
697Packets matching a rule with the
698.Cm log
699keyword will be made available for logging in two ways:
700if the sysctl variable
701.Va net.inet.ip.fw.verbose
702is set to 0 (default), one can use
703.Xr bpf 4
704attached to the
705.Li ipfw0
706pseudo interface.
707This pseudo interface can be created manually after a system
708boot by using the following command:
709.Bd -literal -offset indent
710# ifconfig ipfw0 create
711.Ed
712.Pp
713Or, automatically at boot time by adding the following
714line to the
715.Xr rc.conf 5
716file:
717.Bd -literal -offset indent
718firewall_logif="YES"
719.Ed
720.Pp
721There is zero overhead when no
722.Xr bpf 4
723is attached to the pseudo interface.
724.Pp
725If
726.Va net.inet.ip.fw.verbose
727is set to 1, packets will be logged to
728.Xr syslogd 8
729with a
730.Dv LOG_SECURITY
731facility up to a maximum of
732.Cm logamount
733packets.
734If no
735.Cm logamount
736is specified, the limit is taken from the sysctl variable
737.Va net.inet.ip.fw.verbose_limit .
738In both cases, a value of 0 means unlimited logging.
739.Pp
740Once the limit is reached, logging can be re-enabled by
741clearing the logging counter or the packet counter for that entry, see the
742.Cm resetlog
743command.
744.Pp
745Note: logging is done after all other packet matching conditions
746have been successfully verified, and before performing the final
747action (accept, deny, etc.) on the packet.
748.It Cm tag Ar number
749When a packet matches a rule with the
750.Cm tag
751keyword, the numeric tag for the given
752.Ar number
753in the range 1..65534 will be attached to the packet.
754The tag acts as an internal marker (it is not sent out over
755the wire) that can be used to identify these packets later on.
756This can be used, for example, to provide trust between interfaces
757and to start doing policy-based filtering.
758A packet can have multiple tags at the same time.
759Tags are "sticky", meaning once a tag is applied to a packet by a
760matching rule it exists until explicit removal.
761Tags are kept with the packet everywhere within the kernel, but are
762lost when packet leaves the kernel, for example, on transmitting
763packet out to the network or sending packet to a
764.Xr divert 4
765socket.
766.Pp
767To check for previously applied tags, use the
768.Cm tagged
769rule option.
770To delete previously applied tag, use the
771.Cm untag
772keyword.
773.Pp
774Note: since tags are kept with the packet everywhere in kernelspace,
775they can be set and unset anywhere in the kernel network subsystem
776(using the
777.Xr mbuf_tags 9
778facility), not only by means of the
779.Xr ipfw 4
780.Cm tag
781and
782.Cm untag
783keywords.
784For example, there can be a specialized
785.Xr netgraph 4
786node doing traffic analyzing and tagging for later inspecting
787in firewall.
788.It Cm untag Ar number
789When a packet matches a rule with the
790.Cm untag
791keyword, the tag with the number
792.Ar number
793is searched among the tags attached to this packet and,
794if found, removed from it.
795Other tags bound to packet, if present, are left untouched.
796.It Cm altq Ar queue
797When a packet matches a rule with the
798.Cm altq
799keyword, the ALTQ identifier for the given
800.Ar queue
801(see
802.Xr altq 4 )
803will be attached.
804Note that this ALTQ tag is only meaningful for packets going "out" of IPFW,
805and not being rejected or going to divert sockets.
806Note that if there is insufficient memory at the time the packet is
807processed, it will not be tagged, so it is wise to make your ALTQ
808"default" queue policy account for this.
809If multiple
810.Cm altq
811rules match a single packet, only the first one adds the ALTQ classification
812tag.
813In doing so, traffic may be shaped by using
814.Cm count Cm altq Ar queue
815rules for classification early in the ruleset, then later applying
816the filtering decision.
817For example,
818.Cm check-state
819and
820.Cm keep-state
821rules may come later and provide the actual filtering decisions in
822addition to the fallback ALTQ tag.
823.Pp
824You must run
825.Xr pfctl 8
826to set up the queues before IPFW will be able to look them up by name,
827and if the ALTQ disciplines are rearranged, the rules in containing the
828queue identifiers in the kernel will likely have gone stale and need
829to be reloaded.
830Stale queue identifiers will probably result in misclassification.
831.Pp
832All system ALTQ processing can be turned on or off via
833.Nm
834.Cm enable Ar altq
835and
836.Nm
837.Cm disable Ar altq .
838The usage of
839.Va net.inet.ip.fw.one_pass
840is irrelevant to ALTQ traffic shaping, as the actual rule action is followed
841always after adding an ALTQ tag.
842.El
843.Ss RULE ACTIONS
844A rule can be associated with one of the following actions, which
845will be executed when the packet matches the body of the rule.
846.Bl -tag -width indent
847.It Cm allow | accept | pass | permit
848Allow packets that match rule.
849The search terminates.
850.It Cm check-state Op Ar :flowname | Cm :any
851Checks the packet against the dynamic ruleset.
852If a match is found, execute the action associated with
853the rule which generated this dynamic rule, otherwise
854move to the next rule.
855.br
856.Cm Check-state
857rules do not have a body.
858If no
859.Cm check-state
860rule is found, the dynamic ruleset is checked at the first
861.Cm keep-state
862or
863.Cm limit
864rule.
865The
866.Ar :flowname
867is symbolic name assigned to dynamic rule by
868.Cm keep-state
869opcode.
870The special flowname
871.Cm :any
872can be used to ignore states flowname when matching.
873The
874.Cm :default
875keyword is special name used for compatibility with old rulesets.
876.It Cm count
877Update counters for all packets that match rule.
878The search continues with the next rule.
879.It Cm deny | drop
880Discard packets that match this rule.
881The search terminates.
882.It Cm divert Ar port
883Divert packets that match this rule to the
884.Xr divert 4
885socket bound to port
886.Ar port .
887The search terminates.
888.It Cm fwd | forward Ar ipaddr | tablearg Ns Op , Ns Ar port
889Change the next-hop on matching packets to
890.Ar ipaddr ,
891which can be an IP address or a host name.
892The next hop can also be supplied by the last table
893looked up for the packet by using the
894.Cm tablearg
895keyword instead of an explicit address.
896The search terminates if this rule matches.
897.Pp
898If
899.Ar ipaddr
900is a local address, then matching packets will be forwarded to
901.Ar port
902(or the port number in the packet if one is not specified in the rule)
903on the local machine.
904.br
905If
906.Ar ipaddr
907is not a local address, then the port number
908(if specified) is ignored, and the packet will be
909forwarded to the remote address, using the route as found in
910the local routing table for that IP.
911.br
912A
913.Ar fwd
914rule will not match layer2 packets (those received
915on ether_input, ether_output, or bridged).
916.br
917The
918.Cm fwd
919action does not change the contents of the packet at all.
920In particular, the destination address remains unmodified, so
921packets forwarded to another system will usually be rejected by that system
922unless there is a matching rule on that system to capture them.
923For packets forwarded locally,
924the local address of the socket will be
925set to the original destination address of the packet.
926This makes the
927.Xr netstat 1
928entry look rather weird but is intended for
929use with transparent proxy servers.
930.It Cm nat Ar nat_nr | global | tablearg
931Pass packet to a
932nat instance
933(for network address translation, address redirect, etc.):
934see the
935.Sx NETWORK ADDRESS TRANSLATION (NAT)
936Section for further information.
937.It Cm nat64lsn Ar name
938Pass packet to a stateful NAT64 instance (for IPv6/IPv4 network address and
939protocol translation): see the
940.Sx IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
941Section for further information.
942.It Cm nat64stl Ar name
943Pass packet to a stateless NAT64 instance (for IPv6/IPv4 network address and
944protocol translation): see the
945.Sx IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
946Section for further information.
947.It Cm nat64clat Ar name
948Pass packet to a CLAT NAT64 instance (for client-side IPv6/IPv4 network address and
949protocol translation): see the
950.Sx IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
951Section for further information.
952.It Cm nptv6 Ar name
953Pass packet to a NPTv6 instance (for IPv6-to-IPv6 network prefix translation):
954see the
955.Sx IPv6-to-IPv6 NETWORK PREFIX TRANSLATION (NPTv6)
956Section for further information.
957.It Cm pipe Ar pipe_nr
958Pass packet to a
959.Nm dummynet
960.Dq pipe
961(for bandwidth limitation, delay, etc.).
962See the
963.Sx TRAFFIC SHAPER (DUMMYNET) CONFIGURATION
964Section for further information.
965The search terminates; however, on exit from the pipe and if
966the
967.Xr sysctl 8
968variable
969.Va net.inet.ip.fw.one_pass
970is not set, the packet is passed again to the firewall code
971starting from the next rule.
972.It Cm queue Ar queue_nr
973Pass packet to a
974.Nm dummynet
975.Dq queue
976(for bandwidth limitation using WF2Q+).
977.It Cm reject
978(Deprecated).
979Synonym for
980.Cm unreach host .
981.It Cm reset
982Discard packets that match this rule, and if the
983packet is a TCP packet, try to send a TCP reset (RST) notice.
984The search terminates.
985.It Cm reset6
986Discard packets that match this rule, and if the
987packet is a TCP packet, try to send a TCP reset (RST) notice.
988The search terminates.
989.It Cm skipto Ar number | tablearg
990Skip all subsequent rules numbered less than
991.Ar number .
992The search continues with the first rule numbered
993.Ar number
994or higher.
995It is possible to use the
996.Cm tablearg
997keyword with a skipto for a
998.Em computed
999skipto.
1000Skipto may work either in O(log(N)) or in O(1) depending
1001on amount of memory and/or sysctl variables.
1002See the
1003.Sx SYSCTL VARIABLES
1004section for more details.
1005.It Cm call Ar number | tablearg
1006The current rule number is saved in the internal stack and
1007ruleset processing continues with the first rule numbered
1008.Ar number
1009or higher.
1010If later a rule with the
1011.Cm return
1012action is encountered, the processing returns to the first rule
1013with number of this
1014.Cm call
1015rule plus one or higher
1016(the same behaviour as with packets returning from
1017.Xr divert 4
1018socket after a
1019.Cm divert
1020action).
1021This could be used to make somewhat like an assembly language
1022.Dq subroutine
1023calls to rules with common checks for different interfaces, etc.
1024.Pp
1025Rule with any number could be called, not just forward jumps as with
1026.Cm skipto .
1027So, to prevent endless loops in case of mistakes, both
1028.Cm call
1029and
1030.Cm return
1031actions don't do any jumps and simply go to the next rule if memory
1032cannot be allocated or stack overflowed/underflowed.
1033.Pp
1034Internally stack for rule numbers is implemented using
1035.Xr mbuf_tags 9
1036facility and currently has size of 16 entries.
1037As mbuf tags are lost when packet leaves the kernel,
1038.Cm divert
1039should not be used in subroutines to avoid endless loops
1040and other undesired effects.
1041.It Cm return
1042Takes rule number saved to internal stack by the last
1043.Cm call
1044action and returns ruleset processing to the first rule
1045with number greater than number of corresponding
1046.Cm call
1047rule.
1048See description of the
1049.Cm call
1050action for more details.
1051.Pp
1052Note that
1053.Cm return
1054rules usually end a
1055.Dq subroutine
1056and thus are unconditional, but
1057.Nm
1058command-line utility currently requires every action except
1059.Cm check-state
1060to have body.
1061While it is sometimes useful to return only on some packets,
1062usually you want to print just
1063.Dq return
1064for readability.
1065A workaround for this is to use new syntax and
1066.Fl c
1067switch:
1068.Bd -literal -offset indent
1069# Add a rule without actual body
1070ipfw add 2999 return via any
1071
1072# List rules without "from any to any" part
1073ipfw -c list
1074.Ed
1075.Pp
1076This cosmetic annoyance may be fixed in future releases.
1077.It Cm tee Ar port
1078Send a copy of packets matching this rule to the
1079.Xr divert 4
1080socket bound to port
1081.Ar port .
1082The search continues with the next rule.
1083.It Cm unreach Ar code Op mtu
1084Discard packets that match this rule, and try to send an ICMP
1085unreachable notice with code
1086.Ar code ,
1087where
1088.Ar code
1089is a number from 0 to 255, or one of these aliases:
1090.Cm net , host , protocol , port ,
1091.Cm needfrag , srcfail , net-unknown , host-unknown ,
1092.Cm isolated , net-prohib , host-prohib , tosnet ,
1093.Cm toshost , filter-prohib , host-precedence
1094or
1095.Cm precedence-cutoff .
1096The
1097.Cm needfrag
1098code may have an optional
1099.Ar mtu
1100parameter.
1101If specified, the MTU value will be put into generated ICMP packet.
1102The search terminates.
1103.It Cm unreach6 Ar code
1104Discard packets that match this rule, and try to send an ICMPv6
1105unreachable notice with code
1106.Ar code ,
1107where
1108.Ar code
1109is a number from 0, 1, 3 or 4, or one of these aliases:
1110.Cm no-route, admin-prohib, address
1111or
1112.Cm port .
1113The search terminates.
1114.It Cm netgraph Ar cookie
1115Divert packet into netgraph with given
1116.Ar cookie .
1117The search terminates.
1118If packet is later returned from netgraph it is either
1119accepted or continues with the next rule, depending on
1120.Va net.inet.ip.fw.one_pass
1121sysctl variable.
1122.It Cm ngtee Ar cookie
1123A copy of packet is diverted into netgraph, original
1124packet continues with the next rule.
1125See
1126.Xr ng_ipfw 4
1127for more information on
1128.Cm netgraph
1129and
1130.Cm ngtee
1131actions.
1132.It Cm setfib Ar fibnum | tablearg
1133The packet is tagged so as to use the FIB (routing table)
1134.Ar fibnum
1135in any subsequent forwarding decisions.
1136In the current implementation, this is limited to the values 0 through 15, see
1137.Xr setfib 2 .
1138Processing continues at the next rule.
1139It is possible to use the
1140.Cm tablearg
1141keyword with setfib.
1142If the tablearg value is not within the compiled range of fibs,
1143the packet's fib is set to 0.
1144.It Cm setdscp Ar DSCP | number | tablearg
1145Set specified DiffServ codepoint for an IPv4/IPv6 packet.
1146Processing continues at the next rule.
1147Supported values are:
1148.Pp
1149.Cm cs0
1150.Pq Dv 000000 ,
1151.Cm cs1
1152.Pq Dv 001000 ,
1153.Cm cs2
1154.Pq Dv 010000 ,
1155.Cm cs3
1156.Pq Dv 011000 ,
1157.Cm cs4
1158.Pq Dv 100000 ,
1159.Cm cs5
1160.Pq Dv 101000 ,
1161.Cm cs6
1162.Pq Dv 110000 ,
1163.Cm cs7
1164.Pq Dv 111000 ,
1165.Cm af11
1166.Pq Dv 001010 ,
1167.Cm af12
1168.Pq Dv 001100 ,
1169.Cm af13
1170.Pq Dv 001110 ,
1171.Cm af21
1172.Pq Dv 010010 ,
1173.Cm af22
1174.Pq Dv 010100 ,
1175.Cm af23
1176.Pq Dv 010110 ,
1177.Cm af31
1178.Pq Dv 011010 ,
1179.Cm af32
1180.Pq Dv 011100 ,
1181.Cm af33
1182.Pq Dv 011110 ,
1183.Cm af41
1184.Pq Dv 100010 ,
1185.Cm af42
1186.Pq Dv 100100 ,
1187.Cm af43
1188.Pq Dv 100110 ,
1189.Cm va
1190.Pq Dv 101100 ,
1191.Cm ef
1192.Pq Dv 101110 ,
1193.Cm be
1194.Pq Dv 000000 .
1195Additionally, DSCP value can be specified by number (0..63).
1196It is also possible to use the
1197.Cm tablearg
1198keyword with setdscp.
1199If the tablearg value is not within the 0..63 range, lower 6 bits of supplied
1200value are used.
1201.It Cm tcp-setmss Ar mss
1202Set the Maximum Segment Size (MSS) in the TCP segment to value
1203.Ar mss .
1204The kernel module
1205.Cm ipfw_pmod
1206should be loaded or kernel should have
1207.Cm options IPFIREWALL_PMOD
1208to be able use this action.
1209This command does not change a packet if original MSS value is lower than
1210specified value.
1211Both TCP over IPv4 and over IPv6 are supported.
1212Regardless of matched a packet or not by the
1213.Cm tcp-setmss
1214rule, the search continues with the next rule.
1215.It Cm reass
1216Queue and reassemble IPv4 fragments.
1217If the packet is not fragmented, counters are updated and
1218processing continues with the next rule.
1219If the packet is the last logical fragment, the packet is reassembled and, if
1220.Va net.inet.ip.fw.one_pass
1221is set to 0, processing continues with the next rule.
1222Otherwise, the packet is allowed to pass and the search terminates.
1223If the packet is a fragment in the middle of a logical group of fragments,
1224it is consumed and
1225processing stops immediately.
1226.Pp
1227Fragment handling can be tuned via
1228.Va net.inet.ip.maxfragpackets
1229and
1230.Va net.inet.ip.maxfragsperpacket
1231which limit, respectively, the maximum number of processable
1232fragments (default: 800) and
1233the maximum number of fragments per packet (default: 16).
1234.Pp
1235NOTA BENE: since fragments do not contain port numbers,
1236they should be avoided with the
1237.Nm reass
1238rule.
1239Alternatively, direction-based (like
1240.Nm in
1241/
1242.Nm out
1243) and source-based (like
1244.Nm via
1245) match patterns can be used to select fragments.
1246.Pp
1247Usually a simple rule like:
1248.Bd -literal -offset indent
1249# reassemble incoming fragments
1250ipfw add reass all from any to any in
1251.Ed
1252.Pp
1253is all you need at the beginning of your ruleset.
1254.It Cm abort
1255Discard packets that match this rule, and if the packet is an SCTP packet,
1256try to send an SCTP packet containing an ABORT chunk.
1257The search terminates.
1258.It Cm abort6
1259Discard packets that match this rule, and if the packet is an SCTP packet,
1260try to send an SCTP packet containing an ABORT chunk.
1261The search terminates.
1262.El
1263.Ss RULE BODY
1264The body of a rule contains zero or more patterns (such as
1265specific source and destination addresses or ports,
1266protocol options, incoming or outgoing interfaces, etc.)
1267that the packet must match in order to be recognised.
1268In general, the patterns are connected by (implicit)
1269.Cm and
1270operators -- i.e., all must match in order for the
1271rule to match.
1272Individual patterns can be prefixed by the
1273.Cm not
1274operator to reverse the result of the match, as in
1275.Pp
1276.Dl "ipfw add 100 allow ip from not 1.2.3.4 to any"
1277.Pp
1278Additionally, sets of alternative match patterns
1279.Pq Em or-blocks
1280can be constructed by putting the patterns in
1281lists enclosed between parentheses ( ) or braces { }, and
1282using the
1283.Cm or
1284operator as follows:
1285.Pp
1286.Dl "ipfw add 100 allow ip from { x or not y or z } to any"
1287.Pp
1288Only one level of parentheses is allowed.
1289Beware that most shells have special meanings for parentheses
1290or braces, so it is advisable to put a backslash \\ in front of them
1291to prevent such interpretations.
1292.Pp
1293The body of a rule must in general include a source and destination
1294address specifier.
1295The keyword
1296.Ar any
1297can be used in various places to specify that the content of
1298a required field is irrelevant.
1299.Pp
1300The rule body has the following format:
1301.Bd -ragged -offset indent
1302.Op Ar proto Cm from Ar src Cm to Ar dst
1303.Op Ar options
1304.Ed
1305.Pp
1306The first part (proto from src to dst) is for backward
1307compatibility with earlier versions of
1308.Fx .
1309In modern
1310.Fx
1311any match pattern (including MAC headers, IP protocols,
1312addresses and ports) can be specified in the
1313.Ar options
1314section.
1315.Pp
1316Rule fields have the following meaning:
1317.Bl -tag -width indent
1318.It Ar proto : protocol | Cm { Ar protocol Cm or ... }
1319.It Ar protocol : Oo Cm not Oc Ar protocol-name | protocol-number
1320An IP protocol specified by number or name
1321(for a complete list see
1322.Pa /etc/protocols ) ,
1323or one of the following keywords:
1324.Bl -tag -width indent
1325.It Cm ip4 | ipv4
1326Matches IPv4 packets.
1327.It Cm ip6 | ipv6
1328Matches IPv6 packets.
1329.It Cm ip | all
1330Matches any packet.
1331.El
1332.Pp
1333The
1334.Cm ipv6
1335in
1336.Cm proto
1337option will be treated as inner protocol.
1338And, the
1339.Cm ipv4
1340is not available in
1341.Cm proto
1342option.
1343.Pp
1344The
1345.Cm { Ar protocol Cm or ... }
1346format (an
1347.Em or-block )
1348is provided for convenience only but its use is deprecated.
1349.It Ar src No and Ar dst : Bro Cm addr | Cm { Ar addr Cm or ... } Brc Op Oo Cm not Oc Ar ports
1350An address (or a list, see below)
1351optionally followed by
1352.Ar ports
1353specifiers.
1354.Pp
1355The second format
1356.Em ( or-block
1357with multiple addresses) is provided for convenience only and
1358its use is discouraged.
1359.It Ar addr : Oo Cm not Oc Bro
1360.Cm any | me | me6 |
1361.Cm table Ns Pq Ar name Ns Op , Ns Ar value
1362.Ar | addr-list | addr-set
1363.Brc
1364.Bl -tag -width indent
1365.It Cm any
1366Matches any IP address.
1367.It Cm me
1368Matches any IP address configured on an interface in the system.
1369.It Cm me6
1370Matches any IPv6 address configured on an interface in the system.
1371The address list is evaluated at the time the packet is
1372analysed.
1373.It Cm table Ns Pq Ar name Ns Op , Ns Ar value
1374Matches any IPv4 or IPv6 address for which an entry exists in the lookup table
1375.Ar number .
1376If an optional 32-bit unsigned
1377.Ar value
1378is also specified, an entry will match only if it has this value.
1379See the
1380.Sx LOOKUP TABLES
1381section below for more information on lookup tables.
1382.El
1383.It Ar addr-list : ip-addr Ns Op Ns , Ns Ar addr-list
1384.It Ar ip-addr :
1385A host or subnet address specified in one of the following ways:
1386.Bl -tag -width indent
1387.It Ar numeric-ip | hostname
1388Matches a single IPv4 address, specified as dotted-quad or a hostname.
1389Hostnames are resolved at the time the rule is added to the firewall list.
1390.It Ar addr Ns / Ns Ar masklen
1391Matches all addresses with base
1392.Ar addr
1393(specified as an IP address, a network number, or a hostname)
1394and mask width of
1395.Cm masklen
1396bits.
1397As an example, 1.2.3.4/25 or 1.2.3.0/25 will match
1398all IP numbers from 1.2.3.0 to 1.2.3.127 .
1399.It Ar addr Ns : Ns Ar mask
1400Matches all addresses with base
1401.Ar addr
1402(specified as an IP address, a network number, or a hostname)
1403and the mask of
1404.Ar mask ,
1405specified as a dotted quad.
1406As an example, 1.2.3.4:255.0.255.0 or 1.0.3.0:255.0.255.0 will match
14071.*.3.*.
1408This form is advised only for non-contiguous
1409masks.
1410It is better to resort to the
1411.Ar addr Ns / Ns Ar masklen
1412format for contiguous masks, which is more compact and less
1413error-prone.
1414.El
1415.It Ar addr-set : addr Ns Oo Ns / Ns Ar masklen Oc Ns Cm { Ns Ar list Ns Cm }
1416.It Ar list : Bro Ar num | num-num Brc Ns Op Ns , Ns Ar list
1417Matches all addresses with base address
1418.Ar addr
1419(specified as an IP address, a network number, or a hostname)
1420and whose last byte is in the list between braces { } .
1421Note that there must be no spaces between braces and
1422numbers (spaces after commas are allowed).
1423Elements of the list can be specified as single entries
1424or ranges.
1425The
1426.Ar masklen
1427field is used to limit the size of the set of addresses,
1428and can have any value between 24 and 32.
1429If not specified,
1430it will be assumed as 24.
1431.br
1432This format is particularly useful to handle sparse address sets
1433within a single rule.
1434Because the matching occurs using a
1435bitmask, it takes constant time and dramatically reduces
1436the complexity of rulesets.
1437.br
1438As an example, an address specified as 1.2.3.4/24{128,35-55,89}
1439or 1.2.3.0/24{128,35-55,89}
1440will match the following IP addresses:
1441.br
14421.2.3.128, 1.2.3.35 to 1.2.3.55, 1.2.3.89 .
1443.It Ar addr6-list : ip6-addr Ns Op Ns , Ns Ar addr6-list
1444.It Ar ip6-addr :
1445A host or subnet specified one of the following ways:
1446.Bl -tag -width indent
1447.It Ar numeric-ip | hostname
1448Matches a single IPv6 address as allowed by
1449.Xr inet_pton 3
1450or a hostname.
1451Hostnames are resolved at the time the rule is added to the firewall
1452list.
1453.It Ar addr Ns / Ns Ar masklen
1454Matches all IPv6 addresses with base
1455.Ar addr
1456(specified as allowed by
1457.Xr inet_pton 3
1458or a hostname)
1459and mask width of
1460.Cm masklen
1461bits.
1462.It Ar addr Ns / Ns Ar mask
1463Matches all IPv6 addresses with base
1464.Ar addr
1465(specified as allowed by
1466.Xr inet_pton 3
1467or a hostname)
1468and the mask of
1469.Ar mask ,
1470specified as allowed by
1471.Xr inet_pton 3 .
1472As an example, fe::640:0:0/ffff::ffff:ffff:0:0 will match
1473fe:*:*:*:0:640:*:*.
1474This form is advised only for non-contiguous
1475masks.
1476It is better to resort to the
1477.Ar addr Ns / Ns Ar masklen
1478format for contiguous masks, which is more compact and less
1479error-prone.
1480.El
1481.Pp
1482No support for sets of IPv6 addresses is provided because IPv6 addresses
1483are typically random past the initial prefix.
1484.It Ar ports : Bro Ar port | port Ns \&- Ns Ar port Ns Brc Ns Op , Ns Ar ports
1485For protocols which support port numbers (such as SCTP, TCP and UDP), optional
1486.Cm ports
1487may be specified as one or more ports or port ranges, separated
1488by commas but no spaces, and an optional
1489.Cm not
1490operator.
1491The
1492.Ql \&-
1493notation specifies a range of ports (including boundaries).
1494.Pp
1495Service names (from
1496.Pa /etc/services )
1497may be used instead of numeric port values.
1498The length of the port list is limited to 30 ports or ranges,
1499though one can specify larger ranges by using an
1500.Em or-block
1501in the
1502.Cm options
1503section of the rule.
1504.Pp
1505A backslash
1506.Pq Ql \e
1507can be used to escape the dash
1508.Pq Ql -
1509character in a service name (from a shell, the backslash must be
1510typed twice to avoid the shell itself interpreting it as an escape
1511character).
1512.Pp
1513.Dl "ipfw add count tcp from any ftp\e\e-data-ftp to any"
1514.Pp
1515Fragmented packets which have a non-zero offset (i.e., not the first
1516fragment) will never match a rule which has one or more port
1517specifications.
1518See the
1519.Cm frag
1520option for details on matching fragmented packets.
1521.El
1522.Ss RULE OPTIONS (MATCH PATTERNS)
1523Additional match patterns can be used within
1524rules.
1525Zero or more of these so-called
1526.Em options
1527can be present in a rule, optionally prefixed by the
1528.Cm not
1529operand, and possibly grouped into
1530.Em or-blocks .
1531.Pp
1532The following match patterns can be used (listed in alphabetical order):
1533.Bl -tag -width indent
1534.It Cm // this is a comment .
1535Inserts the specified text as a comment in the rule.
1536Everything following // is considered as a comment and stored in the rule.
1537You can have comment-only rules, which are listed as having a
1538.Cm count
1539action followed by the comment.
1540.It Cm bridged
1541Alias for
1542.Cm layer2 .
1543.It Cm defer-immediate-action | defer-action
1544A rule with this option will not perform normal action
1545upon a match.
1546This option is intended to be used with
1547.Cm record-state
1548or
1549.Cm keep-state
1550as the dynamic rule, created but ignored on match, will work
1551as intended.
1552Rules with both
1553.Cm record-state
1554and
1555.Cm defer-immediate-action
1556create a dynamic rule and continue with the next rule without actually
1557performing the action part of this rule.
1558When the rule is later activated via the state table, the action is
1559performed as usual.
1560.It Cm diverted
1561Matches only packets generated by a divert socket.
1562.It Cm diverted-loopback
1563Matches only packets coming from a divert socket back into the IP stack
1564input for delivery.
1565.It Cm diverted-output
1566Matches only packets going from a divert socket back outward to the IP
1567stack output for delivery.
1568.It Cm dst-ip Ar ip-address
1569Matches IPv4 packets whose destination IP is one of the address(es)
1570specified as argument.
1571.It Bro Cm dst-ip6 | dst-ipv6 Brc Ar ip6-address
1572Matches IPv6 packets whose destination IP is one of the address(es)
1573specified as argument.
1574.It Cm dst-port Ar ports
1575Matches IP packets whose destination port is one of the port(s)
1576specified as argument.
1577.It Cm established
1578Matches TCP packets that have the RST or ACK bits set.
1579.It Cm ext6hdr Ar header
1580Matches IPv6 packets containing the extended header given by
1581.Ar header .
1582Supported headers are:
1583.Pp
1584Fragment,
1585.Pq Cm frag ,
1586Hop-to-hop options
1587.Pq Cm hopopt ,
1588any type of Routing Header
1589.Pq Cm route ,
1590Source routing Routing Header Type 0
1591.Pq Cm rthdr0 ,
1592Mobile IPv6 Routing Header Type 2
1593.Pq Cm rthdr2 ,
1594Destination options
1595.Pq Cm dstopt ,
1596IPSec authentication headers
1597.Pq Cm ah ,
1598and IPsec encapsulated security payload headers
1599.Pq Cm esp .
1600.It Cm fib Ar fibnum
1601Matches a packet that has been tagged to use
1602the given FIB (routing table) number.
1603.It Cm flow Ar table Ns Pq Ar name Ns Op , Ns Ar value
1604Search for the flow entry in lookup table
1605.Ar name .
1606If not found, the match fails.
1607Otherwise, the match succeeds and
1608.Cm tablearg
1609is set to the value extracted from the table.
1610.Pp
1611This option can be useful to quickly dispatch traffic based on
1612certain packet fields.
1613See the
1614.Sx LOOKUP TABLES
1615section below for more information on lookup tables.
1616.It Cm flow-id Ar labels
1617Matches IPv6 packets containing any of the flow labels given in
1618.Ar labels .
1619.Ar labels
1620is a comma separated list of numeric flow labels.
1621.It Cm dst-mac Ar table Ns Pq Ar name Ns Op , Ns Ar value
1622Search for the destination MAC address entry in lookup table
1623.Ar name .
1624If not found, the match fails.
1625Otherwise, the match succeeds and
1626.Cm tablearg
1627is set to the value extracted from the table.
1628.It Cm src-mac Ar table Ns Pq Ar name Ns Op , Ns Ar value
1629Search for the source MAC address entry in lookup table
1630.Ar name .
1631If not found, the match fails.
1632Otherwise, the match succeeds and
1633.Cm tablearg
1634is set to the value extracted from the table.
1635.It Cm frag Ar spec
1636Matches IPv4 packets whose
1637.Cm ip_off
1638field contains the comma separated list of IPv4 fragmentation
1639options specified in
1640.Ar spec .
1641The recognized options are:
1642.Cm df
1643.Pq Dv don't fragment ,
1644.Cm mf
1645.Pq Dv more fragments ,
1646.Cm rf
1647.Pq Dv reserved fragment bit
1648.Cm offset
1649.Pq Dv non-zero fragment offset .
1650The absence of a particular options may be denoted
1651with a
1652.Ql \&! .
1653.Pp
1654Empty list of options defaults to matching on non-zero fragment offset.
1655Such rule would match all not the first fragment datagrams,
1656both IPv4 and IPv6.
1657This is a backward compatibility with older rulesets.
1658.It Cm gid Ar group
1659Matches all TCP or UDP packets sent by or received for a
1660.Ar group .
1661A
1662.Ar group
1663may be specified by name or number.
1664.It Cm jail Ar jail
1665Matches all TCP or UDP packets sent by or received for the
1666jail whose ID or name is
1667.Ar jail .
1668.It Cm icmptypes Ar types
1669Matches ICMP packets whose ICMP type is in the list
1670.Ar types .
1671The list may be specified as any combination of
1672individual types (numeric) separated by commas.
1673.Em Ranges are not allowed .
1674The supported ICMP types are:
1675.Pp
1676echo reply
1677.Pq Cm 0 ,
1678destination unreachable
1679.Pq Cm 3 ,
1680source quench
1681.Pq Cm 4 ,
1682redirect
1683.Pq Cm 5 ,
1684echo request
1685.Pq Cm 8 ,
1686router advertisement
1687.Pq Cm 9 ,
1688router solicitation
1689.Pq Cm 10 ,
1690time-to-live exceeded
1691.Pq Cm 11 ,
1692IP header bad
1693.Pq Cm 12 ,
1694timestamp request
1695.Pq Cm 13 ,
1696timestamp reply
1697.Pq Cm 14 ,
1698information request
1699.Pq Cm 15 ,
1700information reply
1701.Pq Cm 16 ,
1702address mask request
1703.Pq Cm 17
1704and address mask reply
1705.Pq Cm 18 .
1706.It Cm icmp6types Ar types
1707Matches ICMP6 packets whose ICMP6 type is in the list of
1708.Ar types .
1709The list may be specified as any combination of
1710individual types (numeric) separated by commas.
1711.Em Ranges are not allowed .
1712.It Cm in | out
1713Matches incoming or outgoing packets, respectively.
1714.Cm in
1715and
1716.Cm out
1717are mutually exclusive (in fact,
1718.Cm out
1719is implemented as
1720.Cm not in Ns No ).
1721.It Cm ipid Ar id-list
1722Matches IPv4 packets whose
1723.Cm ip_id
1724field has value included in
1725.Ar id-list ,
1726which is either a single value or a list of values or ranges
1727specified in the same way as
1728.Ar ports .
1729.It Cm iplen Ar len-list
1730Matches IP packets whose total length, including header and data, is
1731in the set
1732.Ar len-list ,
1733which is either a single value or a list of values or ranges
1734specified in the same way as
1735.Ar ports .
1736.It Cm ipoptions Ar spec
1737Matches packets whose IPv4 header contains the comma separated list of
1738options specified in
1739.Ar spec .
1740The supported IP options are:
1741.Pp
1742.Cm ssrr
1743(strict source route),
1744.Cm lsrr
1745(loose source route),
1746.Cm rr
1747(record packet route) and
1748.Cm ts
1749(timestamp).
1750The absence of a particular option may be denoted
1751with a
1752.Ql \&! .
1753.It Cm ipprecedence Ar precedence
1754Matches IPv4 packets whose precedence field is equal to
1755.Ar precedence .
1756.It Cm ipsec
1757Matches packets that have IPSEC history associated with them
1758(i.e., the packet comes encapsulated in IPSEC, the kernel
1759has IPSEC support, and can correctly decapsulate it).
1760.Pp
1761Note that specifying
1762.Cm ipsec
1763is different from specifying
1764.Cm proto Ar ipsec
1765as the latter will only look at the specific IP protocol field,
1766irrespective of IPSEC kernel support and the validity of the IPSEC data.
1767.Pp
1768Further note that this flag is silently ignored in kernels without
1769IPSEC support.
1770It does not affect rule processing when given and the
1771rules are handled as if with no
1772.Cm ipsec
1773flag.
1774.It Cm iptos Ar spec
1775Matches IPv4 packets whose
1776.Cm tos
1777field contains the comma separated list of
1778service types specified in
1779.Ar spec .
1780The supported IP types of service are:
1781.Pp
1782.Cm lowdelay
1783.Pq Dv IPTOS_LOWDELAY ,
1784.Cm throughput
1785.Pq Dv IPTOS_THROUGHPUT ,
1786.Cm reliability
1787.Pq Dv IPTOS_RELIABILITY ,
1788.Cm mincost
1789.Pq Dv IPTOS_MINCOST ,
1790.Cm congestion
1791.Pq Dv IPTOS_ECN_CE .
1792The absence of a particular type may be denoted
1793with a
1794.Ql \&! .
1795.It Cm dscp spec Ns Op , Ns Ar spec
1796Matches IPv4/IPv6 packets whose
1797.Cm DS
1798field value is contained in
1799.Ar spec
1800mask.
1801Multiple values can be specified via
1802the comma separated list.
1803Value can be one of keywords used in
1804.Cm setdscp
1805action or exact number.
1806.It Cm ipttl Ar ttl-list
1807Matches IPv4 packets whose time to live is included in
1808.Ar ttl-list ,
1809which is either a single value or a list of values or ranges
1810specified in the same way as
1811.Ar ports .
1812.It Cm ipversion Ar ver
1813Matches IP packets whose IP version field is
1814.Ar ver .
1815.It Cm keep-state Op Ar :flowname
1816Upon a match, the firewall will create a dynamic rule, whose
1817default behaviour is to match bidirectional traffic between
1818source and destination IP/port using the same protocol.
1819The rule has a limited lifetime (controlled by a set of
1820.Xr sysctl 8
1821variables), and the lifetime is refreshed every time a matching
1822packet is found.
1823The
1824.Ar :flowname
1825is used to assign additional to addresses, ports and protocol parameter
1826to dynamic rule.
1827It can be used for more accurate matching by
1828.Cm check-state
1829rule.
1830The
1831.Cm :default
1832keyword is special name used for compatibility with old rulesets.
1833.It Cm layer2
1834Matches only layer2 packets, i.e., those passed to
1835.Nm
1836from
1837.Fn ether_demux
1838and
1839.Fn ether_output_frame .
1840.It Cm limit Bro Cm src-addr | src-port | dst-addr | dst-port Brc Ar N Op Ar :flowname
1841The firewall will only allow
1842.Ar N
1843connections with the same
1844set of parameters as specified in the rule.
1845One or more
1846of source and destination addresses and ports can be
1847specified.
1848.It Cm lookup Bro Cm dst-ip | dst-port | dst-mac | src-ip | src-port | src-mac | uid | jail Brc Ar name
1849Search an entry in lookup table
1850.Ar name
1851that matches the field specified as argument.
1852If not found, the match fails.
1853Otherwise, the match succeeds and
1854.Cm tablearg
1855is set to the value extracted from the table.
1856.Pp
1857This option can be useful to quickly dispatch traffic based on
1858certain packet fields.
1859See the
1860.Sx LOOKUP TABLES
1861section below for more information on lookup tables.
1862.It Cm { MAC | mac } Ar dst-mac src-mac
1863Match packets with a given
1864.Ar dst-mac
1865and
1866.Ar src-mac
1867addresses, specified as the
1868.Cm any
1869keyword (matching any MAC address), or six groups of hex digits
1870separated by colons,
1871and optionally followed by a mask indicating the significant bits.
1872The mask may be specified using either of the following methods:
1873.Bl -enum -width indent
1874.It
1875A slash
1876.Pq /
1877followed by the number of significant bits.
1878For example, an address with 33 significant bits could be specified as:
1879.Pp
1880.Dl "MAC 10:20:30:40:50:60/33 any"
1881.It
1882An ampersand
1883.Pq &
1884followed by a bitmask specified as six groups of hex digits separated
1885by colons.
1886For example, an address in which the last 16 bits are significant could
1887be specified as:
1888.Pp
1889.Dl "MAC 10:20:30:40:50:60&00:00:00:00:ff:ff any"
1890.Pp
1891Note that the ampersand character has a special meaning in many shells
1892and should generally be escaped.
1893.El
1894Note that the order of MAC addresses (destination first,
1895source second) is
1896the same as on the wire, but the opposite of the one used for
1897IP addresses.
1898.It Cm mac-type Ar mac-type
1899Matches packets whose Ethernet Type field
1900corresponds to one of those specified as argument.
1901.Ar mac-type
1902is specified in the same way as
1903.Cm port numbers
1904(i.e., one or more comma-separated single values or ranges).
1905You can use symbolic names for known values such as
1906.Em vlan , ipv4, ipv6 .
1907Values can be entered as decimal or hexadecimal (if prefixed by 0x),
1908and they are always printed as hexadecimal (unless the
1909.Cm -N
1910option is used, in which case symbolic resolution will be attempted).
1911.It Cm proto Ar protocol
1912Matches packets with the corresponding IP protocol.
1913.It Cm record-state
1914Upon a match, the firewall will create a dynamic rule as if
1915.Cm keep-state
1916was specified.
1917However, this option doesn't imply an implicit
1918.Cm check-state
1919in contrast to
1920.Cm keep-state .
1921.It Cm recv | xmit | via Brq Ar ifX | Ar if Ns Cm * | Ar table Ns Po Ar name Ns Oo , Ns Ar value Oc Pc | Ar ipno | Ar any
1922Matches packets received, transmitted or going through,
1923respectively, the interface specified by exact name
1924.Po Ar ifX Pc ,
1925by device name
1926.Po Ar if* Pc ,
1927by IP address, or through some interface.
1928Table
1929.Ar name
1930may be used to match interface by its kernel ifindex.
1931See the
1932.Sx LOOKUP TABLES
1933section below for more information on lookup tables.
1934.Pp
1935The
1936.Cm via
1937keyword causes the interface to always be checked.
1938If
1939.Cm recv
1940or
1941.Cm xmit
1942is used instead of
1943.Cm via ,
1944then only the receive or transmit interface (respectively)
1945is checked.
1946By specifying both, it is possible to match packets based on
1947both receive and transmit interface, e.g.:
1948.Pp
1949.Dl "ipfw add deny ip from any to any out recv ed0 xmit ed1"
1950.Pp
1951The
1952.Cm recv
1953interface can be tested on either incoming or outgoing packets,
1954while the
1955.Cm xmit
1956interface can only be tested on outgoing packets.
1957So
1958.Cm out
1959is required (and
1960.Cm in
1961is invalid) whenever
1962.Cm xmit
1963is used.
1964.Pp
1965A packet might not have a receive or transmit interface: packets
1966originating from the local host have no receive interface,
1967while packets destined for the local host have no transmit
1968interface.
1969.It Cm set-limit Bro Cm src-addr | src-port | dst-addr | dst-port Brc Ar N
1970Works like
1971.Cm limit
1972but does not have an implicit
1973.Cm check-state
1974attached to it.
1975.It Cm setup
1976Matches TCP packets that have the SYN bit set but no ACK bit.
1977This is the short form of
1978.Dq Li tcpflags\ syn,!ack .
1979.It Cm sockarg
1980Matches packets that are associated to a local socket and
1981for which the SO_USER_COOKIE socket option has been set
1982to a non-zero value.
1983As a side effect, the value of the
1984option is made available as
1985.Cm tablearg
1986value, which in turn can be used as
1987.Cm skipto
1988or
1989.Cm pipe
1990number.
1991.It Cm src-ip Ar ip-address
1992Matches IPv4 packets whose source IP is one of the address(es)
1993specified as an argument.
1994.It Cm src-ip6 Ar ip6-address
1995Matches IPv6 packets whose source IP is one of the address(es)
1996specified as an argument.
1997.It Cm src-port Ar ports
1998Matches IP packets whose source port is one of the port(s)
1999specified as argument.
2000.It Cm tagged Ar tag-list
2001Matches packets whose tags are included in
2002.Ar tag-list ,
2003which is either a single value or a list of values or ranges
2004specified in the same way as
2005.Ar ports .
2006Tags can be applied to the packet using
2007.Cm tag
2008rule action parameter (see it's description for details on tags).
2009.It Cm tcpack Ar ack
2010TCP packets only.
2011Match if the TCP header acknowledgment number field is set to
2012.Ar ack .
2013.It Cm tcpdatalen Ar tcpdatalen-list
2014Matches TCP packets whose length of TCP data is
2015.Ar tcpdatalen-list ,
2016which is either a single value or a list of values or ranges
2017specified in the same way as
2018.Ar ports .
2019.It Cm tcpflags Ar spec
2020TCP packets only.
2021Match if the TCP header contains the comma separated list of
2022flags specified in
2023.Ar spec .
2024The supported TCP flags are:
2025.Pp
2026.Cm fin ,
2027.Cm syn ,
2028.Cm rst ,
2029.Cm psh ,
2030.Cm ack
2031and
2032.Cm urg .
2033The absence of a particular flag may be denoted
2034with a
2035.Ql \&! .
2036A rule which contains a
2037.Cm tcpflags
2038specification can never match a fragmented packet which has
2039a non-zero offset.
2040See the
2041.Cm frag
2042option for details on matching fragmented packets.
2043.It Cm tcpmss Ar tcpmss-list
2044Matches TCP packets whose MSS (maximum segment size) value is set to
2045.Ar tcpmss-list ,
2046which is either a single value or a list of values or ranges
2047specified in the same way as
2048.Ar ports .
2049.It Cm tcpseq Ar seq
2050TCP packets only.
2051Match if the TCP header sequence number field is set to
2052.Ar seq .
2053.It Cm tcpwin Ar tcpwin-list
2054Matches TCP packets whose  header window field is set to
2055.Ar tcpwin-list ,
2056which is either a single value or a list of values or ranges
2057specified in the same way as
2058.Ar ports .
2059.It Cm tcpoptions Ar spec
2060TCP packets only.
2061Match if the TCP header contains the comma separated list of
2062options specified in
2063.Ar spec .
2064The supported TCP options are:
2065.Pp
2066.Cm mss
2067(maximum segment size),
2068.Cm window
2069(tcp window advertisement),
2070.Cm sack
2071(selective ack),
2072.Cm ts
2073(rfc1323 timestamp) and
2074.Cm cc
2075(rfc1644 t/tcp connection count).
2076The absence of a particular option may be denoted
2077with a
2078.Ql \&! .
2079.It Cm uid Ar user
2080Match all TCP or UDP packets sent by or received for a
2081.Ar user .
2082A
2083.Ar user
2084may be matched by name or identification number.
2085.It Cm verrevpath
2086For incoming packets,
2087a routing table lookup is done on the packet's source address.
2088If the interface on which the packet entered the system matches the
2089outgoing interface for the route,
2090the packet matches.
2091If the interfaces do not match up,
2092the packet does not match.
2093All outgoing packets or packets with no incoming interface match.
2094.Pp
2095The name and functionality of the option is intentionally similar to
2096the Cisco IOS command:
2097.Pp
2098.Dl ip verify unicast reverse-path
2099.Pp
2100This option can be used to make anti-spoofing rules to reject all
2101packets with source addresses not from this interface.
2102See also the option
2103.Cm antispoof .
2104.It Cm versrcreach
2105For incoming packets,
2106a routing table lookup is done on the packet's source address.
2107If a route to the source address exists, but not the default route
2108or a blackhole/reject route, the packet matches.
2109Otherwise, the packet does not match.
2110All outgoing packets match.
2111.Pp
2112The name and functionality of the option is intentionally similar to
2113the Cisco IOS command:
2114.Pp
2115.Dl ip verify unicast source reachable-via any
2116.Pp
2117This option can be used to make anti-spoofing rules to reject all
2118packets whose source address is unreachable.
2119.It Cm antispoof
2120For incoming packets, the packet's source address is checked if it
2121belongs to a directly connected network.
2122If the network is directly connected, then the interface the packet
2123came on in is compared to the interface the network is connected to.
2124When incoming interface and directly connected interface are not the
2125same, the packet does not match.
2126Otherwise, the packet does match.
2127All outgoing packets match.
2128.Pp
2129This option can be used to make anti-spoofing rules to reject all
2130packets that pretend to be from a directly connected network but do
2131not come in through that interface.
2132This option is similar to but more restricted than
2133.Cm verrevpath
2134because it engages only on packets with source addresses of directly
2135connected networks instead of all source addresses.
2136.El
2137.Sh LOOKUP TABLES
2138Lookup tables are useful to handle large sparse sets of
2139addresses or other search keys (e.g., ports, jail IDs, interface names).
2140In the rest of this section we will use the term ``key''.
2141Table name needs to match the following spec:
2142.Ar table-name .
2143Tables with the same name can be created in different
2144.Ar sets .
2145However, rule links to the tables in
2146.Ar set 0
2147by default.
2148This behavior can be controlled by
2149.Va net.inet.ip.fw.tables_sets
2150variable.
2151See the
2152.Sx SETS OF RULES
2153section for more information.
2154There may be up to 65535 different lookup tables.
2155.Pp
2156The following table types are supported:
2157.Bl -tag -width indent
2158.It Ar table-type : Ar addr | iface | number | flow | mac
2159.It Ar table-key : Ar addr Ns Oo / Ns Ar masklen Oc | iface-name | number | flow-spec
2160.It Ar flow-spec : Ar flow-field Ns Op , Ns Ar flow-spec
2161.It Ar flow-field : src-ip | proto | src-port | dst-ip | dst-port
2162.It Cm addr
2163Matches IPv4 or IPv6 address.
2164Each entry is represented by an
2165.Ar addr Ns Op / Ns Ar masklen
2166and will match all addresses with base
2167.Ar addr
2168(specified as an IPv4/IPv6 address, or a hostname) and mask width of
2169.Ar masklen
2170bits.
2171If
2172.Ar masklen
2173is not specified, it defaults to 32 for IPv4 and 128 for IPv6.
2174When looking up an IP address in a table, the most specific
2175entry will match.
2176.It Cm iface
2177Matches interface names.
2178Each entry is represented by string treated as interface name.
2179Wildcards are not supported.
2180.It Cm number
2181Matches protocol ports, uids/gids or jail IDs.
2182Each entry is represented by 32-bit unsigned integer.
2183Ranges are not supported.
2184.It Cm flow
2185Matches packet fields specified by
2186.Ar flow
2187type suboptions with table entries.
2188.It Cm mac
2189Matches MAC address.
2190Each entry is represented by an
2191.Ar addr Ns Op / Ns Ar masklen
2192and will match all addresses with base
2193.Ar addr
2194and mask width of
2195.Ar masklen
2196bits.
2197If
2198.Ar masklen
2199is not specified, it defaults to 48.
2200When looking up an MAC address in a table, the most specific
2201entry will match.
2202.El
2203.Pp
2204Tables require explicit creation via
2205.Cm create
2206before use.
2207.Pp
2208The following creation options are supported:
2209.Bl -tag -width indent
2210.It Ar create-options : Ar create-option | create-options
2211.It Ar create-option : Cm type Ar table-type | Cm valtype Ar value-mask | Cm algo Ar algo-desc |
2212.Cm limit Ar number | Cm locked | Cm missing | Cm or-flush
2213.It Cm type
2214Table key type.
2215.It Cm valtype
2216Table value mask.
2217.It Cm algo
2218Table algorithm to use (see below).
2219.It Cm limit
2220Maximum number of items that may be inserted into table.
2221.It Cm locked
2222Restrict any table modifications.
2223.It Cm missing
2224Do not fail if table already exists and has exactly same options as new one.
2225.It Cm or-flush
2226Flush existing table with same name instead of returning error.
2227Implies
2228.Cm missing
2229so existing table must be compatible with new one.
2230.El
2231.Pp
2232Some of these options may be modified later via
2233.Cm modify
2234keyword.
2235The following options can be changed:
2236.Bl -tag -width indent
2237.It Ar modify-options : Ar modify-option | modify-options
2238.It Ar modify-option : Cm limit Ar number
2239.It Cm limit
2240Alter maximum number of items that may be inserted into table.
2241.El
2242.Pp
2243Additionally, table can be locked or unlocked using
2244.Cm lock
2245or
2246.Cm unlock
2247commands.
2248.Pp
2249Tables of the same
2250.Ar type
2251can be swapped with each other using
2252.Cm swap Ar name
2253command.
2254Swap may fail if tables limits are set and data exchange
2255would result in limits hit.
2256Operation is performed atomically.
2257.Pp
2258One or more entries can be added to a table at once using
2259.Cm add
2260command.
2261Addition of all items are performed atomically.
2262By default, error in addition of one entry does not influence
2263addition of other entries.
2264However, non-zero error code is returned in that case.
2265Special
2266.Cm atomic
2267keyword may be specified before
2268.Cm add
2269to indicate all-or-none add request.
2270.Pp
2271One or more entries can be removed from a table at once using
2272.Cm delete
2273command.
2274By default, error in removal of one entry does not influence
2275removing of other entries.
2276However, non-zero error code is returned in that case.
2277.Pp
2278It may be possible to check what entry will be found on particular
2279.Ar table-key
2280using
2281.Cm lookup
2282.Ar table-key
2283command.
2284This functionality is optional and may be unsupported in some algorithms.
2285.Pp
2286The following operations can be performed on
2287.Ar one
2288or
2289.Cm all
2290tables:
2291.Bl -tag -width indent
2292.It Cm list
2293List all entries.
2294.It Cm flush
2295Removes all entries.
2296.It Cm info
2297Shows generic table information.
2298.It Cm detail
2299Shows generic table information and algo-specific data.
2300.El
2301.Pp
2302The following lookup algorithms are supported:
2303.Bl -tag -width indent
2304.It Ar algo-desc : algo-name | "algo-name algo-data"
2305.It Ar algo-name : Ar addr: radix | addr: hash | iface: array | number: array | flow: hash | mac: radix
2306.It Cm addr: radix
2307Separate Radix trees for IPv4 and IPv6, the same way as the routing table (see
2308.Xr route 4 ) .
2309Default choice for
2310.Ar addr
2311type.
2312.It Cm addr:hash
2313Separate auto-growing hashes for IPv4 and IPv6.
2314Accepts entries with the same mask length specified initially via
2315.Cm "addr:hash masks=/v4,/v6"
2316algorithm creation options.
2317Assume /32 and /128 masks by default.
2318Search removes host bits (according to mask) from supplied address and checks
2319resulting key in appropriate hash.
2320Mostly optimized for /64 and byte-ranged IPv6 masks.
2321.It Cm iface:array
2322Array storing sorted indexes for entries which are presented in the system.
2323Optimized for very fast lookup.
2324.It Cm number:array
2325Array storing sorted u32 numbers.
2326.It Cm flow:hash
2327Auto-growing hash storing flow entries.
2328Search calculates hash on required packet fields and searches for matching
2329entries in selected bucket.
2330.It Cm mac: radix
2331Radix tree for MAC address
2332.El
2333.Pp
2334The
2335.Cm tablearg
2336feature provides the ability to use a value, looked up in the table, as
2337the argument for a rule action, action parameter or rule option.
2338This can significantly reduce number of rules in some configurations.
2339If two tables are used in a rule, the result of the second (destination)
2340is used.
2341.Pp
2342Each record may hold one or more values according to
2343.Ar value-mask .
2344This mask is set on table creation via
2345.Cm valtype
2346option.
2347The following value types are supported:
2348.Bl -tag -width indent
2349.It Ar value-mask : Ar value-type Ns Op , Ns Ar value-mask
2350.It Ar value-type : Ar skipto | pipe | fib | nat | dscp | tag | divert |
2351.Ar netgraph | limit | ipv4
2352.It Cm skipto
2353rule number to jump to.
2354.It Cm pipe
2355Pipe number to use.
2356.It Cm fib
2357fib number to match/set.
2358.It Cm nat
2359nat number to jump to.
2360.It Cm dscp
2361dscp value to match/set.
2362.It Cm tag
2363tag number to match/set.
2364.It Cm divert
2365port number to divert traffic to.
2366.It Cm netgraph
2367hook number to move packet to.
2368.It Cm limit
2369maximum number of connections.
2370.It Cm ipv4
2371IPv4 nexthop to fwd packets to.
2372.It Cm ipv6
2373IPv6 nexthop to fwd packets to.
2374.El
2375.Pp
2376The
2377.Cm tablearg
2378argument can be used with the following actions:
2379.Cm nat, pipe, queue, divert, tee, netgraph, ngtee, fwd, skipto, setfib ,
2380action parameters:
2381.Cm tag, untag ,
2382rule options:
2383.Cm limit, tagged .
2384.Pp
2385When used with the
2386.Cm skipto
2387action, the user should be aware that the code will walk the ruleset
2388up to a rule equal to, or past, the given number.
2389.Pp
2390See the
2391.Sx EXAMPLES
2392Section for example usage of tables and the tablearg keyword.
2393.Sh SETS OF RULES
2394Each rule or table belongs to one of 32 different
2395.Em sets
2396, numbered 0 to 31.
2397Set 31 is reserved for the default rule.
2398.Pp
2399By default, rules or tables are put in set 0, unless you use the
2400.Cm set N
2401attribute when adding a new rule or table.
2402Sets can be individually and atomically enabled or disabled,
2403so this mechanism permits an easy way to store multiple configurations
2404of the firewall and quickly (and atomically) switch between them.
2405.Pp
2406By default, tables from set 0 are referenced when adding rule with
2407table opcodes regardless of rule set.
2408This behavior can be changed by setting
2409.Va net.inet.ip.fw.tables_sets
2410variable to 1.
2411Rule's set will then be used for table references.
2412.Pp
2413The command to enable/disable sets is
2414.Bd -ragged -offset indent
2415.Nm
2416.Cm set Oo Cm disable Ar number ... Oc Op Cm enable Ar number ...
2417.Ed
2418.Pp
2419where multiple
2420.Cm enable
2421or
2422.Cm disable
2423sections can be specified.
2424Command execution is atomic on all the sets specified in the command.
2425By default, all sets are enabled.
2426.Pp
2427When you disable a set, its rules behave as if they do not exist
2428in the firewall configuration, with only one exception:
2429.Bd -ragged -offset indent
2430dynamic rules created from a rule before it had been disabled
2431will still be active until they expire.
2432In order to delete
2433dynamic rules you have to explicitly delete the parent rule
2434which generated them.
2435.Ed
2436.Pp
2437The set number of rules can be changed with the command
2438.Bd -ragged -offset indent
2439.Nm
2440.Cm set move
2441.Brq Cm rule Ar rule-number | old-set
2442.Cm to Ar new-set
2443.Ed
2444.Pp
2445Also, you can atomically swap two rulesets with the command
2446.Bd -ragged -offset indent
2447.Nm
2448.Cm set swap Ar first-set second-set
2449.Ed
2450.Pp
2451See the
2452.Sx EXAMPLES
2453Section on some possible uses of sets of rules.
2454.Sh STATEFUL FIREWALL
2455Stateful operation is a way for the firewall to dynamically
2456create rules for specific flows when packets that
2457match a given pattern are detected.
2458Support for stateful
2459operation comes through the
2460.Cm check-state , keep-state , record-state , limit
2461and
2462.Cm set-limit
2463options of
2464.Nm rules .
2465.Pp
2466Dynamic rules are created when a packet matches a
2467.Cm keep-state ,
2468.Cm record-state ,
2469.Cm limit
2470or
2471.Cm set-limit
2472rule, causing the creation of a
2473.Em dynamic
2474rule which will match all and only packets with
2475a given
2476.Em protocol
2477between a
2478.Em src-ip/src-port dst-ip/dst-port
2479pair of addresses
2480.Em ( src
2481and
2482.Em dst
2483are used here only to denote the initial match addresses, but they
2484are completely equivalent afterwards).
2485Rules created by
2486.Cm keep-state
2487option also have a
2488.Ar :flowname
2489taken from it.
2490This name is used in matching together with addresses, ports and protocol.
2491Dynamic rules will be checked at the first
2492.Cm check-state, keep-state
2493or
2494.Cm limit
2495occurrence, and the action performed upon a match will be the same
2496as in the parent rule.
2497.Pp
2498Note that no additional attributes other than protocol and IP addresses
2499and ports and :flowname are checked on dynamic rules.
2500.Pp
2501The typical use of dynamic rules is to keep a closed firewall configuration,
2502but let the first TCP SYN packet from the inside network install a
2503dynamic rule for the flow so that packets belonging to that session
2504will be allowed through the firewall:
2505.Pp
2506.Dl "ipfw add check-state :OUTBOUND"
2507.Dl "ipfw add allow tcp from my-subnet to any setup keep-state :OUTBOUND"
2508.Dl "ipfw add deny tcp from any to any"
2509.Pp
2510A similar approach can be used for UDP, where an UDP packet coming
2511from the inside will install a dynamic rule to let the response through
2512the firewall:
2513.Pp
2514.Dl "ipfw add check-state :OUTBOUND"
2515.Dl "ipfw add allow udp from my-subnet to any keep-state :OUTBOUND"
2516.Dl "ipfw add deny udp from any to any"
2517.Pp
2518Dynamic rules expire after some time, which depends on the status
2519of the flow and the setting of some
2520.Cm sysctl
2521variables.
2522See Section
2523.Sx SYSCTL VARIABLES
2524for more details.
2525For TCP sessions, dynamic rules can be instructed to periodically
2526send keepalive packets to refresh the state of the rule when it is
2527about to expire.
2528.Pp
2529See Section
2530.Sx EXAMPLES
2531for more examples on how to use dynamic rules.
2532.Sh TRAFFIC SHAPER (DUMMYNET) CONFIGURATION
2533.Nm
2534is also the user interface for the
2535.Nm dummynet
2536traffic shaper, packet scheduler and network emulator, a subsystem that
2537can artificially queue, delay or drop packets
2538emulating the behaviour of certain network links
2539or queueing systems.
2540.Pp
2541.Nm dummynet
2542operates by first using the firewall to select packets
2543using any match pattern that can be used in
2544.Nm
2545rules.
2546Matching packets are then passed to either of two
2547different objects, which implement the traffic regulation:
2548.Bl -hang -offset XXXX
2549.It Em pipe
2550A
2551.Em pipe
2552emulates a
2553.Em link
2554with given bandwidth and propagation delay,
2555driven by a FIFO scheduler and a single queue with programmable
2556queue size and packet loss rate.
2557Packets are appended to the queue as they come out from
2558.Nm ipfw ,
2559and then transferred in FIFO order to the link at the desired rate.
2560.It Em queue
2561A
2562.Em queue
2563is an abstraction used to implement packet scheduling
2564using one of several packet scheduling algorithms.
2565Packets sent to a
2566.Em queue
2567are first grouped into flows according to a mask on the 5-tuple.
2568Flows are then passed to the scheduler associated to the
2569.Em queue ,
2570and each flow uses scheduling parameters (weight and others)
2571as configured in the
2572.Em queue
2573itself.
2574A scheduler in turn is connected to an emulated link,
2575and arbitrates the link's bandwidth among backlogged flows according to
2576weights and to the features of the scheduling algorithm in use.
2577.El
2578.Pp
2579In practice,
2580.Em pipes
2581can be used to set hard limits to the bandwidth that a flow can use, whereas
2582.Em queues
2583can be used to determine how different flows share the available bandwidth.
2584.Pp
2585A graphical representation of the binding of queues,
2586flows, schedulers and links is below.
2587.Bd -literal -offset indent
2588                 (flow_mask|sched_mask)  sched_mask
2589         +---------+   weight Wx  +-------------+
2590         |         |->-[flow]-->--|             |-+
2591    -->--| QUEUE x |   ...        |             | |
2592         |         |->-[flow]-->--| SCHEDuler N | |
2593         +---------+              |             | |
2594             ...                  |             +--[LINK N]-->--
2595         +---------+   weight Wy  |             | +--[LINK N]-->--
2596         |         |->-[flow]-->--|             | |
2597    -->--| QUEUE y |   ...        |             | |
2598         |         |->-[flow]-->--|             | |
2599         +---------+              +-------------+ |
2600                                    +-------------+
2601.Ed
2602It is important to understand the role of the SCHED_MASK
2603and FLOW_MASK, which are configured through the commands
2604.Dl "ipfw sched N config mask SCHED_MASK ..."
2605and
2606.Dl "ipfw queue X config mask FLOW_MASK ..." .
2607.Pp
2608The SCHED_MASK is used to assign flows to one or more
2609scheduler instances, one for each
2610value of the packet's 5-tuple after applying SCHED_MASK.
2611As an example, using ``src-ip 0xffffff00'' creates one instance
2612for each /24 destination subnet.
2613.Pp
2614The FLOW_MASK, together with the SCHED_MASK, is used to split
2615packets into flows.
2616As an example, using
2617``src-ip 0x000000ff''
2618together with the previous SCHED_MASK makes a flow for
2619each individual source address.
2620In turn, flows for each /24
2621subnet will be sent to the same scheduler instance.
2622.Pp
2623The above diagram holds even for the
2624.Em pipe
2625case, with the only restriction that a
2626.Em pipe
2627only supports a SCHED_MASK, and forces the use of a FIFO
2628scheduler (these are for backward compatibility reasons;
2629in fact, internally, a
2630.Nm dummynet's
2631pipe is implemented exactly as above).
2632.Pp
2633There are two modes of
2634.Nm dummynet
2635operation:
2636.Dq normal
2637and
2638.Dq fast .
2639The
2640.Dq normal
2641mode tries to emulate a real link: the
2642.Nm dummynet
2643scheduler ensures that the packet will not leave the pipe faster than it
2644would on the real link with a given bandwidth.
2645The
2646.Dq fast
2647mode allows certain packets to bypass the
2648.Nm dummynet
2649scheduler (if packet flow does not exceed pipe's bandwidth).
2650This is the reason why the
2651.Dq fast
2652mode requires less CPU cycles per packet (on average) and packet latency
2653can be significantly lower in comparison to a real link with the same
2654bandwidth.
2655The default mode is
2656.Dq normal .
2657The
2658.Dq fast
2659mode can be enabled by setting the
2660.Va net.inet.ip.dummynet.io_fast
2661.Xr sysctl 8
2662variable to a non-zero value.
2663.Ss PIPE, QUEUE AND SCHEDULER CONFIGURATION
2664The
2665.Em pipe ,
2666.Em queue
2667and
2668.Em scheduler
2669configuration commands are the following:
2670.Bd -ragged -offset indent
2671.Cm pipe Ar number Cm config Ar pipe-configuration
2672.Pp
2673.Cm queue Ar number Cm config Ar queue-configuration
2674.Pp
2675.Cm sched Ar number Cm config Ar sched-configuration
2676.Ed
2677.Pp
2678The following parameters can be configured for a pipe:
2679.Pp
2680.Bl -tag -width indent -compact
2681.It Cm bw Ar bandwidth | device
2682Bandwidth, measured in
2683.Sm off
2684.Op Cm K | M | G
2685.Brq Cm bit/s | Byte/s .
2686.Sm on
2687.Pp
2688A value of 0 (default) means unlimited bandwidth.
2689The unit must immediately follow the number, as in
2690.Pp
2691.Dl "dnctl pipe 1 config bw 300Kbit/s"
2692.Pp
2693If a device name is specified instead of a numeric value, as in
2694.Pp
2695.Dl "dnctl pipe 1 config bw tun0"
2696.Pp
2697then the transmit clock is supplied by the specified device.
2698At the moment only the
2699.Xr tun 4
2700device supports this
2701functionality, for use in conjunction with
2702.Xr ppp 8 .
2703.Pp
2704.It Cm delay Ar ms-delay
2705Propagation delay, measured in milliseconds.
2706The value is rounded to the next multiple of the clock tick
2707(typically 10ms, but it is a good practice to run kernels
2708with
2709.Dq "options HZ=1000"
2710to reduce
2711the granularity to 1ms or less).
2712The default value is 0, meaning no delay.
2713.Pp
2714.It Cm burst Ar size
2715If the data to be sent exceeds the pipe's bandwidth limit
2716(and the pipe was previously idle), up to
2717.Ar size
2718bytes of data are allowed to bypass the
2719.Nm dummynet
2720scheduler, and will be sent as fast as the physical link allows.
2721Any additional data will be transmitted at the rate specified
2722by the
2723.Nm pipe
2724bandwidth.
2725The burst size depends on how long the pipe has been idle;
2726the effective burst size is calculated as follows:
2727MAX(
2728.Ar size
2729,
2730.Nm bw
2731* pipe_idle_time).
2732.Pp
2733.It Cm profile Ar filename
2734A file specifying the additional overhead incurred in the transmission
2735of a packet on the link.
2736.Pp
2737Some link types introduce extra delays in the transmission
2738of a packet, e.g., because of MAC level framing, contention on
2739the use of the channel, MAC level retransmissions and so on.
2740From our point of view, the channel is effectively unavailable
2741for this extra time, which is constant or variable depending
2742on the link type.
2743Additionally, packets may be dropped after this
2744time (e.g., on a wireless link after too many retransmissions).
2745We can model the additional delay with an empirical curve
2746that represents its distribution.
2747.Bd -literal -offset indent
2748      cumulative probability
2749      1.0 ^
2750          |
2751      L   +-- loss-level          x
2752          |                 ******
2753          |                *
2754          |           *****
2755          |          *
2756          |        **
2757          |       *
2758          +-------*------------------->
2759                      delay
2760.Ed
2761The empirical curve may have both vertical and horizontal lines.
2762Vertical lines represent constant delay for a range of
2763probabilities.
2764Horizontal lines correspond to a discontinuity in the delay
2765distribution: the pipe will use the largest delay for a
2766given probability.
2767.Pp
2768The file format is the following, with whitespace acting as
2769a separator and '#' indicating the beginning a comment:
2770.Bl -tag -width indent
2771.It Cm name Ar identifier
2772optional name (listed by "dnctl pipe show")
2773to identify the delay distribution;
2774.It Cm bw Ar value
2775the bandwidth used for the pipe.
2776If not specified here, it must be present
2777explicitly as a configuration parameter for the pipe;
2778.It Cm loss-level Ar L
2779the probability above which packets are lost.
2780(0.0 <= L <= 1.0, default 1.0 i.e., no loss);
2781.It Cm samples Ar N
2782the number of samples used in the internal
2783representation of the curve (2..1024; default 100);
2784.It Cm "delay prob" | "prob delay"
2785One of these two lines is mandatory and defines
2786the format of the following lines with data points.
2787.It Ar XXX Ar YYY
27882 or more lines representing points in the curve,
2789with either delay or probability first, according
2790to the chosen format.
2791The unit for delay is milliseconds.
2792Data points do not need to be sorted.
2793Also, the number of actual lines can be different
2794from the value of the "samples" parameter:
2795.Nm
2796utility will sort and interpolate
2797the curve as needed.
2798.El
2799.Pp
2800Example of a profile file:
2801.Bd -literal -offset indent
2802name    bla_bla_bla
2803samples 100
2804loss-level    0.86
2805prob    delay
28060       200	# minimum overhead is 200ms
28070.5     200
28080.5     300
28090.8     1000
28100.9     1300
28111       1300
2812#configuration file end
2813.Ed
2814.El
2815.Pp
2816The following parameters can be configured for a queue:
2817.Pp
2818.Bl -tag -width indent -compact
2819.It Cm pipe Ar pipe_nr
2820Connects a queue to the specified pipe.
2821Multiple queues (with the same or different weights) can be connected to
2822the same pipe, which specifies the aggregate rate for the set of queues.
2823.Pp
2824.It Cm weight Ar weight
2825Specifies the weight to be used for flows matching this queue.
2826The weight must be in the range 1..100, and defaults to 1.
2827.El
2828.Pp
2829The following case-insensitive parameters can be configured for a
2830scheduler:
2831.Pp
2832.Bl -tag -width indent -compact
2833.It Cm type Ar {fifo | wf2q+ | rr | qfq | fq_codel | fq_pie}
2834specifies the scheduling algorithm to use.
2835.Bl -tag -width indent -compact
2836.It Cm fifo
2837is just a FIFO scheduler (which means that all packets
2838are stored in the same queue as they arrive to the scheduler).
2839FIFO has O(1) per-packet time complexity, with very low
2840constants (estimate 60-80ns on a 2GHz desktop machine)
2841but gives no service guarantees.
2842.It Cm wf2q+
2843implements the WF2Q+ algorithm, which is a Weighted Fair Queueing
2844algorithm which permits flows to share bandwidth according to
2845their weights.
2846Note that weights are not priorities; even a flow
2847with a minuscule weight will never starve.
2848WF2Q+ has O(log N) per-packet processing cost, where N is the number
2849of flows, and is the default algorithm used by previous versions
2850dummynet's queues.
2851.It Cm rr
2852implements the Deficit Round Robin algorithm, which has O(1) processing
2853costs (roughly, 100-150ns per packet)
2854and permits bandwidth allocation according to weights, but
2855with poor service guarantees.
2856.It Cm qfq
2857implements the QFQ algorithm, which is a very fast variant of
2858WF2Q+, with similar service guarantees and O(1) processing
2859costs (roughly, 200-250ns per packet).
2860.It Cm fq_codel
2861implements the FQ-CoDel (FlowQueue-CoDel) scheduler/AQM algorithm, which
2862uses a modified Deficit Round Robin scheduler to manage two lists of sub-queues
2863(old sub-queues and new sub-queues) for providing brief periods of priority to
2864lightweight or short burst flows.
2865By default, the total number of sub-queues is 1024.
2866FQ-CoDel's internal, dynamically
2867created sub-queues are controlled by separate instances of CoDel AQM.
2868.It Cm fq_pie
2869implements the FQ-PIE (FlowQueue-PIE) scheduler/AQM algorithm, which similar to
2870.Cm fq_codel
2871but uses per sub-queue PIE AQM instance to control the queue delay.
2872.El
2873.Pp
2874.Cm fq_codel
2875inherits AQM parameters and options from
2876.Cm codel
2877(see below), and
2878.Cm fq_pie
2879inherits AQM parameters and options from
2880.Cm pie
2881(see below).
2882Additionally, both of
2883.Cm fq_codel
2884and
2885.Cm fq_pie
2886have shared scheduler parameters which are:
2887.Bl -tag -width indent
2888.It Cm quantum
2889.Ar m
2890specifies the quantum (credit) of the scheduler.
2891.Ar m
2892is the number of bytes a queue can serve before being moved to the tail
2893of old queues list.
2894The default is 1514 bytes, and the maximum acceptable value
2895is 9000 bytes.
2896.It Cm limit
2897.Ar m
2898specifies the hard size limit (in unit of packets) of all queues managed by an
2899instance of the scheduler.
2900The default value of
2901.Ar m
2902is 10240 packets, and the maximum acceptable value is 20480 packets.
2903.It Cm flows
2904.Ar m
2905specifies the total number of flow queues (sub-queues) that fq_*
2906creates and manages.
2907By default, 1024 sub-queues are created when an instance
2908of the fq_{codel/pie} scheduler is created.
2909The maximum acceptable value is
291065536.
2911.El
2912.Pp
2913Note that any token after
2914.Cm fq_codel
2915or
2916.Cm fq_pie
2917is considered a parameter for fq_{codel/pie}.
2918So, ensure all scheduler
2919configuration options not related to fq_{codel/pie} are written before
2920.Cm fq_codel/fq_pie
2921tokens.
2922.El
2923.Pp
2924In addition to the type, all parameters allowed for a pipe can also
2925be specified for a scheduler.
2926.Pp
2927Finally, the following parameters can be configured for both
2928pipes and queues:
2929.Pp
2930.Bl -tag -width XXXX -compact
2931.It Cm buckets Ar hash-table-size
2932Specifies the size of the hash table used for storing the
2933various queues.
2934Default value is 64 controlled by the
2935.Xr sysctl 8
2936variable
2937.Va net.inet.ip.dummynet.hash_size ,
2938allowed range is 16 to 65536.
2939.Pp
2940.It Cm mask Ar mask-specifier
2941Packets sent to a given pipe or queue by an
2942.Nm
2943rule can be further classified into multiple flows, each of which is then
2944sent to a different
2945.Em dynamic
2946pipe or queue.
2947A flow identifier is constructed by masking the IP addresses,
2948ports and protocol types as specified with the
2949.Cm mask
2950options in the configuration of the pipe or queue.
2951For each different flow identifier, a new pipe or queue is created
2952with the same parameters as the original object, and matching packets
2953are sent to it.
2954.Pp
2955Thus, when
2956.Em dynamic pipes
2957are used, each flow will get the same bandwidth as defined by the pipe,
2958whereas when
2959.Em dynamic queues
2960are used, each flow will share the parent's pipe bandwidth evenly
2961with other flows generated by the same queue (note that other queues
2962with different weights might be connected to the same pipe).
2963.br
2964Available mask specifiers are a combination of one or more of the following:
2965.Pp
2966.Cm dst-ip Ar mask ,
2967.Cm dst-ip6 Ar mask ,
2968.Cm src-ip Ar mask ,
2969.Cm src-ip6 Ar mask ,
2970.Cm dst-port Ar mask ,
2971.Cm src-port Ar mask ,
2972.Cm flow-id Ar mask ,
2973.Cm proto Ar mask
2974or
2975.Cm all ,
2976.Pp
2977where the latter means all bits in all fields are significant.
2978.Pp
2979.It Cm noerror
2980When a packet is dropped by a
2981.Nm dummynet
2982queue or pipe, the error
2983is normally reported to the caller routine in the kernel, in the
2984same way as it happens when a device queue fills up.
2985Setting this
2986option reports the packet as successfully delivered, which can be
2987needed for some experimental setups where you want to simulate
2988loss or congestion at a remote router.
2989.Pp
2990.It Cm plr Ar packet-loss-rate
2991Packet loss rate.
2992Argument
2993.Ar packet-loss-rate
2994is a floating-point number between 0 and 1, with 0 meaning no
2995loss, 1 meaning 100% loss.
2996The loss rate is internally represented on 31 bits.
2997.Pp
2998.It Cm queue Brq Ar slots | size Ns Cm Kbytes
2999Queue size, in
3000.Ar slots
3001or
3002.Cm KBytes .
3003Default value is 50 slots, which
3004is the typical queue size for Ethernet devices.
3005Note that for slow speed links you should keep the queue
3006size short or your traffic might be affected by a significant
3007queueing delay.
3008E.g., 50 max-sized Ethernet packets (1500 bytes) mean 600Kbit
3009or 20s of queue on a 30Kbit/s pipe.
3010Even worse effects can result if you get packets from an
3011interface with a much larger MTU, e.g.\& the loopback interface
3012with its 16KB packets.
3013The
3014.Xr sysctl 8
3015variables
3016.Em net.inet.ip.dummynet.pipe_byte_limit
3017and
3018.Em net.inet.ip.dummynet.pipe_slot_limit
3019control the maximum lengths that can be specified.
3020.Pp
3021.It Cm red | gred Ar w_q Ns / Ns Ar min_th Ns / Ns Ar max_th Ns / Ns Ar max_p
3022[ecn]
3023Make use of the RED (Random Early Detection) queue management algorithm.
3024.Ar w_q
3025and
3026.Ar max_p
3027are floating
3028point numbers between 0 and 1 (inclusive), while
3029.Ar min_th
3030and
3031.Ar max_th
3032are integer numbers specifying thresholds for queue management
3033(thresholds are computed in bytes if the queue has been defined
3034in bytes, in slots otherwise).
3035The two parameters can also be of the same value if needed.
3036The
3037.Nm dummynet
3038also supports the gentle RED variant (gred) and ECN (Explicit Congestion
3039Notification) as optional.
3040Three
3041.Xr sysctl 8
3042variables can be used to control the RED behaviour:
3043.Bl -tag -width indent
3044.It Va net.inet.ip.dummynet.red_lookup_depth
3045specifies the accuracy in computing the average queue
3046when the link is idle (defaults to 256, must be greater than zero)
3047.It Va net.inet.ip.dummynet.red_avg_pkt_size
3048specifies the expected average packet size (defaults to 512, must be
3049greater than zero)
3050.It Va net.inet.ip.dummynet.red_max_pkt_size
3051specifies the expected maximum packet size, only used when queue
3052thresholds are in bytes (defaults to 1500, must be greater than zero).
3053.El
3054.Pp
3055.It Cm codel Oo Cm target Ar time Oc Oo Cm interval Ar time Oc Oo Cm ecn |
3056.Cm noecn Oc
3057Make use of the CoDel (Controlled-Delay) queue management algorithm.
3058.Ar time
3059is interpreted as milliseconds by default but seconds (s), milliseconds (ms) or
3060microseconds (us) can be specified instead.
3061CoDel drops or marks (ECN) packets
3062depending on packet sojourn time in the queue.
3063.Cm target
3064.Ar time
3065(5ms by default) is the minimum acceptable persistent queue delay that CoDel
3066allows.
3067CoDel does not drop packets directly after packets sojourn time becomes
3068higher than
3069.Cm target
3070.Ar time
3071but waits for
3072.Cm interval
3073.Ar time
3074(100ms default) before dropping.
3075.Cm interval
3076.Ar time
3077should be set to maximum RTT for all expected connections.
3078.Cm ecn
3079enables (disabled by default) packet marking (instead of dropping) for
3080ECN-enabled TCP flows when queue delay becomes high.
3081.Pp
3082Note that any token after
3083.Cm codel
3084is considered a parameter for CoDel.
3085So, ensure all pipe/queue
3086configuration options are written before
3087.Cm codel
3088token.
3089.Pp
3090The
3091.Xr sysctl 8
3092variables
3093.Va net.inet.ip.dummynet.codel.target
3094and
3095.Va net.inet.ip.dummynet.codel.interval
3096can be used to set CoDel default parameters.
3097.Pp
3098.It Cm pie Oo Cm target Ar time Oc Oo Cm tupdate Ar time Oc Oo
3099.Cm alpha Ar n Oc Oo Cm beta Ar n Oc Oo Cm max_burst Ar time Oc Oo
3100.Cm max_ecnth Ar n Oc Oo Cm ecn | Cm noecn Oc Oo Cm capdrop |
3101.Cm nocapdrop Oc Oo Cm drand | Cm nodrand Oc Oo Cm onoff
3102.Oc Oo Cm dre | Cm ts Oc
3103Make use of the PIE (Proportional Integral controller Enhanced) queue management
3104algorithm.
3105PIE drops or marks packets depending on a calculated drop probability during
3106en-queue process, with the aim of achieving high throughput while keeping queue
3107delay low.
3108At regular time intervals of
3109.Cm tupdate
3110.Ar time
3111(15ms by default) a background process (re)calculates the probability based on queue delay
3112deviations from
3113.Cm target
3114.Ar time
3115(15ms by default) and queue delay trends.
3116PIE approximates current queue
3117delay by using a departure rate estimation method, or (optionally) by using a
3118packet timestamp method similar to CoDel.
3119.Ar time
3120is interpreted as milliseconds by default but seconds (s), milliseconds (ms) or
3121microseconds (us) can be specified instead.
3122The other PIE parameters and options are as follows:
3123.Bl -tag -width indent
3124.It Cm alpha Ar n
3125.Ar n
3126is a floating point number between 0 and 7 which specifies the weight of queue
3127delay deviations that is used in drop probability calculation.
31280.125 is the default.
3129.It Cm beta Ar n
3130.Ar n
3131is a floating point number between 0 and 7 which specifies is the weight of queue
3132delay trend that is used in drop probability calculation.
31331.25 is the default.
3134.It Cm max_burst Ar time
3135The maximum period of time that PIE does not drop/mark packets.
3136150ms is the
3137default and 10s is the maximum value.
3138.It Cm max_ecnth Ar n
3139Even when ECN is enabled, PIE drops packets instead of marking them when drop
3140probability becomes higher than ECN probability threshold
3141.Cm max_ecnth Ar n
3142, the default is 0.1 (i.e 10%) and 1 is the maximum value.
3143.It Cm ecn | noecn
3144enable or disable ECN marking for ECN-enabled TCP flows.
3145Disabled by default.
3146.It Cm capdrop | nocapdrop
3147enable or disable cap drop adjustment.
3148Cap drop adjustment is enabled by default.
3149.It Cm drand | nodrand
3150enable or disable drop probability de-randomisation.
3151De-randomisation eliminates
3152the problem of dropping packets too close or too far.
3153De-randomisation is enabled by default.
3154.It Cm onoff
3155enable turning PIE on and off depending on queue load.
3156If this option is enabled,
3157PIE turns on when over 1/3 of queue becomes full.
3158This option is disabled by
3159default.
3160.It Cm dre | ts
3161Calculate queue delay using departure rate estimation
3162.Cm dre
3163or timestamps
3164.Cm ts .
3165.Cm dre
3166is used by default.
3167.El
3168.Pp
3169Note that any token after
3170.Cm pie
3171is considered a parameter for PIE.
3172So ensure all pipe/queue
3173the configuration options are written before
3174.Cm pie
3175token.
3176.Xr sysctl 8
3177variables can be used to control the
3178.Cm pie
3179default parameters.
3180See the
3181.Sx SYSCTL VARIABLES
3182section for more details.
3183.El
3184.Pp
3185When used with IPv6 data,
3186.Nm dummynet
3187currently has several limitations.
3188Information necessary to route link-local packets to an
3189interface is not available after processing by
3190.Nm dummynet
3191so those packets are dropped in the output path.
3192Care should be taken to ensure that link-local packets are not passed to
3193.Nm dummynet .
3194.Sh CHECKLIST
3195Here are some important points to consider when designing your
3196rules:
3197.Bl -bullet
3198.It
3199Remember that you filter both packets going
3200.Cm in
3201and
3202.Cm out .
3203Most connections need packets going in both directions.
3204.It
3205Remember to test very carefully.
3206It is a good idea to be near the console when doing this.
3207If you cannot be near the console,
3208use an auto-recovery script such as the one in
3209.Pa /usr/share/examples/ipfw/change_rules.sh .
3210.It
3211Do not forget the loopback interface.
3212.El
3213.Sh FINE POINTS
3214.Bl -bullet
3215.It
3216There are circumstances where fragmented datagrams are unconditionally
3217dropped.
3218TCP packets are dropped if they do not contain at least 20 bytes of
3219TCP header, UDP packets are dropped if they do not contain a full 8
3220byte UDP header, and ICMP packets are dropped if they do not contain
32214 bytes of ICMP header, enough to specify the ICMP type, code, and
3222checksum.
3223These packets are simply logged as
3224.Dq pullup failed
3225since there may not be enough good data in the packet to produce a
3226meaningful log entry.
3227.It
3228Another type of packet is unconditionally dropped, a TCP packet with a
3229fragment offset of one.
3230This is a valid packet, but it only has one use, to try
3231to circumvent firewalls.
3232When logging is enabled, these packets are
3233reported as being dropped by rule -1.
3234.It
3235If you are logged in over a network, loading the
3236.Xr kld 4
3237version of
3238.Nm
3239is probably not as straightforward as you would think.
3240The following command line is recommended:
3241.Bd -literal -offset indent
3242kldload ipfw && \e
3243ipfw add 32000 allow ip from any to any
3244.Ed
3245.Pp
3246Along the same lines, doing an
3247.Bd -literal -offset indent
3248ipfw flush
3249.Ed
3250.Pp
3251in similar surroundings is also a bad idea.
3252.It
3253The
3254.Nm
3255filter list may not be modified if the system security level
3256is set to 3 or higher
3257(see
3258.Xr init 8
3259for information on system security levels).
3260.El
3261.Sh PACKET DIVERSION
3262A
3263.Xr divert 4
3264socket bound to the specified port will receive all packets
3265diverted to that port.
3266If no socket is bound to the destination port, or if the divert module is
3267not loaded, or if the kernel was not compiled with divert socket support,
3268the packets are dropped.
3269.Sh NETWORK ADDRESS TRANSLATION (NAT)
3270.Nm
3271support in-kernel NAT using the kernel version of
3272.Xr libalias 3 .
3273The kernel module
3274.Cm ipfw_nat
3275should be loaded or kernel should have
3276.Cm options IPFIREWALL_NAT
3277to be able use NAT.
3278.Pp
3279The nat configuration command is the following:
3280.Bd -ragged -offset indent
3281.Bk -words
3282.Cm nat
3283.Ar nat_number
3284.Cm config
3285.Ar nat-configuration
3286.Ek
3287.Ed
3288.Pp
3289The following parameters can be configured:
3290.Bl -tag -width indent
3291.It Cm ip Ar ip_address
3292Define an ip address to use for aliasing.
3293.It Cm if Ar nic
3294Use ip address of NIC for aliasing, dynamically changing
3295it if NIC's ip address changes.
3296.It Cm log
3297Enable logging on this nat instance.
3298.It Cm deny_in
3299Deny any incoming connection from outside world.
3300.It Cm same_ports
3301Try to leave the alias port numbers unchanged from
3302the actual local port numbers.
3303.It Cm unreg_only
3304Traffic on the local network not originating from a RFC 1918
3305unregistered address spaces will be ignored.
3306.It Cm unreg_cgn
3307Like unreg_only, but includes the RFC 6598 (Carrier Grade NAT)
3308address range.
3309.It Cm reset
3310Reset table of the packet aliasing engine on address change.
3311.It Cm reverse
3312Reverse the way libalias handles aliasing.
3313.It Cm proxy_only
3314Obey transparent proxy rules only, packet aliasing is not performed.
3315.It Cm skip_global
3316Skip instance in case of global state lookup (see below).
3317.It Cm port_range Ar lower-upper
3318Set the aliasing ports between the ranges given. Upper port has to be greater
3319than lower.
3320.El
3321.Pp
3322Some special values can be supplied instead of
3323.Va nat_number
3324in nat rule actions:
3325.Bl -tag -width indent
3326.It Cm global
3327Looks up translation state in all configured nat instances.
3328If an entry is found, packet is aliased according to that entry.
3329If no entry was found in any of the instances, packet is passed unchanged,
3330and no new entry will be created.
3331See section
3332.Sx MULTIPLE INSTANCES
3333in
3334.Xr natd 8
3335for more information.
3336.It Cm tablearg
3337Uses argument supplied in lookup table.
3338See
3339.Sx LOOKUP TABLES
3340section below for more information on lookup tables.
3341.El
3342.Pp
3343To let the packet continue after being (de)aliased, set the sysctl variable
3344.Va net.inet.ip.fw.one_pass
3345to 0.
3346For more information about aliasing modes, refer to
3347.Xr libalias 3 .
3348See Section
3349.Sx EXAMPLES
3350for some examples of nat usage.
3351.Ss REDIRECT AND LSNAT SUPPORT IN IPFW
3352Redirect and LSNAT support follow closely the syntax used in
3353.Xr natd 8 .
3354See Section
3355.Sx EXAMPLES
3356for some examples on how to do redirect and lsnat.
3357.Ss SCTP NAT SUPPORT
3358SCTP nat can be configured in a similar manner to TCP through the
3359.Nm
3360command line tool.
3361The main difference is that
3362.Nm sctp nat
3363does not do port translation.
3364Since the local and global side ports will be the same,
3365there is no need to specify both.
3366Ports are redirected as follows:
3367.Bd -ragged -offset indent
3368.Bk -words
3369.Cm nat
3370.Ar nat_number
3371.Cm config if
3372.Ar nic
3373.Cm redirect_port sctp
3374.Ar ip_address [,addr_list] {[port | port-port] [,ports]}
3375.Ek
3376.Ed
3377.Pp
3378Most
3379.Nm sctp nat
3380configuration can be done in real-time through the
3381.Xr sysctl 8
3382interface.
3383All may be changed dynamically, though the hash_table size will only
3384change for new
3385.Nm nat
3386instances.
3387See
3388.Sx SYSCTL VARIABLES
3389for more info.
3390.Sh IPv6/IPv4 NETWORK ADDRESS AND PROTOCOL TRANSLATION
3391.Ss Stateful translation
3392.Nm
3393supports in-kernel IPv6/IPv4 network address and protocol translation.
3394Stateful NAT64 translation allows IPv6-only clients to contact IPv4 servers
3395using unicast TCP, UDP or ICMP protocols.
3396One or more IPv4 addresses assigned to a stateful NAT64 translator are shared
3397among several IPv6-only clients.
3398When stateful NAT64 is used in conjunction with DNS64, no changes are usually
3399required in the IPv6 client or the IPv4 server.
3400The kernel module
3401.Cm ipfw_nat64
3402should be loaded or kernel should have
3403.Cm options IPFIREWALL_NAT64
3404to be able use stateful NAT64 translator.
3405.Pp
3406Stateful NAT64 uses a bunch of memory for several types of objects.
3407When IPv6 client initiates connection, NAT64 translator creates a host entry
3408in the states table.
3409Each host entry uses preallocated IPv4 alias entry.
3410Each alias entry has a number of ports group entries allocated on demand.
3411Ports group entries contains connection state entries.
3412There are several options to control limits and lifetime for these objects.
3413.Pp
3414NAT64 translator follows RFC7915 when does ICMPv6/ICMP translation,
3415unsupported message types will be silently dropped.
3416IPv6 needs several ICMPv6 message types to be explicitly allowed for correct
3417operation.
3418Make sure that ND6 neighbor solicitation (ICMPv6 type 135) and neighbor
3419advertisement (ICMPv6 type 136) messages will not be handled by translation
3420rules.
3421.Pp
3422After translation NAT64 translator by default sends packets through
3423corresponding netisr queue.
3424Thus translator host should be configured as IPv4 and IPv6 router.
3425Also this means, that a packet is handled by firewall twice.
3426First time an original packet is handled and consumed by translator,
3427and then it is handled again as translated packet.
3428This behavior can be changed by sysctl variable
3429.Va net.inet.ip.fw.nat64_direct_output .
3430Also translated packet can be tagged using
3431.Cm tag
3432rule action, and then matched by
3433.Cm tagged
3434opcode to avoid loops and extra overhead.
3435.Pp
3436The stateful NAT64 configuration command is the following:
3437.Bd -ragged -offset indent
3438.Bk -words
3439.Cm nat64lsn
3440.Ar name
3441.Cm create
3442.Ar create-options
3443.Ek
3444.Ed
3445.Pp
3446The following parameters can be configured:
3447.Bl -tag -width indent
3448.It Cm prefix4 Ar ipv4_prefix/plen
3449The IPv4 prefix with mask defines the pool of IPv4 addresses used as
3450source address after translation.
3451Stateful NAT64 module translates IPv6 source address of client to one
3452IPv4 address from this pool.
3453Note that incoming IPv4 packets that don't have corresponding state entry
3454in the states table will be dropped by translator.
3455Make sure that translation rules handle packets, destined to configured prefix.
3456.It Cm prefix6 Ar ipv6_prefix/length
3457The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3458to represent IPv4 addresses.
3459This IPv6 prefix should be configured in DNS64.
3460The translator implementation follows RFC6052, that restricts the length of
3461prefixes to one of following: 32, 40, 48, 56, 64, or 96.
3462The Well-Known IPv6 Prefix 64:ff9b:: must be 96 bits long.
3463The special
3464.Ar ::/length
3465prefix can be used to handle several IPv6 prefixes with one NAT64 instance.
3466The NAT64 instance will determine a destination IPv4 address from prefix
3467.Ar length .
3468.It Cm states_chunks Ar number
3469The number of states chunks in single ports group.
3470Each ports group by default can keep 64 state entries in single chunk.
3471The above value affects the maximum number of states that can be associated with single IPv4 alias address and port.
3472The value must be power of 2, and up to 128.
3473.It Cm host_del_age Ar seconds
3474The number of seconds until the host entry for a IPv6 client will be deleted
3475and all its resources will be released due to inactivity.
3476Default value is
3477.Ar 3600 .
3478.It Cm pg_del_age Ar seconds
3479The number of seconds until a ports group with unused state entries will
3480be released.
3481Default value is
3482.Ar 900 .
3483.It Cm tcp_syn_age Ar seconds
3484The number of seconds while a state entry for TCP connection with only SYN
3485sent will be kept.
3486If TCP connection establishing will not be finished,
3487state entry will be deleted.
3488Default value is
3489.Ar 10 .
3490.It Cm tcp_est_age Ar seconds
3491The number of seconds while a state entry for established TCP connection
3492will be kept.
3493Default value is
3494.Ar 7200 .
3495.It Cm tcp_close_age Ar seconds
3496The number of seconds while a state entry for closed TCP connection
3497will be kept.
3498Keeping state entries for closed connections is needed, because IPv4 servers
3499typically keep closed connections in a TIME_WAIT state for a several minutes.
3500Since translator's IPv4 addresses are shared among all IPv6 clients,
3501new connections from the same addresses and ports may be rejected by server,
3502because these connections are still in a TIME_WAIT state.
3503Keeping them in translator's state table protects from such rejects.
3504Default value is
3505.Ar 180 .
3506.It Cm udp_age Ar seconds
3507The number of seconds while translator keeps state entry in a waiting for
3508reply to the sent UDP datagram.
3509Default value is
3510.Ar 120 .
3511.It Cm icmp_age Ar seconds
3512The number of seconds while translator keeps state entry in a waiting for
3513reply to the sent ICMP message.
3514Default value is
3515.Ar 60 .
3516.It Cm log
3517Turn on logging of all handled packets via BPF through
3518.Ar ipfwlog0
3519interface.
3520.Ar ipfwlog0
3521is a pseudo interface and can be created after a boot manually with
3522.Cm ifconfig
3523command.
3524Note that it has different purpose than
3525.Ar ipfw0
3526interface.
3527Translators sends to BPF an additional information with each packet.
3528With
3529.Cm tcpdump
3530you are able to see each handled packet before and after translation.
3531.It Cm -log
3532Turn off logging of all handled packets via BPF.
3533.It Cm allow_private
3534Turn on processing private IPv4 addresses.
3535By default IPv6 packets with destinations mapped to private address ranges
3536defined by RFC1918 are not processed.
3537.It Cm -allow_private
3538Turn off private address handling in
3539.Nm nat64
3540instance.
3541.El
3542.Pp
3543To inspect a states table of stateful NAT64 the following command can be used:
3544.Bd -ragged -offset indent
3545.Bk -words
3546.Cm nat64lsn
3547.Ar name
3548.Cm show Cm states
3549.Ek
3550.Ed
3551.Pp
3552Stateless NAT64 translator doesn't use a states table for translation
3553and converts IPv4 addresses to IPv6 and vice versa solely based on the
3554mappings taken from configured lookup tables.
3555Since a states table doesn't used by stateless translator,
3556it can be configured to pass IPv4 clients to IPv6-only servers.
3557.Pp
3558The stateless NAT64 configuration command is the following:
3559.Bd -ragged -offset indent
3560.Bk -words
3561.Cm nat64stl
3562.Ar name
3563.Cm create
3564.Ar create-options
3565.Ek
3566.Ed
3567.Pp
3568The following parameters can be configured:
3569.Bl -tag -width indent
3570.It Cm prefix6 Ar ipv6_prefix/length
3571The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3572to represent IPv4 addresses.
3573This IPv6 prefix should be configured in DNS64.
3574.It Cm table4 Ar table46
3575The lookup table
3576.Ar table46
3577contains mapping how IPv4 addresses should be translated to IPv6 addresses.
3578.It Cm table6 Ar table64
3579The lookup table
3580.Ar table64
3581contains mapping how IPv6 addresses should be translated to IPv4 addresses.
3582.It Cm log
3583Turn on logging of all handled packets via BPF through
3584.Ar ipfwlog0
3585interface.
3586.It Cm -log
3587Turn off logging of all handled packets via BPF.
3588.It Cm allow_private
3589Turn on processing private IPv4 addresses.
3590By default IPv6 packets with destinations mapped to private address ranges
3591defined by RFC1918 are not processed.
3592.It Cm -allow_private
3593Turn off private address handling in
3594.Nm nat64
3595instance.
3596.El
3597.Pp
3598Note that the behavior of stateless translator with respect to not matched
3599packets differs from stateful translator.
3600If corresponding addresses was not found in the lookup tables, the packet
3601will not be dropped and the search continues.
3602.Ss XLAT464 CLAT translation
3603XLAT464 CLAT NAT64 translator implements client-side stateless translation as
3604defined in RFC6877 and is very similar to statless NAT64 translator
3605explained above.
3606Instead of lookup tables it uses one-to-one mapping between IPv4 and IPv6
3607addresses using configured prefixes.
3608This mode can be used as a replacement of DNS64 service for applications
3609that are not using it (e.g. VoIP) allowing them to access IPv4-only Internet
3610over IPv6-only networks with help of remote NAT64 translator.
3611.Pp
3612The CLAT NAT64 configuration command is the following:
3613.Bd -ragged -offset indent
3614.Bk -words
3615.Cm nat64clat
3616.Ar name
3617.Cm create
3618.Ar create-options
3619.Ek
3620.Ed
3621.Pp
3622The following parameters can be configured:
3623.Bl -tag -width indent
3624.It Cm clat_prefix Ar ipv6_prefix/length
3625The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3626to represent source IPv4 addresses.
3627.It Cm plat_prefix Ar ipv6_prefix/length
3628The IPv6 prefix defines IPv4-embedded IPv6 addresses used by translator
3629to represent destination IPv4 addresses.
3630This IPv6 prefix should be configured on a remote NAT64 translator.
3631.It Cm log
3632Turn on logging of all handled packets via BPF through
3633.Ar ipfwlog0
3634interface.
3635.It Cm -log
3636Turn off logging of all handled packets via BPF.
3637.It Cm allow_private
3638Turn on processing private IPv4 addresses.
3639By default
3640.Nm nat64clat
3641instance will not process IPv4 packets with destination address from private
3642ranges as defined in RFC1918.
3643.It Cm -allow_private
3644Turn off private address handling in
3645.Nm nat64clat
3646instance.
3647.El
3648.Pp
3649Note that the behavior of CLAT translator with respect to not matched
3650packets differs from stateful translator.
3651If corresponding addresses were not matched against prefixes configured,
3652the packet will not be dropped and the search continues.
3653.Sh IPv6-to-IPv6 NETWORK PREFIX TRANSLATION (NPTv6)
3654.Nm
3655supports in-kernel IPv6-to-IPv6 network prefix translation as described
3656in RFC6296.
3657The kernel module
3658.Cm ipfw_nptv6
3659should be loaded or kernel should has
3660.Cm options IPFIREWALL_NPTV6
3661to be able use NPTv6 translator.
3662.Pp
3663The NPTv6 configuration command is the following:
3664.Bd -ragged -offset indent
3665.Bk -words
3666.Cm nptv6
3667.Ar name
3668.Cm create
3669.Ar create-options
3670.Ek
3671.Ed
3672.Pp
3673The following parameters can be configured:
3674.Bl -tag -width indent
3675.It Cm int_prefix Ar ipv6_prefix
3676IPv6 prefix used in internal network.
3677NPTv6 module translates source address when it matches this prefix.
3678.It Cm ext_prefix Ar ipv6_prefix
3679IPv6 prefix used in external network.
3680NPTv6 module translates destination address when it matches this prefix.
3681.It Cm ext_if Ar nic
3682The NPTv6 module will use first global IPv6 address from interface
3683.Ar nic
3684as external prefix.
3685It can be useful when IPv6 prefix of external network is dynamically obtained.
3686.Cm ext_prefix
3687and
3688.Cm ext_if
3689options are mutually exclusive.
3690.It Cm prefixlen Ar length
3691The length of specified IPv6 prefixes.
3692It must be in range from 8 to 64.
3693.El
3694.Pp
3695Note that the prefix translation rules are silently ignored when IPv6 packet
3696forwarding is disabled.
3697To enable the packet forwarding, set the sysctl variable
3698.Va net.inet6.ip6.forwarding
3699to 1.
3700.Pp
3701To let the packet continue after being translated, set the sysctl variable
3702.Va net.inet.ip.fw.one_pass
3703to 0.
3704.Sh LOADER TUNABLES
3705Tunables can be set in
3706.Xr loader 8
3707prompt,
3708.Xr loader.conf 5
3709or
3710.Xr kenv 1
3711before ipfw module gets loaded.
3712.Bl -tag -width indent
3713.It Va net.inet.ip.fw.default_to_accept : No 0
3714Defines ipfw last rule behavior.
3715This value overrides
3716.Cd "options IPFW_DEFAULT_TO_(ACCEPT|DENY)"
3717from kernel configuration file.
3718.It Va net.inet.ip.fw.tables_max : No 128
3719Defines number of tables available in ipfw.
3720Number cannot exceed 65534.
3721.El
3722.Sh SYSCTL VARIABLES
3723A set of
3724.Xr sysctl 8
3725variables controls the behaviour of the firewall and
3726associated modules
3727.Pq Nm dummynet , bridge , sctp nat .
3728These are shown below together with their default value
3729(but always check with the
3730.Xr sysctl 8
3731command what value is actually in use) and meaning:
3732.Bl -tag -width indent
3733.It Va net.inet.ip.alias.sctp.accept_global_ootb_addip : No 0
3734Defines how the
3735.Nm nat
3736responds to receipt of global OOTB ASCONF-AddIP:
3737.Bl -tag -width indent
3738.It Cm 0
3739No response (unless a partially matching association exists -
3740ports and vtags match but global address does not)
3741.It Cm 1
3742.Nm nat
3743will accept and process all OOTB global AddIP messages.
3744.El
3745.Pp
3746Option 1 should never be selected as this forms a security risk.
3747An attacker can
3748establish multiple fake associations by sending AddIP messages.
3749.It Va net.inet.ip.alias.sctp.chunk_proc_limit : No 5
3750Defines the maximum number of chunks in an SCTP packet that will be
3751parsed for a
3752packet that matches an existing association.
3753This value is enforced to be greater or equal than
3754.Cm net.inet.ip.alias.sctp.initialising_chunk_proc_limit .
3755A high value is
3756a DoS risk yet setting too low a value may result in
3757important control chunks in
3758the packet not being located and parsed.
3759.It Va net.inet.ip.alias.sctp.error_on_ootb : No 1
3760Defines when the
3761.Nm nat
3762responds to any Out-of-the-Blue (OOTB) packets with ErrorM packets.
3763An OOTB packet is a packet that arrives with no existing association
3764registered in the
3765.Nm nat
3766and is not an INIT or ASCONF-AddIP packet:
3767.Bl -tag -width indent
3768.It Cm 0
3769ErrorM is never sent in response to OOTB packets.
3770.It Cm 1
3771ErrorM is only sent to OOTB packets received on the local side.
3772.It Cm 2
3773ErrorM is sent to the local side and on the global side ONLY if there is a
3774partial match (ports and vtags match but the source global IP does not).
3775This value is only useful if the
3776.Nm nat
3777is tracking global IP addresses.
3778.It Cm 3
3779ErrorM is sent in response to all OOTB packets on both
3780the local and global side
3781(DoS risk).
3782.El
3783.Pp
3784At the moment the default is 0, since the ErrorM packet is not yet
3785supported by most SCTP stacks.
3786When it is supported, and if not tracking
3787global addresses, we recommend setting this value to 1 to allow
3788multi-homed local hosts to function with the
3789.Nm nat .
3790To track global addresses, we recommend setting this value to 2 to
3791allow global hosts to be informed when they need to (re)send an
3792ASCONF-AddIP.
3793Value 3 should never be chosen (except for debugging) as the
3794.Nm nat
3795will respond to all OOTB global packets (a DoS risk).
3796.It Va net.inet.ip.alias.sctp.hashtable_size : No 2003
3797Size of hash tables used for
3798.Nm nat
3799lookups (100 < prime_number > 1000001).
3800This value sets the
3801.Nm hash table
3802size for any future created
3803.Nm nat
3804instance and therefore must be set prior to creating a
3805.Nm nat
3806instance.
3807The table sizes may be changed to suit specific needs.
3808If there will be few
3809concurrent associations, and memory is scarce, you may make these smaller.
3810If there will be many thousands (or millions) of concurrent associations, you
3811should make these larger.
3812A prime number is best for the table size.
3813The sysctl
3814update function will adjust your input value to the next highest prime number.
3815.It Va net.inet.ip.alias.sctp.holddown_time : No 0
3816Hold association in table for this many seconds after receiving a
3817SHUTDOWN-COMPLETE.
3818This allows endpoints to correct shutdown gracefully if a
3819shutdown_complete is lost and retransmissions are required.
3820.It Va net.inet.ip.alias.sctp.init_timer : No 15
3821Timeout value while waiting for (INIT-ACK|AddIP-ACK).
3822This value cannot be 0.
3823.It Va net.inet.ip.alias.sctp.initialising_chunk_proc_limit : No 2
3824Defines the maximum number of chunks in an SCTP packet that will be parsed when
3825no existing association exists that matches that packet.
3826Ideally this packet
3827will only be an INIT or ASCONF-AddIP packet.
3828A higher value may become a DoS
3829risk as malformed packets can consume processing resources.
3830.It Va net.inet.ip.alias.sctp.param_proc_limit : No 25
3831Defines the maximum number of parameters within a chunk that will be
3832parsed in a
3833packet.
3834As for other similar sysctl variables, larger values pose a DoS risk.
3835.It Va net.inet.ip.alias.sctp.log_level : No 0
3836Level of detail in the system log messages (0 \- minimal, 1 \- event,
38372 \- info, 3 \- detail, 4 \- debug, 5 \- max debug).
3838May be a good
3839option in high loss environments.
3840.It Va net.inet.ip.alias.sctp.shutdown_time : No 15
3841Timeout value while waiting for SHUTDOWN-COMPLETE.
3842This value cannot be 0.
3843.It Va net.inet.ip.alias.sctp.track_global_addresses : No 0
3844Enables/disables global IP address tracking within the
3845.Nm nat
3846and places an
3847upper limit on the number of addresses tracked for each association:
3848.Bl -tag -width indent
3849.It Cm 0
3850Global tracking is disabled
3851.It Cm >1
3852Enables tracking, the maximum number of addresses tracked for each
3853association is limited to this value
3854.El
3855.Pp
3856This variable is fully dynamic, the new value will be adopted for all newly
3857arriving associations, existing associations are treated
3858as they were previously.
3859Global tracking will decrease the number of collisions within the
3860.Nm nat
3861at a cost
3862of increased processing load, memory usage, complexity, and possible
3863.Nm nat
3864state
3865problems in complex networks with multiple
3866.Nm nats .
3867We recommend not tracking
3868global IP addresses, this will still result in a fully functional
3869.Nm nat .
3870.It Va net.inet.ip.alias.sctp.up_timer : No 300
3871Timeout value to keep an association up with no traffic.
3872This value cannot be 0.
3873.It Va net.inet.ip.dummynet.codel.interval : No 100000
3874Default
3875.Cm codel
3876AQM interval in microseconds.
3877The value must be in the range 1..5000000.
3878.It Va net.inet.ip.dummynet.codel.target : No 5000
3879Default
3880.Cm codel
3881AQM target delay time in microseconds (the minimum acceptable persistent queue
3882delay).
3883The value must be in the range 1..5000000.
3884.It Va net.inet.ip.dummynet.expire : No 1
3885Lazily delete dynamic pipes/queue once they have no pending traffic.
3886You can disable this by setting the variable to 0, in which case
3887the pipes/queues will only be deleted when the threshold is reached.
3888.It Va net.inet.ip.dummynet.fqcodel.flows : No 1024
3889Defines the default total number of flow queues (sub-queues) that
3890.Cm fq_codel
3891creates and manages.
3892The value must be in the range 1..65536.
3893.It Va net.inet.ip.dummynet.fqcodel.interval : No 100000
3894Default
3895.Cm fq_codel
3896scheduler/AQM interval in microseconds.
3897The value must be in the range 1..5000000.
3898.It Va net.inet.ip.dummynet.fqcodel.limit : No 10240
3899The default hard size limit (in unit of packet) of all queues managed by an
3900instance of the
3901.Cm fq_codel
3902scheduler.
3903The value must be in the range 1..20480.
3904.It Va net.inet.ip.dummynet.fqcodel.quantum : No 1514
3905The default quantum (credit) of the
3906.Cm fq_codel
3907in unit of byte.
3908The value must be in the range 1..9000.
3909.It Va net.inet.ip.dummynet.fqcodel.target : No 5000
3910Default
3911.Cm fq_codel
3912scheduler/AQM target delay time in microseconds (the minimum acceptable
3913persistent queue delay).
3914The value must be in the range 1..5000000.
3915.It Va net.inet.ip.dummynet.fqpie.alpha : No 125
3916The default
3917.Ar alpha
3918parameter (scaled by 1000) for
3919.Cm fq_pie
3920scheduler/AQM.
3921The value must be in the range 1..7000.
3922.It Va net.inet.ip.dummynet.fqpie.beta : No 1250
3923The default
3924.Ar beta
3925parameter (scaled by 1000) for
3926.Cm fq_pie
3927scheduler/AQM.
3928The value must be in the range 1..7000.
3929.It Va net.inet.ip.dummynet.fqpie.flows : No 1024
3930Defines the default total number of flow queues (sub-queues) that
3931.Cm fq_pie
3932creates and manages.
3933The value must be in the range 1..65536.
3934.It Va net.inet.ip.dummynet.fqpie.limit : No 10240
3935The default hard size limit (in unit of packet) of all queues managed by an
3936instance of the
3937.Cm fq_pie
3938scheduler.
3939The value must be in the range 1..20480.
3940.It Va net.inet.ip.dummynet.fqpie.max_burst : No 150000
3941The default maximum period of microseconds that
3942.Cm fq_pie
3943scheduler/AQM does not drop/mark packets.
3944The value must be in the range 1..10000000.
3945.It Va net.inet.ip.dummynet.fqpie.max_ecnth : No 99
3946The default maximum ECN probability threshold (scaled by 1000) for
3947.Cm fq_pie
3948scheduler/AQM.
3949The value must be in the range 1..7000.
3950.It Va net.inet.ip.dummynet.fqpie.quantum : No 1514
3951The default quantum (credit) of the
3952.Cm fq_pie
3953in unit of byte.
3954The value must be in the range 1..9000.
3955.It Va net.inet.ip.dummynet.fqpie.target : No 15000
3956The default
3957.Cm target
3958delay of the
3959.Cm fq_pie
3960in unit of microsecond.
3961The value must be in the range 1..5000000.
3962.It Va net.inet.ip.dummynet.fqpie.tupdate : No 15000
3963The default
3964.Cm tupdate
3965of the
3966.Cm fq_pie
3967in unit of microsecond.
3968The value must be in the range 1..5000000.
3969.It Va net.inet.ip.dummynet.hash_size : No 64
3970Default size of the hash table used for dynamic pipes/queues.
3971This value is used when no
3972.Cm buckets
3973option is specified when configuring a pipe/queue.
3974.It Va net.inet.ip.dummynet.io_fast : No 0
3975If set to a non-zero value,
3976the
3977.Dq fast
3978mode of
3979.Nm dummynet
3980operation (see above) is enabled.
3981.It Va net.inet.ip.dummynet.io_pkt
3982Number of packets passed to
3983.Nm dummynet .
3984.It Va net.inet.ip.dummynet.io_pkt_drop
3985Number of packets dropped by
3986.Nm dummynet .
3987.It Va net.inet.ip.dummynet.io_pkt_fast
3988Number of packets bypassed by the
3989.Nm dummynet
3990scheduler.
3991.It Va net.inet.ip.dummynet.max_chain_len : No 16
3992Target value for the maximum number of pipes/queues in a hash bucket.
3993The product
3994.Cm max_chain_len*hash_size
3995is used to determine the threshold over which empty pipes/queues
3996will be expired even when
3997.Cm net.inet.ip.dummynet.expire=0 .
3998.It Va net.inet.ip.dummynet.red_lookup_depth : No 256
3999.It Va net.inet.ip.dummynet.red_avg_pkt_size : No 512
4000.It Va net.inet.ip.dummynet.red_max_pkt_size : No 1500
4001Parameters used in the computations of the drop probability
4002for the RED algorithm.
4003.It Va net.inet.ip.dummynet.pie.alpha : No 125
4004The default
4005.Ar alpha
4006parameter (scaled by 1000) for
4007.Cm pie
4008AQM.
4009The value must be in the range 1..7000.
4010.It Va net.inet.ip.dummynet.pie.beta : No 1250
4011The default
4012.Ar beta
4013parameter (scaled by 1000) for
4014.Cm pie
4015AQM.
4016The value must be in the range 1..7000.
4017.It Va net.inet.ip.dummynet.pie.max_burst : No 150000
4018The default maximum period of microseconds that
4019.Cm pie
4020AQM does not drop/mark packets.
4021The value must be in the range 1..10000000.
4022.It Va net.inet.ip.dummynet.pie.max_ecnth : No 99
4023The default maximum ECN probability threshold (scaled by 1000) for
4024.Cm pie
4025AQM.
4026The value must be in the range 1..7000.
4027.It Va net.inet.ip.dummynet.pie.target : No 15000
4028The default
4029.Cm target
4030delay of
4031.Cm pie
4032AQM in unit of microsecond.
4033The value must be in the range 1..5000000.
4034.It Va net.inet.ip.dummynet.pie.tupdate : No 15000
4035The default
4036.Cm tupdate
4037of
4038.Cm pie
4039AQM in unit of microsecond.
4040The value must be in the range 1..5000000.
4041.It Va net.inet.ip.dummynet.pipe_byte_limit : No 1048576
4042.It Va net.inet.ip.dummynet.pipe_slot_limit : No 100
4043The maximum queue size that can be specified in bytes or packets.
4044These limits prevent accidental exhaustion of resources such as mbufs.
4045If you raise these limits,
4046you should make sure the system is configured so that sufficient resources
4047are available.
4048.It Va net.inet.ip.fw.autoinc_step : No 100
4049Delta between rule numbers when auto-generating them.
4050The value must be in the range 1..1000.
4051.It Va net.inet.ip.fw.curr_dyn_buckets : Va net.inet.ip.fw.dyn_buckets
4052The current number of buckets in the hash table for dynamic rules
4053(readonly).
4054.It Va net.inet.ip.fw.debug : No 1
4055Controls debugging messages produced by
4056.Nm .
4057.It Va net.inet.ip.fw.default_rule : No 65535
4058The default rule number (read-only).
4059By the design of
4060.Nm , the default rule is the last one, so its number
4061can also serve as the highest number allowed for a rule.
4062.It Va net.inet.ip.fw.dyn_buckets : No 256
4063The number of buckets in the hash table for dynamic rules.
4064Must be a power of 2, up to 65536.
4065It only takes effect when all dynamic rules have expired, so you
4066are advised to use a
4067.Cm flush
4068command to make sure that the hash table is resized.
4069.It Va net.inet.ip.fw.dyn_count : No 3
4070Current number of dynamic rules
4071(read-only).
4072.It Va net.inet.ip.fw.dyn_keepalive : No 1
4073Enables generation of keepalive packets for
4074.Cm keep-state
4075rules on TCP sessions.
4076A keepalive is generated to both
4077sides of the connection every 5 seconds for the last 20
4078seconds of the lifetime of the rule.
4079.It Va net.inet.ip.fw.dyn_max : No 8192
4080Maximum number of dynamic rules.
4081When you hit this limit, no more dynamic rules can be
4082installed until old ones expire.
4083.It Va net.inet.ip.fw.dyn_ack_lifetime : No 300
4084.It Va net.inet.ip.fw.dyn_syn_lifetime : No 20
4085.It Va net.inet.ip.fw.dyn_fin_lifetime : No 1
4086.It Va net.inet.ip.fw.dyn_rst_lifetime : No 1
4087.It Va net.inet.ip.fw.dyn_udp_lifetime : No 5
4088.It Va net.inet.ip.fw.dyn_short_lifetime : No 30
4089These variables control the lifetime, in seconds, of dynamic
4090rules.
4091Upon the initial SYN exchange the lifetime is kept short,
4092then increased after both SYN have been seen, then decreased
4093again during the final FIN exchange or when a RST is received.
4094Both
4095.Em dyn_fin_lifetime
4096and
4097.Em dyn_rst_lifetime
4098must be strictly lower than 5 seconds, the period of
4099repetition of keepalives.
4100The firewall enforces that.
4101.It Va net.inet.ip.fw.dyn_keep_states : No 0
4102Keep dynamic states on rule/set deletion.
4103States are relinked to default rule (65535).
4104This can be handly for ruleset reload.
4105Turned off by default.
4106.It Va net.inet.ip.fw.enable : No 1
4107Enables the firewall.
4108Setting this variable to 0 lets you run your machine without
4109firewall even if compiled in.
4110.It Va net.inet6.ip6.fw.enable : No 1
4111provides the same functionality as above for the IPv6 case.
4112.It Va net.inet.ip.fw.one_pass : No 1
4113When set, the packet exiting from the
4114.Nm dummynet
4115pipe or from
4116.Xr ng_ipfw 4
4117node is not passed though the firewall again.
4118Otherwise, after an action, the packet is
4119reinjected into the firewall at the next rule.
4120.It Va net.inet.ip.fw.tables_max : No 128
4121Maximum number of tables.
4122.It Va net.inet.ip.fw.verbose : No 1
4123Enables verbose messages.
4124.It Va net.inet.ip.fw.verbose_limit : No 0
4125Limits the number of messages produced by a verbose firewall.
4126.It Va net.inet6.ip6.fw.deny_unknown_exthdrs : No 1
4127If enabled packets with unknown IPv6 Extension Headers will be denied.
4128.It Va net.link.ether.ipfw : No 0
4129Controls whether layer2 packets are passed to
4130.Nm .
4131Default is no.
4132.It Va net.link.bridge.ipfw : No 0
4133Controls whether bridged packets are passed to
4134.Nm .
4135Default is no.
4136.It Va net.inet.ip.fw.nat64_debug : No 0
4137Controls debugging messages produced by
4138.Nm ipfw_nat64
4139module.
4140.It Va net.inet.ip.fw.nat64_direct_output : No 0
4141Controls the output method used by
4142.Nm ipfw_nat64
4143module:
4144.Bl -tag -width indent
4145.It Cm 0
4146A packet is handled by
4147.Nm ipfw
4148twice.
4149First time an original packet is handled by
4150.Nm ipfw
4151and consumed by
4152.Nm ipfw_nat64
4153translator.
4154Then translated packet is queued via netisr to input processing again.
4155.It Cm 1
4156A packet is handled by
4157.Nm ipfw
4158only once, and after translation it will be pushed directly to outgoing
4159interface.
4160.El
4161.El
4162.Sh INTERNAL DIAGNOSTICS
4163There are some commands that may be useful to understand current state
4164of certain subsystems inside kernel module.
4165These commands provide debugging output which may change without notice.
4166.Pp
4167Currently the following commands are available as
4168.Cm internal
4169sub-options:
4170.Bl -tag -width indent
4171.It Cm iflist
4172Lists all interface which are currently tracked by
4173.Nm
4174with their in-kernel status.
4175.It Cm talist
4176List all table lookup algorithms currently available.
4177.El
4178.Sh EXAMPLES
4179There are far too many possible uses of
4180.Nm
4181so this Section will only give a small set of examples.
4182.Ss BASIC PACKET FILTERING
4183This command adds an entry which denies all tcp packets from
4184.Em cracker.evil.org
4185to the telnet port of
4186.Em wolf.tambov.su
4187from being forwarded by the host:
4188.Pp
4189.Dl "ipfw add deny tcp from cracker.evil.org to wolf.tambov.su telnet"
4190.Pp
4191This one disallows any connection from the entire cracker's
4192network to my host:
4193.Pp
4194.Dl "ipfw add deny ip from 123.45.67.0/24 to my.host.org"
4195.Pp
4196A first and efficient way to limit access (not using dynamic rules)
4197is the use of the following rules:
4198.Pp
4199.Dl "ipfw add allow tcp from any to any established"
4200.Dl "ipfw add allow tcp from net1 portlist1 to net2 portlist2 setup"
4201.Dl "ipfw add allow tcp from net3 portlist3 to net3 portlist3 setup"
4202.Dl "..."
4203.Dl "ipfw add deny tcp from any to any"
4204.Pp
4205The first rule will be a quick match for normal TCP packets,
4206but it will not match the initial SYN packet, which will be
4207matched by the
4208.Cm setup
4209rules only for selected source/destination pairs.
4210All other SYN packets will be rejected by the final
4211.Cm deny
4212rule.
4213.Pp
4214If you administer one or more subnets, you can take advantage
4215of the address sets and or-blocks and write extremely
4216compact rulesets which selectively enable services to blocks
4217of clients, as below:
4218.Pp
4219.Dl "goodguys=\*q{ 10.1.2.0/24{20,35,66,18} or 10.2.3.0/28{6,3,11} }\*q"
4220.Dl "badguys=\*q10.1.2.0/24{8,38,60}\*q"
4221.Dl ""
4222.Dl "ipfw add allow ip from ${goodguys} to any"
4223.Dl "ipfw add deny ip from ${badguys} to any"
4224.Dl "... normal policies ..."
4225.Pp
4226The
4227.Cm verrevpath
4228option could be used to do automated anti-spoofing by adding the
4229following to the top of a ruleset:
4230.Pp
4231.Dl "ipfw add deny ip from any to any not verrevpath in"
4232.Pp
4233This rule drops all incoming packets that appear to be coming to the
4234system on the wrong interface.
4235For example, a packet with a source
4236address belonging to a host on a protected internal network would be
4237dropped if it tried to enter the system from an external interface.
4238.Pp
4239The
4240.Cm antispoof
4241option could be used to do similar but more restricted anti-spoofing
4242by adding the following to the top of a ruleset:
4243.Pp
4244.Dl "ipfw add deny ip from any to any not antispoof in"
4245.Pp
4246This rule drops all incoming packets that appear to be coming from another
4247directly connected system but on the wrong interface.
4248For example, a packet with a source address of
4249.Li 192.168.0.0/24 ,
4250configured on
4251.Li fxp0 ,
4252but coming in on
4253.Li fxp1
4254would be dropped.
4255.Pp
4256The
4257.Cm setdscp
4258option could be used to (re)mark user traffic,
4259by adding the following to the appropriate place in ruleset:
4260.Pp
4261.Dl "ipfw add setdscp be ip from any to any dscp af11,af21"
4262.Ss SELECTIVE MIRRORING
4263If your network has network traffic analyzer
4264connected to your host directly via dedicated interface
4265or remotely via RSPAN vlan, you can selectively mirror
4266some Ethernet layer2 frames to the analyzer.
4267.Pp
4268First, make sure your firewall is already configured and runs.
4269Then, enable layer2 processing if not already enabled:
4270.Pp
4271.Dl "sysctl net.link.ether.ipfw=1"
4272.Pp
4273Next, load needed additional kernel modules:
4274.Pp
4275.Dl "kldload ng_ether ng_ipfw"
4276.Pp
4277Optionally, make system load these modules automatically
4278at startup:
4279.Pp
4280.Dl sysrc kld_list+="ng_ether ng_ipfw"
4281.Pp
4282Next, configure
4283.Xr ng_ipfw 4
4284kernel module to transmit mirrored copies of layer2 frames
4285out via vlan900 interface:
4286.Pp
4287.Dl "ngctl connect ipfw: vlan900: 1 lower"
4288.Pp
4289Think of "1" here as of "mirroring instance index" and vlan900 is its
4290destination.
4291You can have arbitrary number of instances.
4292Refer to
4293.Xr ng_ipfw 4
4294for details.
4295.Pp
4296At last, actually start mirroring of selected frames using "instance 1".
4297For frames incoming from em0 interface:
4298.Pp
4299.Dl "ipfw add ngtee 1 ip from any to 192.168.0.1 layer2 in recv em0"
4300.Pp
4301For frames outgoing to em0 interface:
4302.Pp
4303.Dl "ipfw add ngtee 1 ip from any to 192.168.0.1 layer2 out xmit em0"
4304.Pp
4305For both incoming and outgoing frames while flowing through em0:
4306.Pp
4307.Dl "ipfw add ngtee 1 ip from any to 192.168.0.1 layer2 via em0"
4308.Pp
4309Make sure you do not perform mirroring for already duplicated frames
4310or kernel may hang as there is no safety net.
4311.Ss DYNAMIC RULES
4312In order to protect a site from flood attacks involving fake
4313TCP packets, it is safer to use dynamic rules:
4314.Pp
4315.Dl "ipfw add check-state"
4316.Dl "ipfw add deny tcp from any to any established"
4317.Dl "ipfw add allow tcp from my-net to any setup keep-state"
4318.Pp
4319This will let the firewall install dynamic rules only for
4320those connection which start with a regular SYN packet coming
4321from the inside of our network.
4322Dynamic rules are checked when encountering the first
4323occurrence of a
4324.Cm check-state ,
4325.Cm keep-state
4326or
4327.Cm limit
4328rule.
4329A
4330.Cm check-state
4331rule should usually be placed near the beginning of the
4332ruleset to minimize the amount of work scanning the ruleset.
4333Your mileage may vary.
4334.Pp
4335For more complex scenarios with dynamic rules
4336.Cm record-state
4337and
4338.Cm defer-action
4339can be used to precisely control creation and checking of dynamic rules.
4340Example of usage of these options are provided in
4341.Sx NETWORK ADDRESS TRANSLATION (NAT)
4342Section.
4343.Pp
4344To limit the number of connections a user can open
4345you can use the following type of rules:
4346.Pp
4347.Dl "ipfw add allow tcp from my-net/24 to any setup limit src-addr 10"
4348.Dl "ipfw add allow tcp from any to me setup limit src-addr 4"
4349.Pp
4350The former (assuming it runs on a gateway) will allow each host
4351on a /24 network to open at most 10 TCP connections.
4352The latter can be placed on a server to make sure that a single
4353client does not use more than 4 simultaneous connections.
4354.Pp
4355.Em BEWARE :
4356stateful rules can be subject to denial-of-service attacks
4357by a SYN-flood which opens a huge number of dynamic rules.
4358The effects of such attacks can be partially limited by
4359acting on a set of
4360.Xr sysctl 8
4361variables which control the operation of the firewall.
4362.Pp
4363Here is a good usage of the
4364.Cm list
4365command to see accounting records and timestamp information:
4366.Pp
4367.Dl ipfw -at list
4368.Pp
4369or in short form without timestamps:
4370.Pp
4371.Dl ipfw -a list
4372.Pp
4373which is equivalent to:
4374.Pp
4375.Dl ipfw show
4376.Pp
4377Next rule diverts all incoming packets from 192.168.2.0/24
4378to divert port 5000:
4379.Pp
4380.Dl ipfw divert 5000 ip from 192.168.2.0/24 to any in
4381.Ss TRAFFIC SHAPING
4382The following rules show some of the applications of
4383.Nm
4384and
4385.Nm dummynet
4386for simulations and the like.
4387.Pp
4388This rule drops random incoming packets with a probability
4389of 5%:
4390.Pp
4391.Dl "ipfw add prob 0.05 deny ip from any to any in"
4392.Pp
4393A similar effect can be achieved making use of
4394.Nm dummynet
4395pipes:
4396.Pp
4397.Dl "dnctl add pipe 10 ip from any to any"
4398.Dl "dnctl pipe 10 config plr 0.05"
4399.Pp
4400We can use pipes to artificially limit bandwidth, e.g.\& on a
4401machine acting as a router, if we want to limit traffic from
4402local clients on 192.168.2.0/24 we do:
4403.Pp
4404.Dl "ipfw add pipe 1 ip from 192.168.2.0/24 to any out"
4405.Dl "dnctl pipe 1 config bw 300Kbit/s queue 50KBytes"
4406.Pp
4407note that we use the
4408.Cm out
4409modifier so that the rule is not used twice.
4410Remember in fact that
4411.Nm
4412rules are checked both on incoming and outgoing packets.
4413.Pp
4414Should we want to simulate a bidirectional link with bandwidth
4415limitations, the correct way is the following:
4416.Pp
4417.Dl "ipfw add pipe 1 ip from any to any out"
4418.Dl "ipfw add pipe 2 ip from any to any in"
4419.Dl "dnctl pipe 1 config bw 64Kbit/s queue 10Kbytes"
4420.Dl "dnctl pipe 2 config bw 64Kbit/s queue 10Kbytes"
4421.Pp
4422The above can be very useful, e.g.\& if you want to see how
4423your fancy Web page will look for a residential user who
4424is connected only through a slow link.
4425You should not use only one pipe for both directions, unless
4426you want to simulate a half-duplex medium (e.g.\& AppleTalk,
4427Ethernet, IRDA).
4428It is not necessary that both pipes have the same configuration,
4429so we can also simulate asymmetric links.
4430.Pp
4431Should we want to verify network performance with the RED queue
4432management algorithm:
4433.Pp
4434.Dl "ipfw add pipe 1 ip from any to any"
4435.Dl "dnctl pipe 1 config bw 500Kbit/s queue 100 red 0.002/30/80/0.1"
4436.Pp
4437Another typical application of the traffic shaper is to
4438introduce some delay in the communication.
4439This can significantly affect applications which do a lot of Remote
4440Procedure Calls, and where the round-trip-time of the
4441connection often becomes a limiting factor much more than
4442bandwidth:
4443.Pp
4444.Dl "ipfw add pipe 1 ip from any to any out"
4445.Dl "ipfw add pipe 2 ip from any to any in"
4446.Dl "dnctl pipe 1 config delay 250ms bw 1Mbit/s"
4447.Dl "dnctl pipe 2 config delay 250ms bw 1Mbit/s"
4448.Pp
4449Per-flow queueing can be useful for a variety of purposes.
4450A very simple one is counting traffic:
4451.Pp
4452.Dl "ipfw add pipe 1 tcp from any to any"
4453.Dl "ipfw add pipe 1 udp from any to any"
4454.Dl "ipfw add pipe 1 ip from any to any"
4455.Dl "dnctl pipe 1 config mask all"
4456.Pp
4457The above set of rules will create queues (and collect
4458statistics) for all traffic.
4459Because the pipes have no limitations, the only effect is
4460collecting statistics.
4461Note that we need 3 rules, not just the last one, because
4462when
4463.Nm
4464tries to match IP packets it will not consider ports, so we
4465would not see connections on separate ports as different
4466ones.
4467.Pp
4468A more sophisticated example is limiting the outbound traffic
4469on a net with per-host limits, rather than per-network limits:
4470.Pp
4471.Dl "ipfw add pipe 1 ip from 192.168.2.0/24 to any out"
4472.Dl "ipfw add pipe 2 ip from any to 192.168.2.0/24 in"
4473.Dl "dnctl pipe 1 config mask src-ip 0x000000ff bw 200Kbit/s queue 20Kbytes"
4474.Dl "dnctl pipe 2 config mask dst-ip 0x000000ff bw 200Kbit/s queue 20Kbytes"
4475.Ss LOOKUP TABLES
4476In the following example, we need to create several traffic bandwidth
4477classes and we need different hosts/networks to fall into different classes.
4478We create one pipe for each class and configure them accordingly.
4479Then we create a single table and fill it with IP subnets and addresses.
4480For each subnet/host we set the argument equal to the number of the pipe
4481that it should use.
4482Then we classify traffic using a single rule:
4483.Pp
4484.Dl "dnctl pipe 1 config bw 1000Kbyte/s"
4485.Dl "dnctl pipe 4 config bw 4000Kbyte/s"
4486.Dl "..."
4487.Dl "ipfw table T1 create type addr"
4488.Dl "ipfw table T1 add 192.168.2.0/24 1"
4489.Dl "ipfw table T1 add 192.168.0.0/27 4"
4490.Dl "ipfw table T1 add 192.168.0.2 1"
4491.Dl "..."
4492.Dl "ipfw add pipe tablearg ip from 'table(T1)' to any"
4493.Pp
4494Using the
4495.Cm fwd
4496action, the table entries may include hostnames and IP addresses.
4497.Pp
4498.Dl "ipfw table T2 create type addr valtype ipv4"
4499.Dl "ipfw table T2 add 192.168.2.0/24 10.23.2.1"
4500.Dl "ipfw table T2 add 192.168.0.0/27 router1.dmz"
4501.Dl "..."
4502.Dl "ipfw add 100 fwd tablearg ip from any to 'table(T2)'"
4503.Pp
4504In the following example per-interface firewall is created:
4505.Pp
4506.Dl "ipfw table IN create type iface valtype skipto,fib"
4507.Dl "ipfw table IN add vlan20 12000,12"
4508.Dl "ipfw table IN add vlan30 13000,13"
4509.Dl "ipfw table OUT create type iface valtype skipto"
4510.Dl "ipfw table OUT add vlan20 22000"
4511.Dl "ipfw table OUT add vlan30 23000"
4512.Dl ".."
4513.Dl "ipfw add 100 setfib tablearg ip from any to any recv 'table(IN)' in"
4514.Dl "ipfw add 200 skipto tablearg ip from any to any recv 'table(IN)' in"
4515.Dl "ipfw add 300 skipto tablearg ip from any to any xmit 'table(OUT)' out"
4516.Pp
4517The following example illustrate usage of flow tables:
4518.Pp
4519.Dl "ipfw table fl create type flow:src-ip,proto,dst-ip,dst-port"
4520.Dl "ipfw table fl add 2a02:6b8:77::88,tcp,2a02:6b8:77::99,80 11"
4521.Dl "ipfw table fl add 10.0.0.1,udp,10.0.0.2,53 12"
4522.Dl ".."
4523.Dl "ipfw add 100 allow ip from any to any flow 'table(fl,11)' recv ix0"
4524.Ss SETS OF RULES
4525To add a set of rules atomically, e.g.\& set 18:
4526.Pp
4527.Dl "ipfw set disable 18"
4528.Dl "ipfw add NN set 18 ...         # repeat as needed"
4529.Dl "ipfw set enable 18"
4530.Pp
4531To delete a set of rules atomically the command is simply:
4532.Pp
4533.Dl "ipfw delete set 18"
4534.Pp
4535To test a ruleset and disable it and regain control if something goes wrong:
4536.Pp
4537.Dl "ipfw set disable 18"
4538.Dl "ipfw add NN set 18 ...         # repeat as needed"
4539.Dl "ipfw set enable 18; echo done; sleep 30 && ipfw set disable 18"
4540.Pp
4541Here if everything goes well, you press control-C before the "sleep"
4542terminates, and your ruleset will be left active.
4543Otherwise, e.g.\& if
4544you cannot access your box, the ruleset will be disabled after
4545the sleep terminates thus restoring the previous situation.
4546.Pp
4547To show rules of the specific set:
4548.Pp
4549.Dl "ipfw set 18 show"
4550.Pp
4551To show rules of the disabled set:
4552.Pp
4553.Dl "ipfw -S set 18 show"
4554.Pp
4555To clear a specific rule counters of the specific set:
4556.Pp
4557.Dl "ipfw set 18 zero NN"
4558.Pp
4559To delete a specific rule of the specific set:
4560.Pp
4561.Dl "ipfw set 18 delete NN"
4562.Ss NAT, REDIRECT AND LSNAT
4563First redirect all the traffic to nat instance 123:
4564.Pp
4565.Dl "ipfw add nat 123 all from any to any"
4566.Pp
4567Then to configure nat instance 123 to alias all the outgoing traffic with ip
4568192.168.0.123, blocking all incoming connections, trying to keep
4569same ports on both sides, clearing aliasing table on address change
4570and keeping a log of traffic/link statistics:
4571.Pp
4572.Dl "ipfw nat 123 config ip 192.168.0.123 log deny_in reset same_ports"
4573.Pp
4574Or to change address of instance 123, aliasing table will be cleared (see
4575reset option):
4576.Pp
4577.Dl "ipfw nat 123 config ip 10.0.0.1"
4578.Pp
4579To see configuration of nat instance 123:
4580.Pp
4581.Dl "ipfw nat 123 show config"
4582.Pp
4583To show logs of all instances:
4584.Pp
4585.Dl "ipfw nat show log"
4586.Pp
4587To see configurations of all instances:
4588.Pp
4589.Dl "ipfw nat show config"
4590.Pp
4591Or a redirect rule with mixed modes could looks like:
4592.Bd -literal -offset 2n
4593ipfw nat 123 config redirect_addr 10.0.0.1 10.0.0.66
4594			 redirect_port tcp 192.168.0.1:80 500
4595			 redirect_proto udp 192.168.1.43 192.168.1.1
4596			 redirect_addr 192.168.0.10,192.168.0.11
4597			 	    10.0.0.100	# LSNAT
4598			 redirect_port tcp 192.168.0.1:80,192.168.0.10:22
4599			 	    500		# LSNAT
4600.Ed
4601.Pp
4602or it could be split in:
4603.Bd -literal -offset 2n
4604ipfw nat 1 config redirect_addr 10.0.0.1 10.0.0.66
4605ipfw nat 2 config redirect_port tcp 192.168.0.1:80 500
4606ipfw nat 3 config redirect_proto udp 192.168.1.43 192.168.1.1
4607ipfw nat 4 config redirect_addr 192.168.0.10,192.168.0.11,192.168.0.12
4608				         10.0.0.100
4609ipfw nat 5 config redirect_port tcp
4610			192.168.0.1:80,192.168.0.10:22,192.168.0.20:25 500
4611.Ed
4612.Pp
4613Sometimes you may want to mix NAT and dynamic rules.
4614It could be achieved with
4615.Cm record-state
4616and
4617.Cm defer-action
4618options.
4619Problem is, you need to create dynamic rule before NAT and check it
4620after NAT actions (or vice versa) to have consistent addresses and ports.
4621Rule with
4622.Cm keep-state
4623option will trigger activation of existing dynamic state, and action of such
4624rule will be performed as soon as rule is matched.
4625In case of NAT and
4626.Cm allow
4627rule packet need to be passed to NAT, not allowed as soon is possible.
4628.Pp
4629There is example of set of rules to achieve this.
4630Bear in mind that this is example only and it is not very useful by itself.
4631.Pp
4632On way out, after all checks place this rules:
4633.Pp
4634.Dl "ipfw add allow record-state defer-action"
4635.Dl "ipfw add nat 1"
4636.Pp
4637And on way in there should be something like this:
4638.Pp
4639.Dl "ipfw add nat 1"
4640.Dl "ipfw add check-state"
4641.Pp
4642Please note, that first rule on way out doesn't allow packet and doesn't
4643execute existing dynamic rules.
4644All it does, create new dynamic rule with
4645.Cm allow
4646action, if it is not created yet.
4647Later, this dynamic rule is used on way in by
4648.Cm check-state
4649rule.
4650.Ss CONFIGURING CODEL, PIE, FQ-CODEL and FQ-PIE AQM
4651.Cm codel
4652and
4653.Cm pie
4654AQM can be configured for
4655.Nm dummynet
4656.Cm pipe
4657or
4658.Cm queue .
4659.Pp
4660To configure a
4661.Cm pipe
4662with
4663.Cm codel
4664AQM using default configuration for traffic from 192.168.0.0/24 and 1Mbits/s
4665rate limit, we do:
4666.Pp
4667.Dl "dnctl pipe 1 config bw 1mbits/s codel"
4668.Dl "ipfw add 100 pipe 1 ip from 192.168.0.0/24 to any"
4669.Pp
4670To configure a
4671.Cm queue
4672with
4673.Cm codel
4674AQM using different configurations parameters for traffic from
4675192.168.0.0/24 and 1Mbits/s rate limit, we do:
4676.Pp
4677.Dl "dnctl pipe 1 config bw 1mbits/s"
4678.Dl "dnctl queue 1 config pipe 1 codel target 8ms interval 160ms ecn"
4679.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4680.Pp
4681To configure a
4682.Cm pipe
4683with
4684.Cm pie
4685AQM using default configuration for traffic from 192.168.0.0/24 and 1Mbits/s
4686rate limit, we do:
4687.Pp
4688.Dl "dnctl pipe 1 config bw 1mbits/s pie"
4689.Dl "ipfw add 100 pipe 1 ip from 192.168.0.0/24 to any"
4690.Pp
4691To configure a
4692.Cm queue
4693with
4694.Cm pie
4695AQM using different configuration parameters for traffic from
4696192.168.0.0/24 and 1Mbits/s rate limit, we do:
4697.Pp
4698.Dl "dnctl pipe 1 config bw 1mbits/s"
4699.Dl "dnctl queue 1 config pipe 1 pie target 20ms tupdate 30ms ecn"
4700.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4701.Pp
4702.Cm fq_codel
4703and
4704.Cm fq_pie
4705AQM can be configured for
4706.Nm dummynet
4707schedulers.
4708.Pp
4709To configure
4710.Cm fq_codel
4711scheduler using different configurations parameters for traffic from
4712192.168.0.0/24 and 1Mbits/s rate limit, we do:
4713.Pp
4714.Dl "dnctl pipe 1 config bw 1mbits/s"
4715.Dl "dnctl sched 1 config pipe 1 type fq_codel"
4716.Dl "dnctl queue 1 config sched 1"
4717.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4718.Pp
4719To change
4720.Cm fq_codel
4721default configuration for a
4722.Cm sched
4723such as disable ECN and change the
4724.Ar target
4725to 10ms, we do:
4726.Pp
4727.Dl "dnctl sched 1 config pipe 1 type fq_codel target 10ms noecn"
4728.Pp
4729Similar to
4730.Cm fq_codel ,
4731to configure
4732.Cm fq_pie
4733scheduler using different configurations parameters for traffic from
4734192.168.0.0/24 and 1Mbits/s rate limit, we do:
4735.Pp
4736.Dl "dnctl pipe 1 config bw 1mbits/s"
4737.Dl "dnctl sched 1 config pipe 1 type fq_pie"
4738.Dl "dnctl queue 1 config sched 1"
4739.Dl "ipfw add 100 queue 1 ip from 192.168.0.0/24 to any"
4740.Pp
4741The configurations of
4742.Cm fq_pie
4743.Cm sched
4744can be changed in a similar way as for
4745.Cm fq_codel
4746.Sh SEE ALSO
4747.Xr cpp 1 ,
4748.Xr m4 1 ,
4749.Xr altq 4 ,
4750.Xr divert 4 ,
4751.Xr dummynet 4 ,
4752.Xr if_bridge 4 ,
4753.Xr ip 4 ,
4754.Xr ipfirewall 4 ,
4755.Xr ng_ether 4 ,
4756.Xr ng_ipfw 4 ,
4757.Xr protocols 5 ,
4758.Xr services 5 ,
4759.Xr init 8 ,
4760.Xr kldload 8 ,
4761.Xr reboot 8 ,
4762.Xr sysctl 8 ,
4763.Xr syslogd 8 ,
4764.Xr sysrc 8
4765.Sh HISTORY
4766The
4767.Nm
4768utility first appeared in
4769.Fx 2.0 .
4770.Nm dummynet
4771was introduced in
4772.Fx 2.2.8 .
4773Stateful extensions were introduced in
4774.Fx 4.0 .
4775.Nm ipfw2
4776was introduced in Summer 2002.
4777.Sh AUTHORS
4778.An Ugen J. S. Antsilevich ,
4779.An Poul-Henning Kamp ,
4780.An Alex Nash ,
4781.An Archie Cobbs ,
4782.An Luigi Rizzo ,
4783.An Rasool Al-Saadi .
4784.Pp
4785.An -nosplit
4786API based upon code written by
4787.An Daniel Boulet
4788for BSDI.
4789.Pp
4790Dummynet has been introduced by Luigi Rizzo in 1997-1998.
4791.Pp
4792Some early work (1999-2000) on the
4793.Nm dummynet
4794traffic shaper supported by Akamba Corp.
4795.Pp
4796The ipfw core (ipfw2) has been completely redesigned and
4797reimplemented by Luigi Rizzo in summer 2002.
4798Further
4799actions and
4800options have been added by various developers over the years.
4801.Pp
4802.An -nosplit
4803In-kernel NAT support written by
4804.An Paolo Pisati Aq Mt piso@FreeBSD.org
4805as part of a Summer of Code 2005 project.
4806.Pp
4807SCTP
4808.Nm nat
4809support has been developed by
4810.An The Centre for Advanced Internet Architectures (CAIA) Aq http://www.caia.swin.edu.au .
4811The primary developers and maintainers are David Hayes and Jason But.
4812For further information visit:
4813.Aq http://www.caia.swin.edu.au/urp/SONATA
4814.Pp
4815Delay profiles have been developed by Alessandro Cerri and
4816Luigi Rizzo, supported by the
4817European Commission within Projects Onelab and Onelab2.
4818.Pp
4819CoDel, PIE, FQ-CoDel and FQ-PIE AQM for Dummynet have been implemented by
4820.An The Centre for Advanced Internet Architectures (CAIA)
4821in 2016, supported by The Comcast Innovation Fund.
4822The primary developer is
4823Rasool Al-Saadi.
4824.Sh BUGS
4825The syntax has grown over the years and sometimes it might be confusing.
4826Unfortunately, backward compatibility prevents cleaning up mistakes
4827made in the definition of the syntax.
4828.Pp
4829.Em !!! WARNING !!!
4830.Pp
4831Misconfiguring the firewall can put your computer in an unusable state,
4832possibly shutting down network services and requiring console access to
4833regain control of it.
4834.Pp
4835Incoming packet fragments diverted by
4836.Cm divert
4837are reassembled before delivery to the socket.
4838The action used on those packet is the one from the
4839rule which matches the first fragment of the packet.
4840.Pp
4841Packets diverted to userland, and then reinserted by a userland process
4842may lose various packet attributes.
4843The packet source interface name
4844will be preserved if it is shorter than 8 bytes and the userland process
4845saves and reuses the sockaddr_in
4846(as does
4847.Xr natd 8 ) ;
4848otherwise, it may be lost.
4849If a packet is reinserted in this manner, later rules may be incorrectly
4850applied, making the order of
4851.Cm divert
4852rules in the rule sequence very important.
4853.Pp
4854Dummynet drops all packets with IPv6 link-local addresses.
4855.Pp
4856Rules using
4857.Cm uid
4858or
4859.Cm gid
4860may not behave as expected.
4861In particular, incoming SYN packets may
4862have no uid or gid associated with them since they do not yet belong
4863to a TCP connection, and the uid/gid associated with a packet may not
4864be as expected if the associated process calls
4865.Xr setuid 2
4866or similar system calls.
4867.Pp
4868Rule syntax is subject to the command line environment and some patterns
4869may need to be escaped with the backslash character
4870or quoted appropriately.
4871.Pp
4872Due to the architecture of
4873.Xr libalias 3 ,
4874ipfw nat is not compatible with the TCP segmentation offloading (TSO).
4875Thus, to reliably nat your network traffic, please disable TSO
4876on your NICs using
4877.Xr ifconfig 8 .
4878.Pp
4879ICMP error messages are not implicitly matched by dynamic rules
4880for the respective conversations.
4881To avoid failures of network error detection and path MTU discovery,
4882ICMP error messages may need to be allowed explicitly through static
4883rules.
4884.Pp
4885Rules using
4886.Cm call
4887and
4888.Cm return
4889actions may lead to confusing behaviour if ruleset has mistakes,
4890and/or interaction with other subsystems (netgraph, dummynet, etc.) is used.
4891One possible case for this is packet leaving
4892.Nm
4893in subroutine on the input pass, while later on output encountering unpaired
4894.Cm return
4895first.
4896As the call stack is kept intact after input pass, packet will suddenly
4897return to the rule number used on input pass, not on output one.
4898Order of processing should be checked carefully to avoid such mistakes.
4899