Rusty's Bleeding Edge Page

Thursday December 30 1999

The LinuxOne article hit LWN Daily thanks to the ubiquitous Bill Stearns (Mason author). Then it was picked up under the misleading title of ``Samba.org: LinuxOne Wastes Investors' Time, Money'': of course, samba.org simply hosts my site, and this is not an official Samba Team opinion...

Spent a day tracking down bugs in my skb-reservation patch mk II; turns out that one of the things I was chasing is a generic 2.3.35 bug. I'll look at that tomorrow (not much has changed in the network code, so probably most productive would be to see if it's in 2.3.34).

Alexey didn't like the last version, due to lack of generality. Hope he likes this one more. If not, I'll have to think hard.

Tuesday December 28 1999

Another fairly productive day: netfilter test suite all passes (with the exception of the helper module deletion case, where I haven't decided what correct behaviour should be). Sent patch to netdev.

Had dinner with Stephen Rothwell, Adrienne his wife and their kids Anthony and Jacqui; had a really great time. I guess the married guys in the office feel I could use some real cooking once in a while, and if this is the standard of hospitality I can expect, I should encourage it...

Monday December 27 1999

No one asked my opinion, but here it is anyway: LinuxOne. Executive summary: Bruce Perens was too kind. I really don't want my parents' friends to say ``I've heard of Linux: they took all that investors' money then disappeared.''

Busy running test suite; fixing bugs left and right, as expected. Made it through the packetfilter and conntrack part of the suite: just the NAT and backwards compatibility to go.

Today's hint: a spin_lock_bh() is not enough to stop timers from going off on SMP machines.

Sunday December 26 1999

NAT and compatibility code updating to match conntrack update. Found interface-going-down race in masquerading. Ended up providing new iteration-with-delete facility in the conntrack code, and removing conntrack->dead member.

The new code is definitely more refined. I'll be testing it and benching it against my good old real packet dumps (that's going to take some time, since they are in tcpdump format, and I want to write some decent playback tools).

Adding another 2GB to the system reminds me: I need to get on top of the backup situation RSN; the current two-big-disks approach isn't going to scale (I'm thinking a dedicated box with raid-5...).

Friday December 24 1999

Productive day. Spent this afternoon/evening reengineering the conntrack code to use the skb destructors. Found at least one race in the current traversal code, which took some serious reengineering to fix (but the code is much neater now).

With skb field reservation, I can have the ftp conntrack module actually store the offset and length of the address within the IP packet for use by the ftp NAT module, to avoid duplicated effort. That implies that it's a good idea.

Tomorrow (Christmas) I'm forcing myself to take the day off, but on Boxing Day I attack the NAT code, then it's down to testing...

Thursday December 23 1999

Set up netfilter-core mailing list at samba.org for Marc, myself and future core members. Basically it's for discussion of CVS work, and netfilter direction; non-core members may be given posting access on a case-by-case basis. There are plenty of good hackers who don't have time to be on the core team.

In looking through dev.c, I noticed that someone had misspelled my name at the top of the file, so I sent a quick patch off to Alan and Linus:

From: Rusty Russell <rusty@linuxcare.com.au>
To: torvalds@transmeta.com, alan@lxorguk.ukuu.org.uk
Subject: [PATCH] Trivial name typo.
Date: Thu, 23 Dec 1999 15:24:07 +1100

Just noticed this... 2.2 and 2.3.  This Russel disease must be stamped
out before it becomes widespread.

Rust.
--- linux-2.2/net/core/dev.c.~1~	Sun Dec  5 13:24:45 1999
+++ linux-2.2/net/core/dev.c	Thu Dec 23 15:20:21 1999
@@ -56,7 +56,7 @@
  *		Adam Sulmicki   :	Bug Fix : Network Device Unload
  *					A network device unload needs to purge
  *					the backlog queue.
- *	Paul Rusty Russel	:	SIOCSIFNAME
+ *	Paul Rusty Russell	:	SIOCSIFNAME
  */

 #include <asm/uaccess.h>

--
Hacking time.

Here's Linus' response, cc'd to Alan:

On Thu, 23 Dec 1999, Rusty Russell wrote:
>
> Just noticed this... 2.2 and 2.3.  This Russel disease must be stamped
> out before it becomes widespread.

There's a serious shortage of the etter "", and we're trying to seriousy
cut down our usage of the etter in order to improve conditions in the
worst affected areas.

ettes "" an "" ae aso affecte, and may une cetain cicumstances be
epace with the ette "x" which is in pentifu suppy.

Patch appie,

		inus

Tuesday December 21 1999

I'm moving over to pushing my traffic through `penicillin', my new SMP box; if it holds up for 24 hours I'll do a new release. All my boxes are punchlines of jokes: for those who don't know, the joke is `What do you give a man who has everything?'. Hey, they don't have to be good jokes.

Got 4 patches in the pipe for linux-kernel; they're piling up enough for me to actually create a `patches' mail folder so I don't drop any.

Monday December 20 1999

Chris Yeoh (LokiHack winner, old University friend) visited on the weekend; we hung out around Canberra and among other things watched Flying High (`Airplane!' for the Americans), since we discovered to our shock that Lisa hadn't seen it. Spent this afternoon watching the DVD of Dr Strangelove which friends gave me for (early) Christmas.

Jan Harkes finally caught a clean netfilter oops, which explains a number of problems people have been seeing.

Thursday December 16 1999

Locked out of my apartment; once I got back in, afraid to leave again that night. So much for hacking late last night; woke at 5:40 am this morning and was in around 7:00am today.

New netfilter release tonight, after I figured out why fragments usually weren't being forwarded. Wanted to test on the SMP box, but the damn thing has the hard drive on IDE3; I'll wait for a PC guru tomorrow to figure out how to deal with this and concentrate on tonight's release.

No Christmas for Rusty this year. I'll be taking the holidays themselves off (if you don't do that it's just simply too depressing), but the rest of the time will be playing catchup with netfilter, SMP and User Mode Linux.

Tuesday December 15 1999

Went home to Adelaide for a day and a half to have a virtual Christmas and box up the last of my stuff. Took some time to talk to people I should talk to more often. One lesson I've learnt in the last two days: my taste in friends is excellent, even if I don't always understand them. You know who you are: thanks.

Alexey (rapidly becoming my hero) corrected me on concurrency in 2.3.x: FYI here it is:

Subject: Re: Concurrency within netfilter hooks
Date: Tue, 14 Dec 1999 17:53:10 +0300 (MSK)

Hello!

> For 2.4, it won't happen, except for packets from userspace being
> interrupted by bottom halves and timers, 

Processes from userspace really overlap since 2.3.15.

> but this is changing: you can look into Alexey's crystal ball at

It is not necessary to look into magic crystals. 8)

- Hooks, executed in process context, i.e. all output, post-routing
  etc. must be multithreaded.

- Hooks (and all the code), usually executed from net_bh (input, forwarding)
  also must be multithreaded, but not softnet is reason for this.
  Netfilter itself creates concurrency in all the paths, which
  used to be executed in net_bh context, when it reinjects packets.

Essentially, softnet adds __nothing__ new to these rules, except for one thing:
concurency becomes common, rather than marginal phenomenon in all the paths.
Essentially, it is the main argument, why I do not jest when proposing
to add softnet before 2.4. All the complexity and all the bugs are already
in 2.3 and softnet only clarifies code and fixes bugs. 8)8)

Alexey

Sunday December 12 1999

Housewarming was last night, which was fun. Kinda quiet, but what do you expect from Canberra? Paulus was still recovering from his recent return from SF, but everyone else made it, including Hugh and Lucy's 2-week old baby girl Rachael, who was pretty well behaved. My lovely ex-flatmates turned up to make sure I really did have somewhere else to live, and there was no chance of me trying to move back in with them 8-).

Fragment problems still blowing up the test suite. Fragments suck rocks. People forwarding fragments through connection tracking are going to see really bad performance. We have to defragment, then the forward code refragments, then we refragment.

Saturday December 11 1999

Tridge called around this morning seeking a vote on the latest hire: looks good.

User Mode Linux package release scheduled for Monday the 20th; we should make it in time.

Friday December 10 1999

Found out something interesting about Tridge today. Expect to read it soon in my upcoming best-seller `Negotiating the Andrew Tridgell Way'.

Getting some (justified) flak for netfilter bugs. Need a new release this weekend, since gargle has been rock solid for 4 days for me. After this release I'll start running all of Linuxcare Ozlabs through it.

Tuesday December 7 1999

A productive day; maybe Telstra redirected the phones to Pizza Hut by accident or something. Conned Stephen into doing a Debian version of the User Mode Linux root filesystem, produced some (bad) web pages for the UML project, and registered usermodelinux.org.

My 386 seems to be netfiltering perfectly. And fairly fast, now that I've suppressed logging. Go figure; those bug reports must have been a subversive Microsoft plant 8-).

Monday December 6 1999

I've set up my laptop behind my 386 (running netfilter and masquerading for me), so I'm going to be living in my own shit from now on. This should help get the final bugs out.

Most of today was spent greeting people and drinking coffee; slow day. I've noticed that Tridge is starting to get edgy not coding, and he's really central to the office; I think that getting used to all the weirdness of being involved in a pre-IPO company is starting to get to us all.

Sunday December 5 1999

I cannot reproduce the 2.2.14-pre10 problem, despite stress testing this box for hours. I don't know why it doesn't happen to me, but I've sent mail to Richard Nelson in the hope that he can offer more clues.

iptables-save is written, I need to write iptables-restore. Also on my TODO list is the branch for netfilter 1.0 (the NAT replacement), which requires the new skb reservation code, which I need to feed to Alexey...

Finally got around to watching the last 3 episodes of Babylon 5; I think they lost some impact given the months-long hiatus I had before finally getting around to it (I'm not a TV person), but I definitely have that melancholy `end of a good long book' feeling after 5 years. I do plan on watching the entire thing again sometime, maybe one season a week. One day.

Saturday December 4 1999

Alexey rejected my patch on taste grounds. OK; we do it the hard way. Still want my skbuff reservation patch in, but I'll send that next.

Alan pulled my ipfw patch out for 2.2.14-pre11, because it seemed a likely candidate for memory corruption, and the problem has gone away. Symptoms are kmalloc corruption, which looks unlikely with my patch, which only alters locking. There's one dodgy `I assume this is safe' thing I did, which I'll try reverting.

Separated out the logging line-count patch and sent it to him separately in the meantime.

Friday December 3 1999

Sent off patch to Alexey for a taste-test, see if he likes it; it solves many of my dealing-with-fragment issues.

Camera crews came and went yesterday to interview Tridge and Dave Mandala. The interview showed tonight, and came across really well I thought.

Discovered that my sense of humor is not always appreciated by people I work with. At least it's not boring (Sam J. Bushell once had a T-shirt I always admired which went something like: `Where I come from, my behaviour is considered orthodox'). I guess I'm a goof.

Wednesday December 1 1999

Not much work done today. Updated the netfilter scoreboard to correctly age scores by 10% per month, and drop people with scores < 1 from the board.
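A minimal sketch of that aging rule in C, assuming the 10% is compounded once per elapsed month and that a score strictly below 1 is dropped. The struct layout and the function name age_scoreboard are hypothetical; the real scoreboard's format isn't shown here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical scoreboard entry; the actual scoreboard format is not shown. */
struct score_entry {
    const char *name;
    double score;
};

/*
 * Age every score by 10% for each elapsed month (compounded, which is an
 * assumption), then drop entries whose aged score is below 1. Survivors are
 * compacted to the front of the array in their original order.
 * Returns the number of entries kept.
 */
size_t age_scoreboard(struct score_entry *board, size_t n, unsigned months)
{
    size_t kept = 0;

    assert(board != NULL || n == 0);

    for (size_t i = 0; i < n; i++) {
        double aged = board[i].score;

        for (unsigned m = 0; m < months; m++)
            aged *= 0.9;

        if (aged < 1.0)
            continue;   /* fell off the board */

        board[kept].name = board[i].name;
        board[kept].score = aged;
        kept++;
    }
    return kept;
}
```

Calling age_scoreboard(board, n, 1) once a month gives the same result as calling it with a larger month count after a gap, apart from entries that would have been dropped earlier being dropped later.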

Revised Makefile tags patch again; `this time for sure'. We'll see if Linus digests the last one before deciding how to feed him this one.

Got keys for the new place, and moving in tonight; expect to spend the next few weeks shopping for odds and ends. Housewarming is on the 11th December (Paul Mackerras should be back by then).

Tuesday November 30 1999

More netfilter bugs. Got to find that damn crash; the race before is unlikely to be the reason IMHO, and releasing without all known bugs fixed is very very bad form. Also, an FTP SMP repeatable crash was reported by Christopher Faylor (note: still need an SMP test box, or enhance the user mode kernel).

Wrote the first cut of a User Mode Kernel HOWTO, which I hope Jeff polishes a little and we can then expand and release. Need to hack on module support too; if I'm really lucky, Marc Boucher will find my bug and make a netfilter release before I do (now I sorted out his samba.org CVS access).

Bugger; just found another fragment problem. I think I'll have to ask Alexey to move the

Signed the lease today on the apartment; move in tomorrow after work. The inimitable Miguel de Icaza sent a congratulatory email in his inimitable effusive style (Subject: WOOOHOOO!) on my move to Linuxcare.

Monday November 29 1999

Waded through netfilter bugs today. Found some nice ones, unfortunately; good news is that they're fixable. The weird one (mainly triggered running Netscape on the masquerading box, and widely reported) is still elusive. Reproducing this with some new tests is tomorrow's priority.

Did some work on User Mode Linux yesterday; incorporated in a new release. I'm trying to convince Jeff Dike to take it to the next level with a core team, regular releases with announcements, and get some real momentum up. This is an extremely important development, since otherwise you can't really debug a kernel without duplicate hardware or VMWare; and everyone knows what I think about making kernel development dependent on proprietary software.

Moving into apartment Wednesday; Real Estate agents suck. Booked travel back to Adelaide for a few days to gather my stuff.

For future reference: don't stay with friends for longer than three weeks at the outside, however wonderful they may be; they're great to live with, but for a moment consider that I may not be.

Sunday November 28 1999

OK. Wow, I have to keep this more up-to-date. That's it: at least 5 times a week from now on. Really.

This week I will do another netfilter release; preferably with the fragment and crash fixes; I found a race, thanks to discussions with Paul Mackerras (Linux PPC legend and all-round nice guy), and reworked some locking, but I don't think that's the problem.

Compressed read-only loopback also needs a release; there seems to be a great deal of interest in this code, so I'm brushing it up for inclusion in 2.4 as experimental.

I must say living with two attractive intelligent women who regard a bath towel as suitable morning attire has been a wonderful experience, but I'm finally moving into my new place on Tuesday; inner-city furnished two bedroom apartment. Two-bedroom so I can finally return some of the offers to crash at other hackers' places around the world (thanks guys!) without having to resort to the sofa.

Saturday November 20 1999

Officially signed with Linuxcare on Thursday 11th November. Linuxcare seems like a really good company, but committing to any company is something that was a big decision for me (having been working for myself for the last several years). The contract you sign is pretty scary; I finally decided that I would quit on the slightest hint of conflict with my Open Source commitments; fuck the options. Made me feel much better.

The trip was good, but hell. I'll take a couple of days off, and just rest; when I get too little sleep for an extended period, I get oversensitive and generally useless. This weekend was filled with interviews, dinners and flying to Melbourne for an engagement party, so I didn't manage to get my recovery time.

There's a netfilter bug in masquerading local packets. Must look into it.

Tuesday November 9 1999

Sorry; long time no write. Things here are fun; caught up with some people I haven't seen in a while, which is always great.

Netfilter interest is picking up as more people realize that we're missing functionality, and that it's fairly easy to do it. I'm hoping that Daniel Stone will come up with a workable IRC module. Fingers crossed.

I've been distracted by this trip; and today was a particularly poor day towards the end. I'm disappointed not to be getting to Ottawa this trip; maybe next time.

Monday November 1 1999

Spent Friday afternoon (a very long day, crossing the date line and all), Saturday and Sunday in at LinuxCare, talking to people and kicking the tires. Finally met Rick Moen and his SO Saoirse Deirdre: I knew Rick by reputation only, and it turns out he's a great guy with excellent taste in restaurants and impeccable manners when being questioned by policemen.

Turns out that I can get a flight from here (SF) to Montreal for US$400; given the amount (and quality) of work which Marc Boucher has done on netfilter recently, I can't turn down the opportunity to meet him, even if it means another four flights (via Chicago).

In my copious free time, I'm trying to write a compressed block device for the next generation of the LinuxCare Bootable Business Card, because it looks like a fun hack. I'm also trying to talk to everyone here about what they're doing, and what's happening; it's heaps of fun.

Friday October 29 1999

Caught 6:50am flight from Canberra... It's now 11:15pm Canberra time, and we're landing in LAX in under an hour. Then the connection to SF, then (hopefully) sleep. Got some reading and a serious amount of freeciv in on this flight.

Thursday October 28 1999

Flying off to SF to visit LinuxCare tomorrow morning, so naturally I did a netfilter release tonight. Good luck, guys 8-).

Tuesday October 26 1999

Didn't do a pre-release, talked to Tridge and now have an anonymous CVS instead (gave up on holding out for Bitkeeper). Unfortunately, being on samba.org, I can't give others write access, otherwise we'd be really close to getting a core team (first person to hit 100 on the scoreboard would have to be a strong contender).

Marc Boucher submitted a patch against CVS already; a nice fix for ftp (which the testsuite should have found, but didn't, because it's too simple).

I trialled Andrew Tridgell's separate-the-men-from-the-boys support questionnaire today; took me 1 hour, I got 6/10, and I didn't do it properly. If anyone beats me, they can have my job.

Monday October 25 1999

Got my NAT changes to pass the test suite today: I might do a pre-release.

Sunday October 24 1999

Marc Boucher sent in an excellent patch: I think I prefer his fix (with a modification) over mine. This makes Marc a candidate for a core team, if I ever form one (Peter Benie would be the other strong candidate). It would mean setting up a CVS tree, which I can now do (thanks to LinuxCare's OzLabs).

Finally printed and read Alexey's documentation on the `ip' command last night, and was stunned; I was expecting to wade through an incredibly complex and obtuse document, but it's fantastic. Is there anything Alexey can't do? I must meet this guy (he got an invitation to submit to the Australian Linux Expo, but AFAIK he didn't respond).

Friday October 22 1999

Busy: looking for furnished 1br apartments in inner Canberra is not fun. But my copy of Schimmel's Unix Systems for Modern Architectures has arrived: I know what I'm doing this weekend (after looking at two grotesquely overpriced places).

Netfilter debugging continues; problems, but I will overcome.

Tuesday October 19 1999

I'm going to start updating my diary more regularly: every 1-3 days, since the others are doing it. Horrible picture huh?

Watching Paul Mackerras do the IBM RS/6000 port is cool: he's showing an admirable degree of persistence and it's paying off: the bootloader and serial port work, but Linux proper doesn't boot yet. Maybe tomorrow. You can tell he's done this kind of thing before.

Second version of the Linux Graphing Project is up; I'm getting this one printed out. gargle (my 386 test box) now boots again, and is on the network: I'm compiling a new kernel for it to run the netfilter testsuite. Double-NAT and ftp fixes done.

Spoke with Art Tyde on the phone: the great thing about Linux work has been the quality of people you meet, and talking with Art reminded me of that. Added him to my mental list of people to meet.

Sunday October 17 1999

What a complex week. Moved to Canberra Wednesday, went straight from airport into the LinuxCare office to set up and start hacking (having taken time off earlier in the week to pack and say goodbye to people, I already felt I was behind). There was random setting up and touring of facilities, and dumping my stuff and setting up.

I think this will work out really well: I just need to make sure that it doesn't all go horribly wrong (I think it was an Apple employee who said `A people hire A people; B people hire B and C people', and that applies here).

I'm staying in Canberra with my old friend Lisa and her stunning flatmate Vanessa; I'm looking for my own place, and won't be here long; at least hopefully before Vanessa gets sick of my drooling and throws me down the stairs. I'm sure she'll miss me when I'm gone. Sure.

Thursday I went to Tridge's lecture on parallel external sorting, which was really interesting. That night went to an SGI future-directions talk, which was mainly Linux; it looks very good. My 386 should arrive tomorrow; if it does, I can finish dual-table NAT, run the testsuite and get netfilter 0.1.11 out the door.

Sunday October 10 1999

netdev list is down again, and Miguel's too busy to fix it. My patch is in limbo; I'll have to send it straight to Dave, Alexey and Andi, which I hate doing.

Errors on my HDD this evening: running badblocks over /home found some. This is a worry: I backed up (full) last night, and did an incremental immediately after the error. I don't want to lose this drive. I have been thrashing it a lot to produce my images, but I expect my hardware to simply take it. Preparing for the Canberra move.

Thursday October 7 1999

netfilter is coming along nicely: got distracted doing a `finally-rid-of-fwmark' patch: see how Dave and the others like it.

BIG NEWS! I'm going to Canberra next week until March (IETF), to work in the same office as the LinuxCare guys (ie. ANU people: Tridgell, Mackerras, Rothwell). I've postponed the knee recon until after that. Tridge wants me to work for LinuxCare, which would be kinda fun, but I'm still with WatchGuard at the moment.

This week's distraction project is starting to bear some fruit as well, but it's gonna have to hold for a while.

Tuesday October 5 1999

0.1.10 is out, with a cleanup patch right behind it. The conntrack resizing, and multiport testsuites are done as well; quite a productive day. The big NAT rework is tomorrow, then doco and release. 0.1.10 released early because it needed permissions checks which used to be done by the netfilter framework, but soon won't be.

Hoping for a productive few days to get 0.1.11 out the door. Glad I'm not going to ALS, because I'm just starting to get on a roll, and travelling right now would fuck it up.

Sunday October 3 1999

Peter Benie is racking up points on the old scoreboard; it's nice to have patches I can basically just apply rather than have to scour through. Given the number of mods which have been coming in, I'd better release 0.1.10 soon; I also added loop detection today.

Documentation has fallen behind again, and needs updating. That's what I'll be working on for a while; other than that the 0.1.10 release is almost ready, just the NAT rewrite to go.

Wednesday September 29 1999

The scoreboard seems to be getting some attention; if it actually yields a decent core team, it'll be worth it. At least I update it more regularly than my diary 8-)

0.1.9 released, another couple of minor bugs reported, and I realized a fairly significant one (hint: don't insmod the ip_nat module for the first time in heavy traffic). More stuff on the scoreboard, and a major documentation update going on at the moment (I've been slack).

Monday September 27 1999

Debugging NAT now. Fixed local NFS and the gcc 2.7.2 compilation bugs. 0.1.9 should be a good release (unless you're running SMP, in which case, good luck!).

After bitching in my last entry about netfilter not turning into a Bazaar project, I got an empathetic mail from Bill Stearns, who has similar issues with Mason. I thought about it for a bit, then decided to make a scoreboard of contributors; sure, it'll be a drain to process, but if it works in encouraging people to participate, it's a small price to pay.

My frustrations, however, must have been showing: I sent a mail to linux-kernel about the impenetrability of the networking code. Alexey took it with good humour, but as I started writing it I realized how trivial much of it is to fix: it's not bad code, it's just the naming of structure members and functions is such a mess. The skbuff functions are too widely-used to be repaired, but most of the non-exported functions could well be fixed without too much disruption.

It's important, which means I'd better do it.

Thursday September 23 1999

Tiring couple of days: banging my head against locking in Linux. Despite my hopes in writing extensive kernel documentation, there simply isn't enough talent and interest to make netfilter a Bazaar-style project. This means that if I don't figure this out, no one will, and that kind of sucks. At least if I weren't on the edge of the planet out here there might be other people who I could cross ideas with.

The left mouse button on my VAIO stopped working too; I can tap the pad to get the same effect, but can't chord the middle button (ie. no paste in xterms). I'm chasing down the paperwork to see if I need to return it to the US to get it repaired; either way, getting it fixed is going to cost serious time I simply don't have. Looks like I'll have to live with it.

Found bug (I mod_timer() then add_timer() for tcp, due to a reordering ARGH). Going to bed.

Tuesday September 21 1999

LinuxSA meeting tonight. Richard Sharpe talked about his development: quite interesting. I offered to give an `Idiot's Guide to Kernel hacking Unleashed in a Nutshell for Dummies in 24 hours!' talk; not that I feel that I'm the person to give a talk, but I want more feedback for my HOWTO.

Locking coding finished in conntrack: for NAT I got lazy, but it'll get better next release (promise... well, maybe not). First test tomorrow. Sent Andi Kleen my Netwinder (it was just gathering dust here, and Andi said he wanted one); cost me about $35 shipping. Ordered Curt Schimmel's book from Amazon, since it's not available for 4-5 weeks locally, and I want to read it.

Monday September 20 1999

Yesterday I tried to get my body clock back to Australian time, and I'm paying for it today. Still, got the basic connection track locking rewritten: reference counts on the conntracks, and relying on single-user-context access and synchronize_bh() for the protocol and helper registration.

Normally, conntrack locking would be simple: the reference count starts at one, and every skb which is attached to it bumps the count. Destroying the skb would drop the count. If the connection times out, mark it as dead and drop the count by one. Whoever drops it to zero gets to free it. Great.
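A minimal user-space sketch of that simple scheme, using C11 atomics rather than kernel primitives. Everything here is an assumption for illustration: struct conn, conn_get(), conn_put() and conn_timeout() are hypothetical names, not the netfilter code, and "an skb attaches" is modelled as a plain conn_get() call.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical stand-in for a connection-tracking entry. */
struct conn {
    atomic_int refcount;   /* 1 for the table itself, +1 per attached packet */
    atomic_bool dead;      /* set once the connection has timed out */
};

/* Create a connection; the count starts at one. */
struct conn *conn_new(void)
{
    struct conn *c = malloc(sizeof(*c));

    if (c == NULL)
        return NULL;
    atomic_init(&c->refcount, 1);
    atomic_init(&c->dead, false);
    return c;
}

/* A packet attaching to the connection bumps the count. */
void conn_get(struct conn *c)
{
    atomic_fetch_add(&c->refcount, 1);
}

/*
 * Drop one reference. Whoever takes the count to zero frees the entry.
 * Returns true if this call did the freeing.
 */
bool conn_put(struct conn *c)
{
    int old = atomic_fetch_sub(&c->refcount, 1);

    assert(old > 0);
    if (old == 1) {
        free(c);
        return true;
    }
    return false;
}

/*
 * Timeout: mark the connection dead and drop the initial reference.
 * Only the first timeout drops it, so a repeated call by someone who
 * still holds a reference is harmless. Returns true if the entry was freed.
 */
bool conn_timeout(struct conn *c)
{
    if (atomic_exchange(&c->dead, true))
        return false;
    return conn_put(c);
}
```

In this sketch a timeout while a packet is still attached only marks the entry dead; the memory goes away when that packet's conn_put() runs, which is the "whoever drops it to zero gets to free it" rule.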

Except I don't get to track skb destruction, so it's harder. Basically, you take the skb, you do a read lock on the hash, find the connection, bump its reference and drop the read lock. Then you can play with the conntrack all you want (as long as you don't want to alter it; for that you'd need another lock; I'd better check that). When you release the skb, you drop the reference.

This has all kinds of ugly side effects, such as: what happens when a connection track is deleted and then the NAT looks for it? (Answer: don't do that; we always delete on a timer, ick).

Moreover, helpers and protocols have a different problem. You could use the same trick: when a new connection comes in, bump the protocol or helper reference count, and when it's destroyed drop it. Unfortunately, connections can last a very long time, and you don't want to have to wait for them all to expire before you can rmmod the helper.

Good locking is hard. Hard locking is bad.

Sunday September 19 1999

Yep, 0.1.8 released two days ago. Seems OK; am still stupidly blocking interrupts; rewriting locking now (but I could have simply changed the _irq locks to _bh locks and kept 99% of people happy, I realized after diving into it). Finished LM article (they're getting harder). Bought `Artist's Guide to the GIMP', which so far isn't great (but then I'm not an artist).

Linux Kongress was good: met some new faces; highlight of the trip was meeting Andi Kleen. Now if I have a vodka with Alexey, I can die happy. Andi convinced me to expand my planned kernel locking HOWTO into a kernel hacking HOWTO: that's bounced around a little now, and is ready for first release.

It was at Linux Kongress that someone mentioned the credit to Rob Malda in the kernel: I thought Alexey had dropped that patch (I didn't look carefully though, obviously). It's amazing how many people actually read the kernel; I sent mail to Rob, and he indicated that I was not the first to tell him.

I'm not travelling again for a while: certainly not ALS. I lost far too much time, and even now my body clock is badly fucked up (don't want to lose more time trying to sync it). The Bazaar is also out.

Monday September 6 1999

Released 0.1.7 late last night: of course, it has a one-liner bug in it. Just tidying up before hopping on the plane: travelling to Germany is always a chore (Adelaide->Melbourne->Singapore->Frankfurt->Munich/Augsburg).

I know what's going to be in 0.1.8; should be finished by the time I return on Wednesday the 15th. Have to write my Linux Magazine article, and want to rewrite my netfilter talk for Linux Kongress; this is what planes and spare laptop batteries are for (thank god for my VAIO).

0.1.8 should have the static mapping stuff (which means rewriting the ipnatctl shared library infrastructure closer to the iptables one), stateful packet filtering, chain renaming, and more testsuites work.

Saturday September 4 1999

Furiously working on 0.1.7, to get it completed before I jump on the plane to Germany midday Monday (then I'm out of the loop for > 24 hours at best). Main bug on buglist is SMP; even on UP machines, an SMP kernel seems to behave oddly from the reports I get. I've delayed this as it means two entire kernel rebuilds (one to SMP and one back again), and I've been doing other stuff, but the complaints keep coming in so it's top of the list.

Argh: I wrote netfilter because of all the cool stuff I could do on top of it (especially userspace), but I'm still caught in the kernel, while cool stuff (like the easter egg in iptables) tempts me away...

Oh, and The Bazaar is actually happening; Steve Blood sent me a mail. That would make 6 conferences this year, which is about 3 too many. On the other hand, I like the idea of The Bazaar... so I'm delaying my decision.

ARGH: late news. I figured out the SMP problem! Fuck. CONFIG_SMP isn't enough to control SMP, you need __SMP__ which all the headers use. I hacked this in for some modules, but not globally. Fixed in the Makefile: must retransmit cleanup patch to Linus.

Thursday September 2 1999

It wasn't the power switch; looks like the power supply. Local nfs solution will have to wait until I get a new one (tomorrow morning).

Compiling 2.3.16 SMP now, to see if my testsuite still passes. 2.3.16 broke initfunc, so my 0.1.5 doesn't compile, forcing a release. The `local connect to masquerade' bug can be neatly solved by a small kernel patch I sent to netdev: Alexey may not like it, in which case I'll work around it at my end. With that patch, my entire test suite should pass, and the only bug left is the weird NFS one I found.

Not many people using 2.3 kernels: haven't had the flood I expected. I think the fs corruption problems plus the fact that 2.3.16 doesn't compile on UP has saved me from a trial-by-fire, and gentled the user upramp. I did need the increase in users, however, to flush some bugs; testsuites can't do everything.

On the home front, my knee seems to be improving nicely; by the time I return from Linux Kongress in a week, I should be able to drive again. Then I go in for the reconstruction in mid-October (this is the second time I've torn the Anterior Cruciate Ligament in my left knee, and I'm sick of it), and I'll be off my feet for two-three weeks. Delaying Canberra move. 8-(.

Wednesday September 1 1999

Fixing bugs for netfilter, only two to go (plus one I discovered). Unfortunately, after blowing half a day getting my 386 set up as a scratch machine, and getting another half day of really productive work, I discovered this morning that it won't turn on: looks like the rocker power switch is gone.

So I'm back to experimenting on my production box, slow and dangerous work. Tomorrow I'll head into town and see if I can get a replacement switch of some kind.

One interesting bug has got me slightly stuck: fixing it one way requires a kernel change, and fixing it another requires a semantic change. I've proposed the kernel change to netdev, but I don't think Alexey will like what I've done (he'll almost certainly say I shouldn't be altering the source of a packet after it's been routed). That leaves only two known bugs: the fact that defragmentation and local nfs traffic don't seem to mix (which I need my 386 box back to test) and the crashing when rmmod'ing on SMP kernels.

So much for a release tonight. Oh well.

Monday August 30 1999

Bug reports coming in for netfilter. Most are simple one-liners. Each one is getting a new test written in my test suite. Ignoring jet lag for the moment. Setting up test machine because I don't trust the 2.3 series: it probably still corrupts file systems and I don't need that.

Talked to David Bonn of WatchGuard while I was in Seattle: because of lucrative (and numerous) offers elsewhere, I will probably be leaving them. Now that netfilter is in the official kernel it seems a good time. They're more than happy to keep paying me, but I'd prefer to start moving into some other area (netfilter will occupy me for some time to come though).

Oh, and I moved my diary. Got to go fight more bugs...

Thursday August 26 1999

Yes! I'm in the official kernel! That marks the end of my holiday though 8-(. Just as well, since I tore the ligament in my left knee in Cody, Wyoming (I jumped off a playground swing, much to the amusement of Ace). Got 0.1.5 rushed out the door (some new tests in the testsuite don't pass, but that's probably the tests, rather than a flaw).

Tomorrow I fly back to Australia, and don't really get to rest: I have to do some kernel patches and process bug reports (I expect a reasonable number).

I'm going out tonight with Ace to celebrate. Chinese all round 8-).

Monday August 23 1999

Long time no write. Much has happened. The netfilter mirrors are up and running; however, they are out of date. Alexey Kuznetsov merged my netfilter stuff (with some locking changes which I have studied with some interest: I really do have to write the Kernel Locking HOWTO now). I made some fixes on top of that, and he should be pushing it to Linus any day now.

I've been nominally taking two weeks off (bad timing, but isn't it always?); after my tutorial at LinuxWorld, Ace and I have been touring the US: Disneyland before the conference, San Francisco afterwards, then Vegas, Denver, and a huge driving trip through Wyoming, ducking into Utah, and back to Wyoming/Montana for Yellowstone. In Cody, Wyoming, I threw my left knee out again, and currently am hobbling on crutches. It's probably reopened the partial tear in my Anterior Cruciate Ligament. I'm hoping to be back on my feet for Linux Kongress, but from previous experience I'll have a marked limp.

Anyway, my net access has been really dodgy: some of these hotels don't have direct long-distance dialling from the rooms (you need to use credit cards). My Sony Vaio has been great for these long trips: each battery is worth about 4 hours of Freeciv.

At LinuxWorld Expo, I crashed Alexey's kernel (with my mods) with my test suite, and corrupted my /home really badly. Ted Ts'o was interested because fsck didn't fix it *sigh*. I've been nervous about further testing while I've only got one box here, and no decent net. A number of people have suggested VMWare, but I figure if you need a proprietary piece of software to develop Free software, we might as well make the whole thing proprietary.

Should be able to handle stairs by the time I reach Canberra, which is a requirement for getting into the space the guys rented. I should learn to be more careful.

Sunday August 1 1999

rustcorp.com is still down; has been unreachable for three days now. I've contacted a few people to ask for netfilter mirrors, and Andrew Tridgell has given me a netfilter mailing list on lists.samba.org.

Got a reply already from Jim Pick of kernelnotes.org, so it looks like http://netfilter.kernelnotes.org will be the first site. www.kernelnotes.org is my homepage, so this is really nice; I owe Jim a beer or three.

Andi Kleen gave my latest patches (up on my ISP's web space for want of a real site) the thumbs up, hence the need for a reliable site: once netfilter goes into the development kernels, it's going to need a set of reliable sites.

Wednesday July 28 1999

Nightmare day. First, someone found a hole in the ipchains kernel code; one I should never have missed. The changes I made at the request of either Alexey, Andi or Dave (can't recall exactly) before inclusion in the main kernel turned out to be fatal; they wanted me not to drop packets I didn't have to: my code then became too liberal. No one noticed for a while, but it's going to be hard to hold my head up at LinuxWorld after this error.

I read the report this evening when someone sent it to ipchains (no, I still haven't been able to subscribe to bugtraq, even though I try every six months or so). My first reaction was to jump online and look at the bugtraq archives to see the response. That's when I found out that my ISP (Camtech, now OzEMail) had cancelled my account: calling the helpdesk revealed that it had expired on the 11th. Of course, I had renewed online after they sent me a letter; he told me to take it up with accounts, which is only open during office hours.

Tomorrow I get a new ISP; Camtech's service was OK, but their billing system was always completely fucked, and it finally bit me.

So I made my patch, and wrote some EMails, dropped them on a floppy and called Duncan Grove (Michael was out somewhere, got his answering machine). A couple of hours later, I was in the University updating my web page and dumping a couple of mails to the net using telnet port 25.

Netfilter work continues; I fixed the truncated-packets problem in both the new ip_tables code, and the backwards compatibility code. Moreover, it inspired me to take a detour and start hacking up my `ipt_unclean' module for iptables which matches on suspicious packets (eg. ping of death, weird fragments, etc). The tests for the short packets case went into the testsuite, which is slowly increasing in scope.
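A rough idea of the kind of test such a module might make, as a userspace sketch only. It assumes a raw IPv4 packet in a byte buffer and checks two of the cases mentioned: packets shorter than their header claims, and fragments whose reassembled size would exceed the 65535-byte IPv4 limit (the ping-of-death case). toy_check_ipv4() and the verdict names are hypothetical and are not the ipt_unclean code or interface.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical verdicts for this sketch. */
enum toy_unclean { TOY_OK = 0, TOY_TRUNCATED, TOY_OVERSIZE_FRAGMENT };

/* Inspect a raw IPv4 packet of `len` bytes as received. */
enum toy_unclean toy_check_ipv4(const uint8_t *pkt, size_t len)
{
    size_t ihl, tot_len, frag_off;

    if (len < 20)
        return TOY_TRUNCATED;              /* no room for a minimal header */

    ihl = (size_t)(pkt[0] & 0x0F) * 4;     /* header length in bytes */
    tot_len = ((size_t)pkt[2] << 8) | pkt[3];
    /* Low 13 bits of bytes 6-7, counted in units of 8 bytes. */
    frag_off = ((((size_t)pkt[6] & 0x1F) << 8) | pkt[7]) * 8;

    /* Header or total length inconsistent with what actually arrived. */
    if (ihl < 20 || ihl > len || tot_len < ihl || tot_len > len)
        return TOY_TRUNCATED;

    /* Data would end past the largest legal datagram once reassembled. */
    if (frag_off + tot_len > 65535)
        return TOY_OVERSIZE_FRAGMENT;

    return TOY_OK;
}
```

Working on bytes rather than a header struct keeps the sketch free of platform headers and byte-order helpers; a real kernel module would use the kernel's own header structure and skb accessors.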

Monday July 26 1999

Back from Canberra; I'm really looking forward to my trip now. Last night made a large amount of progress, and fixed a stupid netfilter typo. Today I'm working on state generation counting for 0.1.4: that and masquerading taking down established connections when the interface goes down. Should release tonight if I'm lucky.

Lost a lot of work when 2.3.11 screwed my disks over. Fortunately, I had backed up a couple of hours before, but any file I had touched during that time was deleted by fsck. Needless to say, I don't trust the 2.3 series as far as I can throw them, and 2.2 is bad at the moment: I'm building a 2.2.5 (last known kernel which didn't corrupt file systems).

This development is very disturbing to me. I'm used to trusting the Linux kernel implicitly, and fs corruption makes me feel like someone who was hit by an earthquake and never views the ground quite the same again.

It also makes me want to look deeper into the port of Linux to userspace. A kernel in a window is a nice idea, and probably worth investigating. And no, I won't use VMWare; if kernel debugging can't be done without a proprietary product, what's the point?

Friday July 23 1999

Tomorrow morning (6:05am!) Ace and I fly to Canberra to check it out. Ace isn't looking forward to it, since it heralds the extended visit to Canberra in a few months. I'm looking forward to it, since it means working with other hackers again.

One of the good things about writing a test suite is that you actually find some of the stupid mistakes. Too bad I kept working on the test suite after 0.1.3 was released. Hence 0.1.3.1 tonight, to fix the two most glaring errors.

Wednesday July 21 1999

Hi again. My old laptop hard drive just died (I wanted to pull off the old ipchains test suite for iptables, and backwards compatibility). It was the one which stopped running except on batteries.

Laptops suck.

Hence my insistence on a >2 year warranty on the new one. Looks like I'll be picking up my Dad's old G3 powerbook, which still has 2 years left on the clock, and the price is right ($2000 US).

Working on running the test suite I wrote this afternoon. It's tough when it crashes your machine. Looking through kern.log, found part of the Debian package list at offset 241665. I'd just screwed my kernel over, so maybe it's just a glitch, but with Linus's warning about fs corruption, it scared the fuck out of me. Booted back to 2.2.10 for the moment to recompile and backup.

Thursday July 15 1999

Long time no write. Very little actual work has been going on over the last two weeks (I'll not be billing WatchGuard for them, in fact). I've been doing the Conference of Australian Linux Users. Everyone liked it a lot; of course, they didn't have to organize it. Looks like it will happen again, but I will only be peripherally involved. Tomorrow I'll actually invoice Linux Australia for all the expenses, and redo the budget to reflect (at least, approximately) the final tallies. This should be useful for future conferences.

The conference has left me physically and emotionally exhausted. By the end it was all I could do in some cases to be civil; I didn't accompany the others who went to Sydney. I took a day off and read Cryptonomicon; I am doing netfilter debugging in slow mode as well. Looks like I'm going to Augsburg in September for Linux Kongress again.

The big news is that I am planning to move to Canberra in September to work with Tridgell, Paul Mackerras et al. for 3-6 months. This will be superb; I can finally set up my test network like I want, and test on other machines, etc.

At the conference Dave Miller explained the new locking strategy in the network code. Now I understand what netfilter needs to do; I'm going to need the sk reference counts implemented to fix things properly though (I can hack around it for the moment). Writing the Kernel Locking HOWTO is on my TODO list.

Much other hackery occurred: I now understand rsync, and think Tridge is a God (imagine, a useful PhD subject!). devfs is the Right Way (or very close approximation), although I want the naming thing formalized a little. I want to be a founding member of maddog's School of Microcomputing and Microbrewing.

Uncrashing netfilter code. Yummy.

Wednesday June 30 1999

Andi Kleen had some useful comments on my kernel patches; I worked on them tonight (it's 5am now). Also sent out the conference `here are your tutorial assignments & details' EMails. Better late than never.

There seem to be a number of developments with other people doing packet-mangling stuff; a 2.0.36 version of FreeBSD's direct sockets, a double-masquerading (effectively masq+portforward) patch, and some work on masquerading speedups. We need to get netfilter in soon, so people can use it as a base; of course I want Andi to be happy with it.

I've been increasingly dropping EMails on the floor; netfilter stuff and conference work take priority, and everything else a distant second. My usually well-formatted and verbose style has become more terse under the pressure. Those who know me realize I'm busy, but I feel sorry for people who send me ipchains questions; I have no choice but to respond, but my recent responses have been less-than-helpful in some cases. Fortunately, the mailing list seems self-sustaining at the moment (thank God for those guys).

I'm not expecting much sleep in the next couple of weeks. And as soon as the conference is over, I'm back to netfilter coding; God I'm looking forward to it. Tonight was almost all netfilter; last night was too (rewrote the Makefiles to be non-recursive based on a discussion with Andrew Tridgell long ago, and they are sweet). Tridge was trying to convince me to use VMWare for debugging, but I don't want to be reliant on proprietary tools again. I figure VMWare only have about 18 months before a free version comes out anyway; I can wait.

Monday June 28 1999

Phew. Who would have expected that a day which started with a meeting with my accountant would turn out to be productive? Finally (after much Coke, coffee and chocolate) have FTP NAT working. Preparing snapshot now (compiling new kernel).

This is the version I want to merge with DaveM if no problems appear in the next few days. I found a few bugs in my code, and did a couple of cleanups in the stuff I'm sending Dave. I know this is going to hit me at the same time the conference does, but I'll let Dave decide.

Sunday June 27 1999

Wasted a week on shit jobs. I am really pissed off; I don't think Ace had seen me really frustrated before. Gargle won't upgrade to the latest unstable Debian. Binaries compiled on my (unstable Debian) machine won't run on gargle; I'm guessing this is glibc 2.1 bullshit. Static binaries behave funny. Three Debian installs on gargle, and an upgrade to 16MB of RAM. My modem card will run with my ethernet card, but only at about 1.5k/second (unsolvable interrupt problem, AFAICT; why don't serial interrupts show up in /proc/interrupts?).

So I've made a full backup (at least that now works again!) and I'm having to run 2.3.8 on my production machine. I'm resolved to get a test network, but not here; this episode has taught me that I need the company of fellow hackers. I'm pursuing a couple of different options at the moment, and as soon as the conference is over, that'll get my full attention.

My patience has run out.

Tuesday June 22 1999

Dammit. I wanted to test that damn code and release a snapshot today. But no luck. 2.3.6 doesn't boot on my machine, and I decided that with all the warnings about 2.3.7, I'd best use a scratch monkey. Hence Ace and I continued working on gargle. He's now a Debian machine, with a PPP connection to kevin (my old, doesn't-run-on-batteries laptop), which has the modem card in it, and an ethernet connection to ketchup, my broken-screen laptop.

The ethernet card and modem card don't both work at the same time. I have only one monitor, which needs to be flipped between gargle and ketchup. Gargle has 4MB of RAM, and the serial line connecting it to kevin seems to create enough noise to blow TCP performance to the Internet to shreds. I need a serious fucking test network; unfortunately test networks aren't mobile, and I am. That's gonna have to change.

Just trying to get the package list so I can install netbase (and hence have remote login capability to gargle) has taken me over two hours. Setting up machines is such a PITA.

Monday June 21 1999

I've been slack updating my diary, but it's been damn busy. I foolishly agreed to help out with a major local Linux problem which is having a Cyclades problem. I was pretty convinced I'd found it on Friday, but several hours there late this afternoon and this evening showed I was wrong: it still crashes even after my `fix'.

So now I'm going to have to go through the driver with a fine-tooth comb, seeking something that's not under a lock (the card simply stops responding after a while).

Conference is coming along nicely. Spent the weekend on my tutorial presentation with Michael; he's proofreading it today. Decided it was worth converting to LyX for the workbooks, since it looks so much nicer than printed HTML.

Ace is helping me set up my 386 box; it has only 4MB, so installing Debian or Redhat was out (we also tried Slackware). Smalllinux worked though; tomorrow I'll get her to put in the NE2000 network card, and start the slow climb of building it into a Debian box. She called the box `gargle', in keeping with my network theme of joke punchlines (`kevin', `hambush' and `ketchup' are the others).

Monday June 14 1999

It's a public holiday here today (Queen's Birthday). Another snapshot out tonight. This one finally has masquerading in the compatibility layer; I had to back down on my first approach, which slowed progress. My idea was to create a file in the state/ and NAT/ subdirs which had all the routines required by the compatibility layer. I quickly realized that that was almost all of them, so I changed tack to have a separate file for the routines which weren't shared, and that was much nicer. Just got the masquerading to work (it doesn't do protocol-specific masquerading, but I'll probably just use standard NAT modules rather than building the infrastructure for using the old masq modules).

Booked my tickets for the August trip to LinuxWorld, with Ace. She and I had much fun using travelocity to find reasonable fares. Bought Ace a copy of `Learning the UNIX Operating System', an O'Reilly book; if she makes it through that I'll get her Linux in a Nutshell. She's playing on my old laptop (the one that doesn't run on batteries anymore).

Bought her a Furby. Don't worry, I'll balance it all out by getting her a Palm Pilot later. Really.

Thursday June 10 1999

Sometimes if I don't write in my diary for a while it means I've simply been slack, and other times it means I've been really busy. This time, it's really busy. netfilter v 0.1.0 (aka "The Phantom Maintenance") was released yesterday, and actually announced on freshmeat. I've been working on better backwards compatibility: it's not as hard as I feared (REDIRECT was easy, MASQ means ripping out bits of my NAT later, but that doesn't seem too hard either). I was tempted to call this release "donkeyfucker" in resistance to the unenforceable new net censorship laws we have here in Australia, but in the end common sense ruled.

It sometimes seems to me that all the other kernel guys have mastery of a large number of areas of the kernel, and I have this tiny bit. I think I need to attack more areas, especially SMP and locking issues. Dave Miller shocked me by reworking the interaction with NFS and the page cache: it's really very little to do with his home ground of networking per se, but it's a major rework which appeared inside a week. I wish I could grab some area of the kernel and rework it in days, not months. I think I'm too used to working with people I'm much more experienced than; the Linux world is much more competitive, and I'd like to work closer with some gurus to hone my skills which plateaued here on the end of the earth.

So, after netfilter, I think I'm going to find a more collaborative Free Software project to tackle; I don't really care what it is, as long as there are top people and the project is interesting. Rusty's Fucked-up Network Protocol might be just the ticket.

Monday June 6 1999

Fairly heavy weekend, and long day today, but the new NAT (built on the new state tracking module) now compiles. Once it passes basic testing, I'll upload a new snapshot for people to hack on and find bugs. People are going to complain about me taking away device matching, but it will return later (it's quite tricky in the new model, and I wanted to get this finished fast).

I'm looking forward to getting this finished, then it's ftp data mangling (shouldn't be too hard). Then on to writing the compatibility layer, then we'll be ready for the masses; I'll take a weekend off for a change, and then come back and code audit, probably some minor cleanups, and rewriting the HOWTO.

Wrote another column for Linux Magazine; this one is pretty cool. That means I don't have to do it in the middle of conference organization, so it's out of the way already. A friend of mine from Canberra called about tickets in August (LinuxWorld and a small tour of the US with my SO and misc. others) so I'm chasing that up as well. On the topic of conferences, registrations are trickling in for CALU in July; looks like we'll hit about 200 or so.

Tentative August itinerary:

Friday 6th - Sunday 8th: Disneyland, Anaheim, CA (again).
Sunday 8th (evening) - Friday 13th: LinuxWorld, San Jose CA.
Friday 13th (morning) - Sunday 15th: San Francisco CA.
Sunday 15th (evening) - Tuesday 17th: Las Vegas.
Tuesday 17th (evening) - Sunday 22nd: Denver CO... drive... (probably) Winnipeg Canada.
Sunday 22nd - Tuesday 24th: Train from Winnipeg to Vancouver
Wednesday 25th - Friday 27th: Seattle, visit WatchGuard.
Friday 27th - Sunday 29th: Seattle back to Adelaide (lose a day).

Thursday June 3 1999

StarWars I opened here today. I saw it two weeks ago in NC (and dozed off in a couple of places; they gave me shit about that). I'll see it again this weekend with Ace; it's not so good that I can't wait.

Netscape was crashing for me on the appindex entry form, so I didn't get to enter the details for the ipchains 1.3.9 release. Before I got around to upgrading Netscape though, someone (someone I don't know) beat me to it; that's really cool!

Got state tracking to work, without crashing. Expect a snapshot tonight (with ftp tracking). I'm going to leave the addition of an iptables module for state tracking to someone else; it's not that hard, and it's a good project for someone.

At Linux Expo DaveM asked how Juanjo Ciarlante (who has been doing a lot of 2.1 masq work) felt about me replacing the masq stuff with my NAT layer. I sent Juanjo a mail when I got back, and he's looking forward to hacking on netfilter, and likes my HOWTO!

You have to be careful not to trash someone's project; everyone knows masq needs a rewrite, but it can still be a wrenching feeling to have your code taken out. Of course, much of my code was inspired by the masquerading code and BSD's ipfilter anyway, so it lives on. I'm looking forward to Juanjo hacking on netfilter; he's got RL experience which will be a great contribution.

Monday May 31 1999

New netfilter snapshot this morning. It requires a patch if you don't compile with CONFIG_NETFILTER_DEBUG (found by Michael Hasenstein, a person I really want to meet sometime):

--- linux-netfilter/net/core/netfilter.c.~6~	Sun May 30 12:22:22 1999
+++ linux-netfilter/net/core/netfilter.c	Mon May 31 21:21:26 1999
@@ -493,6 +493,7 @@
 		printk("Crap bits: 0x%04X", nf_debug);
 	printk("\n");
 }
+#endif /* CONFIG_NETFILTER_DEBUG */
 
 /* One semaphore for all of them. */
 DECLARE_MUTEX(modreg_sem);
@@ -615,4 +616,3 @@
 {
 	return modreg_find(headaddr, name, name_cmpfn);
 }
-#endif /* CONFIG_NETFILTER_DEBUG */

Been working on connection tracking. Cut less than a thousand lines today; should have been higher. Still, I'm quite happy with it. One advantage of separating connection tracking from NAT is that it's going to be fairly easy to test.

Posted my minmax.h patch to linux-kernel. Linus may be stubborn, but I really think it's for the best. I'll probably get some flames, and Linus'll probably just drop it on the floor. Oh well.

Need to sleep; this stuff is not going to be solved by a single allnighter, so it's best to keep hacking away at it. Did some conference stuff today too; it's going to be really cool (in retrospect).

Sunday May 30 1999

Next two weeks are going to be hell. Just compiling up the final cut of the netfilter patch to send off to DaveM. A new netfilter snapshot release; hopefully I've fixed the ICMP panic people reported. Then I've dedicated the next two weeks solid to three things:

The NAT rewrite: separating connection tracking from NAT, and implementing FTP tracking/NAT.
The compatibility layer: so people can use ipchains and ipfwadm.
The test suite for iptables, to ensure that it works as advertized.

The clock is ticking: once this goes into the devel kernel, I'll probably get a flood of problem reports, especially on SMP. No sleep expected.

Wednesday May 26 1999

It's the little things that get me. Like my laptop screen cracking. I went into maintenance-only mode for a while (ie. just doing EMail support); using my other laptop (dodgy screen, bad keyboard, doesn't run off battery) over PLIP and remote displaying everything. Ordered a 15in monitor (I have a keyboard, had to buy a serial mouse), and am only now back to normal: my laptop is now a desktop.

Andrew Tridgell called, and we talked about many things; congratulated him on his impending new job, discussed finance. Andrew is organizing a minibus from Canberra. Andrew should have been the one to organize the conference, except he's only just recently come out of his thesis-induced hole, and if he'd done it the conference would have been in Canberra. He did make me think about other ways of getting finance for the conference, in particular, advertizing space. I've been pursuing these in parallel.

Worked on netfilter some more; two changes I had been resisting became necessary. Firstly, hooks now have a priority, so we can ensure that local NAT occurs before packet filtering, preserving their independence. Secondly, a hook can return NF_STOLEN, to indicate that it has taken control of the skbuff. This is required to efficiently support ipfilter's "fastroute" option, which queues the skb. I disagree with the ipfilter "all-in-one" approach, but it is a valid use, and I am not going to dictate it by designing limitations into the netfilter infrastructure.
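The two changes can be sketched as a userspace toy. This is not the netfilter code: only the name NF_STOLEN comes from the entry above, and everything else is an assumption made for illustration (the toy_ names, the "lower priority value runs first" convention, the fixed-size hook table, and the trace array used to show ordering).

```c
#include <stddef.h>

/* Verdicts for the toy; names and values are assumptions. */
enum toy_verdict { TOY_NF_DROP, TOY_NF_ACCEPT, TOY_NF_STOLEN };

struct toy_pkt { int id; };
typedef enum toy_verdict (*toy_hookfn)(struct toy_pkt *pkt);

struct toy_hook {
    toy_hookfn fn;
    int priority;               /* assumed: lower value runs earlier */
};

#define TOY_MAX_HOOKS 8
static struct toy_hook toy_hooks[TOY_MAX_HOOKS];
static size_t toy_nhooks;

/* Register a hook, keeping the table sorted by ascending priority. */
int toy_register_hook(toy_hookfn fn, int priority)
{
    size_t i = toy_nhooks;

    if (toy_nhooks == TOY_MAX_HOOKS)
        return -1;
    while (i > 0 && toy_hooks[i - 1].priority > priority) {
        toy_hooks[i] = toy_hooks[i - 1];    /* shift later hooks up */
        i--;
    }
    toy_hooks[i].fn = fn;
    toy_hooks[i].priority = priority;
    toy_nhooks++;
    return 0;
}

/* Run hooks in priority order; any verdict but ACCEPT ends traversal.
 * On STOLEN the hook owns the packet and the caller must not touch it. */
enum toy_verdict toy_run_hooks(struct toy_pkt *pkt)
{
    size_t i;

    for (i = 0; i < toy_nhooks; i++) {
        enum toy_verdict v = toy_hooks[i].fn(pkt);
        if (v != TOY_NF_ACCEPT)
            return v;
    }
    return TOY_NF_ACCEPT;
}

/* Example hooks that record the order in which they were called. */
int toy_trace[16];
size_t toy_ntrace;

static void toy_note(int who)
{
    if (toy_ntrace < sizeof toy_trace / sizeof toy_trace[0])
        toy_trace[toy_ntrace++] = who;
}

enum toy_verdict toy_nat(struct toy_pkt *p)       { (void)p; toy_note(1); return TOY_NF_ACCEPT; }
enum toy_verdict toy_filter(struct toy_pkt *p)    { (void)p; toy_note(2); return TOY_NF_ACCEPT; }
enum toy_verdict toy_fastroute(struct toy_pkt *p) { (void)p; toy_note(3); return TOY_NF_STOLEN; }
```

Registering toy_filter at priority 0 and toy_nat at -100 makes NAT run first regardless of registration order. Adding toy_fastroute at -50 ends traversal before the filter ever sees the packet, which is the point of a "stolen" verdict: the infrastructure stops and leaves the buffer with whoever took it.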

Had a big win when Ronald Kuetemeier said he implemented SAMBA failover using netfilter's NAT layer in a couple of hours. He hit some panics though, so I'll be hunting those down today.

Saturday May 22 1999

OK. Wrapping up here at Linux Expo. I'm not going to give a blow-by-blow report, but I'll talk about the things that actually got done.

DaveM took all the locking code out of the network stack, and benched it (until, obviously, it crashed). Twice as fast (I'm guessing it was a big SMP machine). Hence, he is itching to do away with net_bh, and packet queues for each CPU. I'm going to merge into his tree before that, which means in the next couple of weeks.

He did tell me that my ipchains code and the TCP stack were the only parts that didn't need fixing for the new locking.

Talked to Alan Cox about locking in the firewall code; he says it doesn't really get any better than 1 read semaphore on traversal. Larry McVoy has said a few times that Irix got so many locks that the overhead of grabbing dozens of them made performance suck (and I'd be worried about deadlocks).

Wensong Zhang didn't make it from China; the US wouldn't give him a visa due to the embassy bombing fiasco. Larry spent a while on the phone to the US embassy in Beijing; no luck. It sucks, because I wanted to talk about his stuff on top of netfilter.

Staying with Raster, we discussed a whole heap of stuff. Future developments in E, CPU usage, etc. The Linux Magazine guys paid me for my articles. I got my copy of Open Sources off Chris DiBona, finally. Discussed the LSB with Dan Quinlan: he hopes for something serious by end of year. Met one of my most prominent users and related-project developers, Bill Stearns, and I walked through the problems of ipfwadm-to-ipchains conversion; he's the author of ipfwadm2ipchains. Some mods are already forthcoming.

Spoke briefly with Werner Almesberger after his excellent talk on Traffic Queuing under Linux; I'd read some of the code briefly, but didn't really have a good understanding of its flexibility, which I do now. He told me that the u32 classifier is faster than ipchains; not a huge shock, but definitely something I'll be looking at carefully.

Speaking of Alexey's code, he admitted I was right recently (he suggested the ability to mark interfaces with a number; I said the ability to rename interfaces and use interface wildcards was better). Makes me think that my work isn't a complete waste of time; after Werner's talk, I've even more respect for Alexey's coding ability...

Finally met Paul Mackerras, PPP guy, and as well as telling him to come to CALU, told him about renaming interfaces under Linux 2.2; I'll send him that patch for pppd if I can find it...

Looks like there really is going to be a Linux Developer con happening. The idea has been expressed by many people, and it looks like Larry McV and Victor Y might actually get it off the ground. I am also looking at The Bazaar, in December; but if I do Ottawa as well, that's 6 conferences this year; three above my limit.

Tuesday May 18 1999

Sitting in the Admiral Lounge in LAX. It doesn't have a shower, which for me negates the point of having an airport lounge. I've been remiss in my diary entries for the last week.

Conference registrations are starting to trickle in; each one offers assurance that I'm not going to go bankrupt. This makes life easier for me.

I released v1.1.2 of ipchains-scripts, finally. The ipchains-1.3.9 awaits only the new Quick Reference Card (I just sent a reminder to Scott; he promised it this week). The new HOWTO only awaits ipchains 1.3.9.

I've been having a really interesting dialogue with my BSD counterpart, Darren Reed. Actually, I'm flattering myself, since his ipfilter does more than ipchains by a long shot, is older, more mature, and cross-platform. I've shared a few concerns and questions from skimming his source, and we've traded problem reports where there may be overlaps. It's been really instructive on a number of fronts, mainly for NAT implementation issues (see, Darren has real, live users, something I currently lack for netfilter).

This trip is going to be hell; in my three-and-a-half days I have to catch up with my old friend Chris Yeoh, who now lives in Denver, schmooze with the LinuxWorld organizers Natalie and Kathy (who run a real conference, and have been really good telling me how it's supposed to be done), talk to the PowerPC guys about that laptop for development, give Raster the six-pack of vintage beer for letting me stay with him, give hemos the nice bottle of wine I brought (free slashdot ads), and Larry McVoy the other nice bottle (Michael and I stayed with him in SF in March), catch up with Wensong Zhang to talk about the virtual-server project, catch up with the Linux Magazine editors to get my payment, maddog Hall to discuss conference airfares and US date format, and hopefully have time to see Alan Cox again and drag him to my netfilter WIP to get his criticism. Hence I'll be restrained in my drinking this trip. No, really.

Then, hopefully, I'll get a respite to do some actual coding before getting swamped by the Australian Conference. Argh.

Wednesday May 12 1999

Woohoo. Released another snapshot. My data mangling stuff doesn't work, but that's a userspace problem: the kernel is pumping packets through it fine.

Some people asked me about my routine. Well, I get up (around 11am), dial up my ISP, tell my laptop to upgrade to the latest Debian (I love apt), grab news and mail, then shut down the laptop and head into town (bus or walk, it's only 25 mins) for coffee. Over coffee I read my mail and reply to it. My batteries last for about 90 minutes, so sometimes I get to do some hacking in that time as well, but usually it is all spent on ipchains support.

Then I usually play something at the arcade (currently Gauntlet Legends, what a money pit), and head back home. If there's anything urgent, I connect again to let my mail out, otherwise I start hacking. The serious work doesn't usually start until around 7pm, after I've eaten and settled down for some serious hacking. Around 2/3am, I hit the sack and repeat.

Not very exciting, but it works for me.

Monday May 10 1999

Ick. I'm upset that last week was so slow, codingwise. Between Mother's Day and conference stuff last week, I haven't been earning my keep. And it's not going to get any better next week, with LinuxExpo.

At least I got the IPX packet filtering stuff off to Jay; not tested, but it compiles. I was planning on spending tonight on data mangling, but issues with iptables got in the way, and I ended up fixing some icky bugs with TCP.

Snapshot tomorrow: it's been too long, and now I've got some iptables fixes which need to go in; since Jerome and Herve are actively working on that stuff, we need to keep in sync.

Keeping an online diary is weird. I get EMail (and off-the-cuff comments: thanks Jerome; I'll have to introduce you to Meryki) from people about it. I'm tempted to take down the link from the front page. Still, it's my space to rant, and I hope no one takes it too seriously. If it keeps my ranting off linux-kernel, that can only be a good thing (Alex Buell, are you listening?).

Sunday May 9 1999

My first real cut of TCP data mangling compiles. Now I have to run it to watch it explode. Thank God it's in userspace, where that doesn't mean a reboot (unless I find netfilter bugs).

I ended up stripping SACK permission and window scale options from the initial SYN. I'm not going to rewrite SACK options, and I don't want to allocate huge buffers for giant windows, so this seemed the easiest path. Not exactly non-intrusive, but we're violating so many boundaries with this stuff anyway, that I don't think it matters.
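For illustration only, here is a rough Python sketch of what stripping those two options from a SYN's option block could look like. It is not the netfilter code: overwriting the options with NOPs (rather than shortening the header) is an assumption made here to keep the packet length unchanged, and the function name is hypothetical. The option kind numbers are the standard ones (window scale is kind 3, SACK-permitted is kind 4).

```python
# TCP option kinds (standard values).
TCPOPT_EOL = 0
TCPOPT_NOP = 1
TCPOPT_WINDOW_SCALE = 3
TCPOPT_SACK_PERMITTED = 4

# Options this sketch removes from the initial SYN.
STRIPPED = {TCPOPT_WINDOW_SCALE, TCPOPT_SACK_PERMITTED}


def strip_syn_options(options: bytes) -> bytes:
    """Return a copy of a TCP options block with the window scale and
    SACK-permitted options overwritten by NOPs (hypothetical helper;
    the block keeps its original length)."""
    out = bytearray(options)
    i = 0
    while i < len(out):
        kind = out[i]
        if kind == TCPOPT_EOL:
            break                      # end of option list
        if kind == TCPOPT_NOP:
            i += 1                     # single-byte padding option
            continue
        # Every other option is laid out as: kind, length, data...
        if i + 1 >= len(out):
            break                      # truncated option: leave the rest alone
        length = out[i + 1]
        if length < 2 or i + length > len(out):
            break                      # malformed option: leave the rest alone
        if kind in STRIPPED:
            out[i:i + length] = bytes([TCPOPT_NOP]) * length
        i += length
    return bytes(out)


# MSS 1460, NOP + window scale 7, SACK-permitted + two NOPs of padding.
syn_options = bytes([2, 4, 0x05, 0xB4, 1, 3, 3, 7, 4, 2, 1, 1])
stripped = strip_syn_options(syn_options)
```

Any real implementation would also have to fix up the TCP checksum after touching the header.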

My current implementation is a sledgehammer, and it will be *slow*. There are several worthy optimizations which I've avoided until I get it working, then I'll look at speed. My ISP is having issues at the moment, so I've been unable to grab mail. Hope nothing important has happened.

Wednesday May 5 1999

TCP data mangling is tough. There's a good reason why no one has done it well before. Replacing arbitrary patterns on the fly is painful; at least I'm not trying to make it efficient.

Let's step back a bit: why do we want to replace data inside a packet as it flies past?

FTP. It puts the address of where to connect the data backchannel to in the data stream. We have to find out what this is, and replace it. Due to the format, this may involve changing the length of the packet.
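To make the length problem concrete, here is a small Python sketch (an illustration, not the masquerading code) that rewrites an FTP PORT command sitting wholly inside one payload and reports how much the payload grew or shrank. The function name and the example addresses are made up for this sketch; the command format (four address bytes and two port bytes as comma-separated decimals) is the standard FTP one.

```python
import re

# "PORT h1,h2,h3,h4,p1,p2" followed by CRLF, as sent on the FTP control channel.
PORT_RE = re.compile(
    rb"PORT (\d{1,3}),(\d{1,3}),(\d{1,3}),(\d{1,3}),(\d{1,3}),(\d{1,3})\r\n"
)


def rewrite_port_command(payload: bytes, new_ip: str, new_port: int):
    """Replace the first PORT command in payload with one naming
    new_ip:new_port. Returns (new_payload, length_delta).
    Hypothetical helper: assumes the whole command is in this payload."""
    octets = new_ip.split(".")
    replacement = "PORT {},{},{},{},{},{}\r\n".format(
        *octets, new_port >> 8, new_port & 0xFF
    ).encode("ascii")
    # A function is used as the replacement so no escapes are interpreted.
    new_payload = PORT_RE.sub(lambda match: replacement, payload, count=1)
    return new_payload, len(new_payload) - len(payload)


old = b"PORT 10,0,0,5,4,1\r\n"          # internal host 10.0.0.5, port 1025
new, delta = rewrite_port_command(old, "203.0.113.17", 61000)
```

Here the rewritten command is 7 bytes longer than the original, so every later sequence number on that connection would need adjusting by the same amount in one direction (and the acknowledgements in the other).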

There's a hack in the current masquerading code, but it assumes that the command is in a single packet (it isn't always). I wanted to solve a more generic problem: replacing a pattern which is less than the size of one packet, with something else. This could give spectacular side effects: imagine your Linux router substituting "idiot" for "boss" on TCP streams going out from the research network, and "boss" for "idiot" on the way back.

Nobody ever suspects the router...

Anyway, turns out that this problem is hard. What if there is more than one replacement in the packet? What if it's a fragment? What if doing the replacement(s) causes a packet to exceed the MTU of the link? Or the MSS of the receiver? What about out-of-order packets, or partial matches?

So this project turns out to be bigger than I originally intended. For the FTP case, you can probably just drop all partially matching packets, and hope they'll be coalesced on retransmission. For the generic case, we have to get tricky... thank God this is all in userspace.

This is my current obsession, and I'll know I've succeeded when I can ftp large files full of matches through my Linux box, and get the same results as `sed', even with deliberately induced packet losses.
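The partial-match part of the problem can be sketched in a few lines of Python. This is only an illustration of the idea (hold back any tail that might be the start of a match until more data arrives), not the userspace code itself: the class name is hypothetical, and it assumes the chunks arrive in order, so it says nothing about retransmissions, reordering, MTU or MSS.

```python
class StreamReplacer:
    """Replace a fixed byte pattern in a stream delivered in chunks,
    even when a match straddles a chunk boundary (hypothetical sketch)."""

    def __init__(self, pattern: bytes, replacement: bytes):
        if not pattern:
            raise ValueError("pattern must not be empty")
        self.pattern = pattern
        self.replacement = replacement
        self.pending = b""   # held-back tail that may begin a match

    def feed(self, chunk: bytes) -> bytes:
        """Consume one in-order chunk and return the bytes safe to emit."""
        data = self.pending + chunk
        out = bytearray()
        pos = 0
        while True:
            hit = data.find(self.pattern, pos)
            if hit < 0:
                break
            out += data[pos:hit] + self.replacement
            pos = hit + len(self.pattern)
        rest = data[pos:]
        # Hold back the longest suffix of rest that is a proper prefix
        # of the pattern: it could become a match with the next chunk.
        keep = 0
        for n in range(min(len(self.pattern) - 1, len(rest)), 0, -1):
            if rest.endswith(self.pattern[:n]):
                keep = n
                break
        out += rest[:len(rest) - keep]
        self.pending = rest[len(rest) - keep:]
        return bytes(out)

    def flush(self) -> bytes:
        """End of stream: emit whatever was held back."""
        tail, self.pending = self.pending, b""
        return tail


def replace_in_chunks(data: bytes, size: int) -> bytes:
    """Push data through a StreamReplacer in chunks of the given size."""
    replacer = StreamReplacer(b"boss", b"idiot")
    pieces = [replacer.feed(data[i:i + size]) for i in range(0, len(data), size)]
    return b"".join(pieces) + replacer.flush()
```

Whatever the chunk size, the output should equal a whole-stream replacement, which is the same yardstick as comparing against `sed' on the transferred file.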

Tuesday May 4 1999

Wrote my new Linux Magazine column (TCP stuff), sent off invoices for conference sponsors and another one to WatchGuard. Usually I leave invoicing until I need money, but I have to be organized for the con.

I can't put it off any longer; Jay Schulist is probably out there hunting me now. IPX firewalling compiles (both kernel and userspace tools). Neither tested, nor neat, but I'm diffing up a 2.2.7 patch to appease him now.

Happy thoughts of packet data mangling are wandering through my brain. What if the new packet exceeds MSS? Or MTU? I think we have to chop to length. Should work.

Monday May 3 1999

Mainly conference work today, and some netfilter development correspondence. Created my first `web button' with the Gimp (also made a banner ad which is too large <SIGH>):

[Image: The Limit of Rusty's GIMP skills]

Sunday May 2 1999

A slow day yesterday. Without my modem, I was offline until I got back home Sunday night. Saw a movie, hangover gone by the time it finished. Watched `This is Spinal Tap', which Chris had on video, and which I hadn't seen for a couple of years.

Today, got a call from Darren and Petrina; two of my Canberran friends, who were in Sydney for the weekend. They knew I was there too, since they read it here. Cool. Had lunch; Amex still worked.

Just before I flew out, met up with Meryki, one of the girls from the pub-crawl group. Well, who am I kidding; the only girl from that group. I figured it couldn't be a bad thing to spend a couple of hours in the company of a tall, leggy blonde, and we had fun. Told Ace all the details when she picked me up from the airport, so I had to be well-behaved, and I was.

Dedicating some time to conference organization, but I'm hoping to get my packet data substitution code working in the next couple of days. netfilter is still moving, though, due to iptables patches streaming in from Jerome de Vivie and Herve Eychenne.

Friday Apr 30 1999

How come, however much money I'm earning, my finances are always on the verge of becoming a train-wreck? Maybe I should invoice after work done, rather than waiting until a crisis to send out my invoices... American Express do not appreciate cheques bouncing.

That was the least of what happened over the last couple of days. I flew to Sydney on Thursday, and stayed with Chris Saunderson, an old Adelaide friend who escaped to Sydney. Nice to talk to someone dealing with serious networks, and Chris is really cool.

I got my coffee fix at Bambini's on Liverpool street; the place I learnt to drink short blacks three years ago. Friday lunch I had a meeting about the Conference with Grahame Kelly, Jamie Honan and Terry Dawson. It was really great; everyone saw eye-to-eye and I'm working on a number of ideas which came out of that. Talking with Terry about LDP stuff afterwards was really informative: the next version of my HOWTO will now be in docbook form.

Friday night was the SLUG meeting, and (as a disinterested observer), I acted as returning officer for the voting in of the committee. A little unexpected. I was really there to promote the conference, which I did, and many brochures were snapped up.

Then Horms (Simon Horman, ZipWorld guy who I met at LinuxWorld March) invited me along to a pub crawl. I returned to Chris's apartment at 5:37am (he had to wander down and let me in, since I didn't have a key). I knew he was going to be downloading the Quake III demo, so I figured he might still be up: no such luck, as his Voodoo I wasn't up to the task, apparently.

Wednesday Apr 28 1999

I've started keeping a ChangeLog of my netfilter stuff. With no large rewrites anticipated, this should allow me to make release notes easily: important since I'm going to be releasing more frequent snapshots, and more people are hacking on netfilter now.

After finishing the netfilter HOWTO, and doing some minor netfilter_dev tweaks, I went out to dinner (Zest) this evening. I needed to, because it's the only place I know of which sells Coopers Vintage Ale, and I need some of that for bribery at my upcoming stay with Chris Saunderson (Sydney this weekend) and Raster (North Carolina, LinuxExpo); both are Coopers drinkers. Not cheap, but neither are hotel rooms...

Tomorrow, I'll begin designing and writing my userspace content matching code. I'm not entirely sure how it's best approached; I'm going to need to think through some scenarios. FTP control channel mangling is particularly difficult.

BTW, taper sucks. It keeps core dumping. I think it's time for tar.

Tuesday Apr 27 1999

Releasing netfilter snapshots is getting easier; less breakage each time, and I'm not as worried about things exploding completely since I'm doing fewer fundamental changes now. Another snapshot tonight; waiting for PCMCIA stuff to compile for this new kernel so I can dial up and drop it off.

Documentation almost finished; then I can get back to the IPX firewalling I promised Jay, and some conference organization issues which need to be addressed this week. I want to have things well in hand before LinuxExpo, so I'm not stressing out on the plane.

Monday Apr 26 1999

Lazy weekend (long weekend here for ANZAC day). Upgraded the netfilter patch to 2.2.6, more work on the HOWTO; hope to have another release tonight. I went to the Adelaide Zoo yesterday and watched the fairy penguins being fed. Too bad I wasn't wearing my penguin T-shirt (I was wearing an older Linux T-shirt, and a waiter at the coffee shop I favor said to me `I figured you for a Linux user'. World domination, here we come...)

Thursday Apr 22 1999

Went to Richard Stallman's talk at Adelaide Uni today. I've heard bits of it before, but this was the first time I sat through the lot. Like Richard's writings, I found it lucid and appealing. I've always preferred Richard's approach to free software on ethical grounds to ESR's `free beer' approach.

Anyway, as I was writing documentation, I decided that I should rename all the references to `bind's to `rule's, and all `perconns' to `bindings'. A huge search and replace job, but it has the benefit of making the nomenclature match the draft NAT RFC, the netfilter HOWTO, and the user's perspective. It also happens to be more accurate.

Still writing documentation; the netfilter HOWTO. Now I'm on programmers' documentation. I have to finish by the end of the weekend, so I can release it and another snapshot. After the documentation is done, I expect more users, more bug reports, and maybe more patches and enhancements.

Wednesday Apr 21 1999

Given the number of times I've written an explanation of why I dropped `match by interface address' in ipchains (it was a feature in ipfwadm), I figured I'd write down my canonical answer here:

When I was implementing ipchains, I noticed that the kernel firewall code to match interfaces (the `-V' option) was broken in 2.1. It had been broken by someone who didn't understand all the issues who adapted it for the new interface/alias code.

When someone breaks a feature, you have to look at how fragile that feature was in the first place; will it break again? When the breakage was undiscovered for so long, you have to ask how many people actually use it anyway, and is it vital to those people?

Combined with the fact that I could drop a whole heap of code (in particular, notifiers for devices going up and down), avoid a loop in the critical path of the packet filter code, and generally make things simpler, this convinced me to drop it. The `-V' option predated the `-W' option (match by interface name), so its existence made sense in the early days of ipfwadm, but now?

For a long time, no-one came up to me with a reason for wanting the `-V' option back, until an ISP system administrator came up with a convincing one. They assigned all their dialup PPP customers the same interface address, and used one set of rules for all of them. Thus I implemented what he really wanted: wildcard interface names (eg. `ppp+').
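As a quick illustration of the wildcard rule (a Python sketch, not the kernel code; the function name is made up here): a rule interface ending in `+' matches any interface whose name begins with the part before the `+', and anything else must match exactly.

```python
def iface_matches(rule_iface: str, packet_iface: str) -> bool:
    """Return True if a packet's interface satisfies a rule's interface
    specification, where a trailing '+' is a prefix wildcard
    (hypothetical helper for illustration)."""
    if rule_iface.endswith("+"):
        return packet_iface.startswith(rule_iface[:-1])
    return rule_iface == packet_iface


# One rule set covers every dialup line, whichever pppN the customer lands on.
dialup_ifaces = [name for name in ("ppp0", "ppp12", "eth0", "lo")
                 if iface_matches("ppp+", name)]
```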

It was another ISP who came up with the second fair reason. They had pre-configured rules for each interface address for their static-IP dialup customers. The normal authentication mechanisms took care of assigning the address to the interface, and thus ensured the correct filtering rules were used. The interface name depended on which line they dialled in on, which varied.

Thus the existence of SIOCSIFNAME in 2.2; you can actually alter the name of an interface. The idea was to add a pppd option to allow it to change the interface name to some name (depending on client), thus allowing filtering by interface name. It'd be pretty cool for an ISP to do an ifconfig and see a list of clients (eg. `bigcorp-ppp4' instead of `ppp4').

Tuesday Apr 20 1999

iptables is now mainly working; it needs a few thorough tests to fix it up some more. My backup issue has (for the moment) been resolved, and taper is fairly happy. I think I understand it better now; I found the error reports in the logs.

Tonight was the local Linux User Group meeting (LinuxSA) where I got to give out the glossy brochures for the first time. A number of people pointed out that we need a mail address for Linux Australia, where we can tell corporate types to send cheques.

Richard Stallman turned up at the meeting. Richard was focussed, as always, on freedom in software. I think he made a number of people think. I like Richard's mind, too bad his body doesn't bathe more often. Ace cringed when he picked his nose and ate it, but hey, I'm more easy-going than her (but then, I didn't see it, and wasn't eating pizza across from him at the time).

RMS is giving his usual talk on Thursday night. I'll probably go and drag Ace along; he's usually entertaining for the first hour.

Work on the netfilter HOWTO continues. Slowly.

Friday Apr 16 1999

iptables no longer crashes on insmod, nor takes down the machine. I can list rules, and insert new ones again. The userspace code is back to a working state; I just need to write the module to tell it how to handle standard rules again, then test the new extended modules (both kernel and userspace). Then I really need to get back to doco...

Some fun with backups: taper told me there were "5 errors" after it completed my backup. Didn't say what they were, and a Verify gave 7 errors and 5 warnings. After it segfaulted the first time, I'm not really ready to trust it that much.

Still, I'm giving it a go for a week, to see how well it does. At the end of that I'll try a full restore, and do a compare. It'd be nice if my modem and SCSI PCMCIA cards worked at the same time though (a bit much to ask, since my modem card is really dodgy, and doesn't even work with my ethernet card).

A PowerPC user with compilation problems sent me a mail: not much I can do about it. I sent Cort Dougan a plea; if they want PowerPC supported, they need to get a machine to me (ideally a G3 laptop).

While I was writing this, taper segfaulted again. Great. Maybe (just maybe) it's running out of RAM: I'll try upping the swap.

Thursday Apr 15 1999

Not much progress today, unfortunately. I'm busy debugging the new iptables code (it's currently crashing on insmod). That decided me that I finally had to get serious about backups.

Previously, I just made big tarballs on my laptop (and more recently, my 2GB Jaz drive), but (being manual) it's prone to error, and doesn't cover much of my home dir (only the devel stuff, not my mail). So I installed taper a while back in the hope of getting to use it.

The documentation is a little long (I can sympathize with those who look at the ipchains HOWTO and say `You want me to read THAT?'), but after a bit of fumbling, I've figured it out. It's really quite cute. I guess like every coder I've tried my hand at homebrew backups (and had them replaced by others' homebrew backups), and I've used Solstice Backup (IIRC it's rebadged Legato Networker), which is a VERY nice backup utility. Any backup utility which gracefully recovers from kill-9'ing the various processes, and also has cool features like allowing the user to do their own restores (I would have liked a simpler front-end for the lusers though) is ubercool.

So, I can't do devel while I'm backing up, hence the blurb here. Early morning tomorrow (11am), hence the early night tonight.

Wednesday Apr 14 1999

End of another late night, but I promised myself I'd write here more regularly.

iptables made it down to text size ~3800 before creeping back up to 4004 bytes once all the FIXMEs were resolved. I'm not too unhappy with that; if modules discarded their initdata, it'd be even better, but I think that's planned for 2.3.

Work progresses on the iptables userspace tool (it's currently in pieces); separating out the various protocol handling in userspace as it is now done in the kernel. Gives me a chance to review some old cruft, and fully support some options (like arbitrary TCP flags detection, and TCP option detection).

My worry now is that testing this stuff is going to be so hard. It's going to have to be an exhaustive test, and those things take time to write (and test the testsuite). Meanwhile, this coding isn't writing my HOWTO any faster (really, after this, I'll get back to it. I promise).

I want to get the HOWTO finished well before Linux Expo next month, and at least two more snapshots under my belt. We'll see.

Tuesday Apr 13 1999

I'm supposed to be writing the netfilter HOWTO. I got up to the section on "iptables", and then decided to do that put-off iptables rewrite to make it more flexible.

The good news is that iptables has shrunk again; I've almost got it under 1 x86 page (although actually insmod'ing it into the kernel seems to add some weight). Under one page is the holy grail, but I'll settle for "smaller than ipfwadm", as long as it's also faster than ipfwadm (which, per-rule, is marginally faster than ipchains).

For the curious:

bash-2.02$ ls -l ip_tables.o
-rw-rw-r--   1 rusty    fwdev        8840 Apr 15 07:40 ip_tables.o

bash-2.02$ size ip_tables.o
   text	   data	    bss	    dec	    hex	filename
   4397	    620	      0	   5017	   1399	ip_tables.o

bash-2.02$ lsmod
Module                  Size  Used by
ip_tables               5880   0  (unused)

The ipchains mailing list seems broken for the moment: hope it's back by the time I return, because I spent (wasted) some time replying to a "what is the latest version of ipchains and where can I get it" question.

Back to putting iptables back together again tomorrow, then onward with the documentation. Hell, I might go all out and produce a web page.

Monday Apr 12 1999

Got the latest release out on Friday morning, and took a weekend off (ie. only support work, and conference organisation, no new coding). Now I'm doing the Netfilter HOWTO.

Spent much of today reworking my March LinuxWorld conference tutorial based on the feedback summary LinuxWorld sent us. I felt that the tutorial response was disappointing, mainly because techniques which work with 40 people (as attended our practice run at LinuxSA) don't scale to 200 people. And August is presumably going to be even bigger, so I'm taking the knife to the tutorial.

SuSE's Michael Hasenstein has been banging on my netfilter releases, with little joy it seems. I'm trying to get everything to work for him: he seems to be a man of much patience, given the number of bug reports he has sent me already, and that's exactly the kind of person you want to help you in the early stages.

Really, it works for me!(TM)

Wednesday Apr 7 1999

Some things never change. At the end of an all-nighter; netfilter is almost compiling again.

Hard coding is like combing long hair. You can't just run the comb through once; you have to do it repeatedly. While there are some knots which benefit from repeated short strokes, generally it's best to comb through the entire thing before repeating. And NAT is very hairy; I didn't even understand the problem when I started (fortunately, I knew it).

The week before last, I rewrote a major part of netfilter (basically the binding management). I've just rewritten the other part (the connection management), because by the time I finished the last cut, I realized that my nice, neat model was stretched out of shape by the addition of local packet handling.

At least I've been having fun along the way, and learnt a whole heap about NAT and networking in general. It'll be fun to compare notes with other implementors of this stuff; there are heaps of fun issues.

Well, it all compiles; userspace and all. 5:40am; I'm not even going to try to see what it will do when I insmod it and pump a packet through it. Debugging is tomorrow's task (I hate crashing my machine). I'll back up before going to bed.

Monday Apr 5 1999

Some time off for Easter, but not much: I worked Saturday, and am working now. Saturday night was really productive: I was almost tempted to make a new release, given the number of bugs I had fixed (it feels strangely good to stare at hex dumps of packets and figure that the checksum is wrong), but I'd set the mental milestone at getting local NAT working, so decided against it.

I was really happy to discover that Wensong Zhang (the virtual server project head, based in China) is going to be at Linux Expo, so I'll finally get to meet him. I'll just be flying there and back, and staying at Raster's if all goes according to plan. Must remember to buy beer to bribe him with...

Netfilter feels more mature now. This is the last major feature-add release, from here on it should be doco and bugfixing.

Thursday Apr 1 1999

Happy April Fools; unlike Linus, I'm not moving to Russia.

I did realize, today, that port allocation is less symmetrical than I thought. Consider these cases:

I want all packets to X sent to Y instead.
I want all packets from Y to look like they came from X.

In the first case, you can change the IP address, but you can't change the TCP (destination) port. In the second case, you can change the TCP (source) port. But in the case of ICMP ids, which are only meaningful to the sender, you can always mangle them.

There goes my pretty model. Oh well, it was months old anyway.

So I figure this asymmetry is a detail to be handled at the per-protocol level; I'll tell them what direction this was initiated in, and let them sort it out.

The other problem (discovered just before release, and hacked) is that there are cases where only per-protocol mapping is to be done, not IP mapping. I hacked in a special case (if the IP specification is the full range, it means "don't change it"), but it's not very neat.

Wednesday Mar 31 1999

Latish night last night debugging the new NAT stuff; looks like it crashes on the first TCP reply packet. Some memory overwriting bug; chasing phantoms until I gave up for the night. Didn't sleep well; never do when I go to bed with something unfinished.

Recompiled with serial-console; today I'll hook up the machines together so I can see the messages when it crashes, and maybe I can find a clue. It's almost certainly some stupid mistake; I found a couple already.

After my LinuxWorld talk, more people are asking about the netfilter stuff, so I need to get a new snapshot out ASAP. I hate network debugging; it's a huge hassle (three machines, two connected with ethernet, two with a parallel cable, one machine headless). The kernel debugging interface is also damn primitive for those of us used to source-level debuggers. Still, I don't get paid for my looks...

Still, I did manage one success last night; I set up and somewhat configured Enlightenment 0.15.4 for my SO. I think Ace will like it.

PS. Woohoo! It was a *&%!ing debugging printk trying to deref a NULL pointer (thanks, serial console!). Preparing the release now: looks good!

Tuesday Mar 30 1999

Waiting for the "so-experimental-it's-not-even-in-unstable" Debian packages of Enlightenment to download. I feel guilty for not trying it earlier, but it seems like Raster has more than enough bug reports without me.

Introduced my SO to Spellcast last night, after she proof-read my Linux Magazine column. One day in my copious free time, I'll have to do a Gnome-Spellcast rewrite. Don't busy-wait on that one...

At least I'm not alone in writing a LM column; Alan Cox writes one as well. Hope he gets paid more than I do. I ran out of inspiration so this one is just a list of various IP stack bugs (mainly fragment problems). I know Alan would write this column way better than me, but I can simply say I ran out of space when someone points out that I missed a major one. Next week, TCP bugs.

Well, it's back to crashing my box with netfilter. Never a dull moment.

Thursday Mar 25 1999

Another cool EMail to add to my collection. It's always rewarding to receive tidbits like this:

Hello Paul:

I just wanted to write and thank you for the tremendous job you did
writing the ipchains howto. I've been working with Linux on my home
network for about a year, and network security is an area I've long been
interested in. Until now, I haven't made the time to learn about it.
Your howto has provided me with a lot of useful general information, as
well as the inspiration to dig deeper. You obviously put a lot of work
into it!

Also, I'd like to send my thanks to you and all of the others who worked
to make ipchains available. I use it on my home firewall to provide
acces to a mixture of Win95/NT/CE, MacOS, OpenVMS, and Linux machines.
Please forward my gratitude to other contributors.

Best regards,

Earl Morren
River Falls, Wisconsin, USA

You can see my previous response under ``Wednesday February 16 1999''. Anyone who attended my `Future Plan for Linux Packet Filtering' talk at LinuxWorld will know that my greatest contribution to Linux Packet filtering was the HOWTO (the changes to the packet filter code were evolutionary, and insufficient). While netfilter will change all this (and ipchains was a necessary stepping stone for me to get the experience and user feedback required for the netfilter infrastructure), I still regard the HOWTO as my greatest Linux achievement.

There are now four people doing translations of my HOWTO into other languages, and I consider this to be a huge compliment.

Meanwhile, I've reworked NAT (again). I think the new stackable framework for NAT binding is more efficient and generally nicer. You will now be able to specify rules like "masquerade everything out ppp0", and have TCP, UDP, ICMP and others handled, with full per-protocol support (without having to insert specific rules). In addition, if someone were to write a TCP load-sharing module, you'd be able to say things like "redirect,tcp-loadshare", which would first redirect packets to a local port range, then loadshare them across that range. Each stage is responsible for calling the next stage, so you get to pre- and post-filter their actions.

Each protocol provides a simple default binding, which does the actual allocation of the new connection, based on its "range" parameter. Other things (such as load-sharing, redirect, masquerading) can hand it a different range parameter. It's sweet.

Per-protocol handling is done very similarly to the old code: both tcp and udp allow you to register "per-protocol" handlers, in which you specify alternate timeouts and callbacks for a given destination port. This works whether the connection is being NAT'ed or RNAT'ed.

The new infrastructure is necessary because the old way of having the user specify what bind function to call was getting messy. Firstly, it meant a rule for each protocol, and secondly, "null" bindings didn't know which bind function to call, and didn't get the benefit of per-protocol handling (mainly timeout differences).

Now I have my test network up (two laptops, kevin and ketchup, connected by a PLIP cable, and a network connection from ketchup to hambush, my Netwinder), it should speed development and testing of NAT.

Local NAT, now I know how to do it properly, is on hold in order to speed up release. Once through-NAT is stable, I'll do a release then work on local NAT again.

Tuesday March 23 1999

Spent the day working on an update of a program sent to ipchains-dev by Didier Dhaenens. It was designed to reorder a chain's rules into an optimal order, but it was overly simplistic.

To do it correctly, you need to create a Directed Acyclic Graph of the rule dependencies, then sort them without putting any rule before another rule it depends on. Rule B "depends on" Rule A if there is a packet which could match both, and the verdicts are different (here we don't care about counters). Figuring out the intersection of two rules is the fiddly part (consider the case of interface name comparisons with possible wildcards and inverses involved).

My brain hurt trying to remember DAG stuff from my undergrad days, when I realized that it was far easier to assign a score to each rule, and sort them into descending order. Each rule has a score which is the number of packet matches it has, plus the scores of each of its dependents, plus the number of dependents. This means that if B depends on A, then A will always have a score > B. Since we have a valid order already (the original rules), it's trivial to traverse this backwards to calculate the scores, then sort into score order.
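The scoring trick described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the actual script: the `Rule` class is hypothetical, and the real intersection test is replaced by a toy one in which each rule lists the abstract packet classes it matches.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    name: str
    matches: frozenset   # toy stand-in for the real match criteria
    verdict: str
    hits: int            # packet counter for this rule


def depends_on(later: Rule, earlier: Rule) -> bool:
    """later depends on earlier if some packet could match both and the
    verdicts differ (toy intersection test: shared packet classes)."""
    return later.verdict != earlier.verdict and bool(later.matches & earlier.matches)


def reorder(rules):
    """Sort rules by descending score without moving any rule ahead of
    one it depends on. The input order is assumed to be valid."""
    n = len(rules)
    scores = [0] * n
    # Walk backwards so every dependent's score is known before it is needed.
    for i in range(n - 1, -1, -1):
        dependents = [j for j in range(i + 1, n) if depends_on(rules[j], rules[i])]
        scores[i] = rules[i].hits + sum(scores[j] for j in dependents) + len(dependents)
    # sorted() is stable, so rules with equal scores keep their original order.
    order = sorted(range(n), key=lambda i: -scores[i])
    return [rules[i] for i in order]


chain = [
    Rule("deny-telnet", frozenset({"telnet"}), "DENY", 2),
    Rule("accept-tcp", frozenset({"telnet", "http", "smtp"}), "ACCEPT", 900),
    Rule("accept-dns", frozenset({"dns"}), "ACCEPT", 5000),
]
new_order = [rule.name for rule in reorder(chain)]
```

In this example the busy DNS rule moves to the front, while the rarely-hit telnet rule (score 2 + 900 + 1 = 903) still stays ahead of the TCP rule that depends on it (score 900).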

Netfilter tomorrow, I promise...

Saturday March 20 1999

Just released pre-releases of ipchains, the HOWTO and the ipchains scripts. None of them had really pressing issues except the HOWTO (two incorrect examples), but the main reason for doing the new versions was because I'm getting too much ipchains mail to me personally (about half a dozen a day). Now all bug reports are to be sent to the mailing list, where many problems are solved without my help.

This week should see my test network up and running; while I've given up on the Netwinder as a development box (2.2 isn't ready on Netwinder yet), with a serial cable I can use it as a client. This should fast-track the next netfilter development phase, which will be my focus for the next two weeks.

The main issue is going to be speed; the first cut of netfilter's NAT will be slow. Not as slow as a 2.2 kernel with transparent proxy compiled in which is also doing masquerading, but still too slow. The real benchmark in this battle is either a FreeBSD box, or (closer to home), Alexey's iproute NAT code. If I can get within 10% of Alexey for real traffic, I'll cut his code out as well (removing code == GOOD).

Of course, the real aim is to use the cache code to allow Alexey's fast forwarding to work in as many cases as possible; even if you're doing NAT, portforwarding, packet filtering etc on some packets.

I have a 6GB disk with a 2GB real-life packet dump on it, thanks to WatchGuard. In about a month I hope to have the tools in place for using this to stress-test my laptop; this is the stuff I will be benchmarking on. Finally, I'll have a reasonable response to "what size pipe can I masquerade on my Pentium 166 laptop?".

IPX firewalling is also coming along; only two weeks behind the schedule I promised Jay. Kernel module compiles; working on userspace.

Saturday March 6 1999

Things are calming down after the conference. Quiet day; sitting in a laundromat in San Jose at the moment. Caught up on my ipchains mail, but haven't even looked at linux-kernel yet.

I promised Jay Schulist that I'd finish IPX firewalling for him, so that should be done tonight (need to finish the userspace tool). Tomorrow we (Michael Neuling and I) catch the train to San Francisco, and Larry McVoy has offered to put us up. Then we fly to Orlando for DisneyWorld, then New York, then home.

I want to release another snapshot soon; NAT in particular is getting interesting (but needs far more testing). I'm pretty sure locking is still hosed, but what's an occasional crash between friends?

Thursday March 4 1999

Long time no write. I'm sitting on the floor of a room in the Clown Plaza in San Jose, with Alex deVries and some other cool Puffin guys. They have net!

Far too much to write about at LinuxWorld. I'm pretty much committed for LinuxExpo in May. Don't know about LinuxWorld August, although if they get Alexey, I'll be there.

Random ideas that have come forth this week include: the Linux Kernel developer human pyramid, the Linux Enquirer, the Kernel Hacker secret handshake, and the Linux development ship which circles the world in International waters, allowing crypto development.

I won't do a write-up, as everyone else will. It was big, but it also had worrying shades to it.

Wednesday February 24 1999

Damn; how did a whole week go by? I've found a couple of hairy problems with port allocations, but to be honest, most of my time has been spent in preparation for LinuxWorld Expo. I tried (and failed) to get the glossy brochures for the Conference of Australian Linux Users organized before I left; Geoffrey Bennett is left holding the fort on that one.

How do you allocate ports for masquerading (or any NAT where you're sharing the address space you're mapping to with a real interface)? This is done in the older code by simply hardcoding the port range 61000 - 65095 for masquerading.

This is bad because it breaks rlogin: basically, privileged ports should get mapped to privileged ports. It also restricts the number of connections you can masquerade. You also have to decide whether your NAT overlaps with an interface address (what if they bring up an interface in the middle of the NAT range?), or restrict all NAT to those ports.

Previously I had something to allow the NAT code to `claim' ports from the TCP and UDP layers. This is nicer, but still has the problems above, and means that the UDP and TCP layers need to be altered. Also, consider the case of port 8080 being allocated by NAT, and you want to start a web server there: you're out of luck.

OK. The other solution is to keep track of all connections (even those not being NAT'ed), and simply make sure no allocations clash. This should work quite well (with caching, these `null' perconns are cheap), and even allows us to share a NAT range with a real IP from a box behind the NAT machine.
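
A minimal sketch of that idea, in Python rather than kernel C, assuming a single table keyed on the full outside tuple. The names ConnTable, track_local and allocate are hypothetical and are not taken from the netfilter code; the low/high port split follows the "ports < 1025" rule mentioned further down.

```python
from itertools import chain

LOW_PORTS = range(1, 1025)        # privileged sources stay privileged
HIGH_PORTS = range(1025, 65536)   # everything else


class ConnTable:
    """Toy record of every connection the box has seen, NAT'ed or not."""

    def __init__(self):
        # Each entry is the tuple as it appears on the outside wire:
        # (proto, our_ip, our_port, remote_ip, remote_port).
        self.in_use = set()

    def track_local(self, proto, ip, port, remote_ip, remote_port):
        """Record a `null' (un-NAT'ed) connection so NAT never reuses its tuple."""
        self.in_use.add((proto, ip, port, remote_ip, remote_port))

    def allocate(self, proto, nat_ip, src_port, remote_ip, remote_port):
        """Choose an outside source port that clashes with nothing tracked."""
        pool = LOW_PORTS if src_port < 1025 else HIGH_PORTS
        # Try to keep the original port, then fall back to the rest of its pool.
        for port in chain([src_port], pool):
            key = (proto, nat_ip, port, remote_ip, remote_port)
            if key not in self.in_use:
                self.in_use.add(key)
                return port
        raise RuntimeError("no free port for this destination")


table = ConnTable()
# A local (un-NAT'ed) connection already uses 8080 towards this server...
table.track_local("udp", "192.0.2.1", 8080, "198.51.100.7", 53)
# ...so a masqueraded connection with the same source port gets moved.
moved = table.allocate("udp", "192.0.2.1", 8080, "198.51.100.7", 53)
```

Because the key includes the remote end, the same source port can be reused towards a different destination; only genuinely ambiguous tuples are refused.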

The only design problem is that there is a race when two NAT boxes happen to map UDP packets going to each other over the other packet's server port. For example, say we have a UDP server on port 50000 on box A, and port 60000 on box B. Both boxes are masquerading for networks behind them. Box A masquerades an initial UDP packet going to box B's port 60000; it happens to set the source port to 50000. Box B masquerades an initial UDP packet going to box A's port 50000; it happens to set the source port to 60000. The two packets cross in transit.

Each box will think the other packet is a reply, and demasquerade it (which is wrong). This only happens if both are initial packets (if either box has seen the other packet first, it won't assign that port, since it would be a duplicate perconn). Moreover, we can detect this case for TCP, so it has to be UDP.

The worst case is for servers on low ports (we map ports < 1025 to 1-1024), giving a 1 in a million chance. Consider two DNS servers/NAT boxes, each masquerading another DNS server. The DNS requests cross; the incoming request will be demasqueraded (instead of going to the local server) and the internal server will reply (instead of the external server). If the masquerading is one-shot (ie. expires after the first reply), then the reply will be masqueraded on a new port, and ignored by the initial server. The next request will work. Otherwise, the answer will be accepted as kosher.
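
The 1 in a million figure can be checked with a line of arithmetic, assuming (this is an assumption, not something the code guarantees) that each box picks its low source port uniformly and independently from the 1024 available, and that both must land on the other's server port:

```python
from fractions import Fraction

LOW_PORT_COUNT = 1024  # ports 1-1024, the pool used for privileged sources

# Box A must pick box B's server port AND box B must pick box A's.
p_one_box = Fraction(1, LOW_PORT_COUNT)
p_collision = p_one_box * p_one_box

print(p_collision)          # exact fraction
print(float(p_collision))   # as a decimal
```

That is per pair of crossing initial packets; the packets also have to cross in transit, which makes the real-world rate lower still.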

It might be possible to come up with a less contrived case, but it seems that this is unlikely to be a real issue.

Wednesday February 17 1999

Someone recently asked (obviously euphoric at setting up their net access through masquerading successfully; a feeling I know well), ``How can I personally thank the author of IP Chains?''.

I guess it's natural to blame the entire thing on one person, but this is ridiculous; masquerading should be credited to the original BSD authors, or anyone but me. The current masquerading code is even more bazaar-like than most of the code: there are many more names spread throughout its parts.

With that in mind, I replied thus:

Well, Linus started the kernel, Fred van Kempen did most of the
early networking code, Alan Cox then took it over, Daniel Boulet and
Ugen J.S.Antsilevich did the original BSD firewalling code, Alan Cox
and Jos Vos ported it and modified it for Linux, Pauline Middelink
did the masquerading additions, and most recently Juan Jose Ciarlante
has been maintaining and enhancing it while I reworked the packet
filtering code for 2.2.  David S. Miller is the main current
maintainer of the IP code, and Alexey Kuznetsov is the main TCP/IP
hacker at the moment.

Help an old lady across the road; she probably wrote one of the
per-protocol masquerading modules or something.

Tuesday February 16 1999

OK. Completed FTP module, and gained new appreciation for the masq code. It's really hard to alter TCP packets in midstream, and handle the case of retransmission correctly. My dream of altering partial packets in a graceful and flawless manner will remain just that for a while.

Now I just have to test the module (tomorrow...). A week until I leave for Seattle then LinuxWorld Expo, and I have to get registrations for the Conference of Australian Linux Users organized before I leave...

Monday February 15 1999

Quiet weekend; didn't touch my laptop once (had a great Valentine's Day with my SO). I've been paying for it today, catching up on mail.

I realised that my idiotic library to match patterns in packets is a complete waste of space; I'll steal Brian Murrell's code I think. Brian reports that his web server occasionally splits PASV responses (no doubt due to Nagle): this will break the current MASQ code, and we must handle this case, even though it's mega icky. I wonder how many people are getting 1 in 100 masq ftp failures and not realising it (you'd have to be using a browser, or something else which uses passive ftp).
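
To see why a split PASV response matters, here is a small Python sketch (not the MASQ code, and not Brian's) comparing per-segment matching with matching on a reassembled line. The regular expression and function names are illustrative assumptions; only the "227 ... (h1,h2,h3,h4,p1,p2)" reply shape comes from the FTP protocol itself. It ignores retransmission and the actual packet rewriting, which are the hard parts.

```python
import re

# FTP passive reply: 227 <text> (h1,h2,h3,h4,p1,p2)
PASV_RE = re.compile(rb"227 [^(]*\((\d+),(\d+),(\d+),(\d+),(\d+),(\d+)\)")


def parse_pasv(data):
    """Return (ip, port) if data holds a complete PASV reply, else None."""
    m = PASV_RE.search(data)
    if m is None:
        return None
    h1, h2, h3, h4, p1, p2 = (int(x) for x in m.groups())
    return f"{h1}.{h2}.{h3}.{h4}", p1 * 256 + p2


def match_per_segment(segments):
    """Naive approach: look at each TCP segment in isolation."""
    return [parse_pasv(seg) for seg in segments]


def match_buffered(segments):
    """Buffer the stream and only match complete CRLF-terminated lines."""
    buf = b""
    found = []
    for seg in segments:
        buf += seg
        while b"\r\n" in buf:
            line, buf = buf.split(b"\r\n", 1)
            result = parse_pasv(line)
            if result is not None:
                found.append(result)
    return found


# The reply arrives split across two segments.
segments = [b"227 Entering Passive Mode (10,0,0,5,", b"19,137)\r\n"]
naive = match_per_segment(segments)
buffered = match_buffered(segments)
```

The naive matcher sees nothing in either segment, so the address never gets rewritten; the buffered one recovers the address and port, at the cost of holding state per connection.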

Meanwhile I'll do naive ports and fix them later. Release another snapshot on Wednesday, I hope. Tonight Michael and I went over the tutorial, and tomorrow night is the LinuxSA meeting.

Tuesday February 9 1999

Another day at the forge. Released a new HOWTO version (it needed doing, there were some irritating mistakes and it needed some extra sections), and another ipchains-scripts version, which had accumulated bug fixes. I'm putting off a new ipchains release (there's nothing major in it, just linenumbers) at least until Scott Bronson updates the Quick Reference page in about 3 weeks.

Cleared the way for userspace handling of per-protocol issues; now I need to port the per-protocol modules from the old ip_masq code and test them. Of particular interest is Quake, where the detrimental effects of shuffling each packet through userspace are most likely to be noticed (I'd guess up to 200 microseconds extra delay each way on my Pentium 166). Basically, if I can get away with Quake, I can do anything (well, scanning each packet of a CU-SeeMe stream might chew CPU, but millisecond latency doesn't matter much there).

The way it's implemented is not what I was originally planning, but it makes sense. The tcp-nat and udp-nat modules take a setsockopt(), which allows you to add or delete a port from the `userspace' list. Then, any new connections set up to that port pass packets to userspace, with mark equal to that port (so different processes can wait for different protocols).
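
As a rough model of that behaviour (in Python, with hypothetical names; the real interface is a setsockopt() on the tcp-nat and udp-nat modules, whose option names are not given here), the decision is made once per connection at setup time. That already-established connections keep their original decision is an assumption drawn from the "new connections" wording.

```python
class ProtoNat:
    """Toy stand-in for a tcp-nat or udp-nat module (names are hypothetical)."""

    def __init__(self):
        self.userspace_ports = set()
        self.conn_marks = {}  # connection id -> mark, or None for in-kernel

    def set_userspace(self, port, enabled):
        """Models the setsockopt(): add or delete a port on the userspace list."""
        if enabled:
            self.userspace_ports.add(port)
        else:
            self.userspace_ports.discard(port)

    def new_connection(self, conn_id, dst_port):
        """Decide at setup whether this connection's packets go to userspace."""
        mark = dst_port if dst_port in self.userspace_ports else None
        self.conn_marks[conn_id] = mark
        return mark

    def mark_for(self, conn_id):
        """Mark carried by every later packet of an existing connection."""
        return self.conn_marks[conn_id]


nat = ProtoNat()
nat.set_userspace(21, True)            # an FTP helper asks for port 21
ftp_mark = nat.new_connection("c1", 21)
web_mark = nat.new_connection("c2", 80)
```

A userspace process handling FTP would then wait only for packets with mark 21, leaving other marks to other processes.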

This can be trivially extended to allow the handling to be done by a kernel module, should userspace be too slow for some cases (but I don't want to encourage this unless I'm backed into a corner. With a knife at my throat).

Also figured out the `genuine transparent proxy' solution; writing a special NAT module to support it should be trivial, and it'll be functionally superior to the current setup as well (errors on outgoing connection establishment can be forwarded to the original client).

Monday February 8 1999

Released a new snapshot. Highlights are:

Various bugs fixed, including routing-affecting changes to local packets, and redirect to used port.
Convenient --redirect option for ip_nat_ctl.
Updated against 2.2.1.
Doesn't include RCS archives: smaller.
Example iptables add-on module (MAC matching and REJECT).
Much nicer local packet handling for NAT.

Get it from: ftp://ftp.rustcorp.com/netfilter/netfilter-1999-02-08.tar.bz2

Sunday February 7 1999

Corrections welcome.

Advertising & News Inc -- Wednesday February 4th 2037

RADICAL "FREE VISION" BILL UNLIKELY TO PASS CONGRESS

An independent bill curtailing business rights on advertisements is
extremely unlikely to obtain `serious consideration' according to
White House spokesperson David Gammet.

David Stallman, Independent congressman and grandson of the late Free
Software advocate Richard Stallman, described the Advertising
Liability Repeal Bill as `a return to the intentions of the
constitution' regarding copyright law.  `There is no evidence that
Advertising Liability does anything other than reduce freedom to line
the pockets of large corporations, such as Advertising & News Inc'.
[thispublicationisawhollyownedsubsidiaryofadvertisingandnews].

Advertising Liability can be traced back to the landmark Pearl And
Dean vs. Presley Estate case in 2013, in which the Supreme Court ruled
that `use of copyrighted artwork for public viewing ... whether for
advertisement or other purpose ... implies a liability on behalf of
the viewer'.  In recent years, the Free Vision Foundation has promoted
the use of "Open" advertisements, for which no liability is incurred;
that is, the viewer pays nothing for seeing the ad.  In certain niche
markets (mainly educational and technical fields) these Open
Advertisements claim increasing market share.

According to Advertising & News spokesman William Gateman, people want
to spend money to see advertisements.  `The so-called Open Ads have
their place in niche markets, but it takes large teams of artists,
focus groups and market research to produce quality advertisements.
Obviously, no one can afford to do this for free.  People are prepared
to pay in return for high quality advertisements; it costs over five
million dollars for a twenty second slot in the Superbowl, and we
can't just give that away.'

Chairman of the Artist Protection Agency, Paul Johnston, goes further.
`What anti-business radicals like the Free Vision Foundation can't
seem to understand is that Advertising Liability creates thousands of
jobs, and is one of the leading exports of the United States.  The
average person pays just 27c a day for advertising or advertising
liability insurance; if it weren't for rampant liability evasion, this
amount would be reduced even further.'  The Advertising Liability
Protection Bill, due to be introduced next month, increases fines for
Liability evasion, offers increased rewards for reporting, and
simplifies collection procedures.  It is widely expected to pass.

Neither party has announced support for the Advertising Liability
Repeal Bill, so this reporter won't be letting her insurance lapse
just yet.

[Emily Postnews, Washington DC]

Wednesday February 3 1999

Rather than rant about the postponement/cancellation of The Bazaar, I thought I'd write a short essay about the Linux Community. I imagine there are parallels with the Apache people, the FreeBSD people, etc, but I'll write about what I know.

The Open Source Community

The phrase "Members of the Open Source Community" is used by Eric S. Raymond and others with fair regularity. It's a label you have to be very careful with, because some people read into it levels of implication which are misleading.

I'm told by those who claim to know, that modern terrorist attacks are frequently done by a group of disparate people who come together for one job, complete the task and then go their separate ways. The Harvard Business Review (IIRC) took the typical Open Source organisation model as a new way of doing business: rather than a static organization with multiple goals, one organization per task, lasting only as long as the task takes.

Thus, the "Open Source Community" is an even more vague term (cf. "The Business Community" or "The Terrorist Community"); there are members of the Linux Community who aren't on speaking terms with members of the FreeBSD Community, even those driven mainly by their antipathy for the other project!

Someone used to dealing with legal entities like a large corporation bases their interaction on this fact: The individual they are speaking with has power to enter into agreements on behalf of the corporation.

Thus, you can treat the individual as if they were the corporation itself. It's a fundamental assumption, so much so that people honour the assumption even when it's not true (eg. Nick Leeson, Barings Bank). The individual is "responsible for" the company, and "speaks for" the company.

Mr. Raymond's self-stated aim of selling Free Software to corporations means explaining it in terms they can relate to. This means adopting the role of "spokesman for" the Open Source Community, and representing them as an organisation. The message: "you can deal with the Open Source Community to your advantage".

One of the golden rules of engineering is "you can't push a string". Well, as a general rule you can't push the Open Source Community. It's not that responsibility is "decentralized", it's that there isn't any; we're not a corporation, or even a conglomerate of corporations.

Otherwise Netscape would have been able to make a deal: they release the source to their browser, in return for Apache not competing head to head with Netscape's SuiteSpot. Even considering such a deal is ludicrous, and shows a fundamental misunderstanding of "the Free Software Community".

If you're tempted to think this way, just replace "Open Source Community" with "Everyone Whose Name Starts With Q". Try cutting a deal with "Everyone Whose Name Starts With Q"; the implication that you'll have to deal with each one, one at a time, is correct. You can't push a string.

To be honest, it is possible to push; it's trickier because you can only affect individuals. Sue them; one at a time. Tip them off to the SPA; at least the software audit will cause the problems. Push their employer. Attack with frivolous patents. Of course, you'd better be ready for some really bad backlashes...

Without a stick, what interactions are possible? You can offer a carrot, and pull. Find something you want that some people out there might also want, and use it as your carrot. Netscape used their browser; even going so far as placing ads on slashdot for developers. Hardware vendors use their hardware itself; release the specs, and people who have the card will be able to use it by writing a driver. Corel are adding to Wine because they want to use it.

Realize also that the Bazaar phenomenon is a statistical effect: once there is only one member it becomes a Cathedral. In fact, at one user the distinction between Free Software and proprietary software vanishes. The theory that all bugs will be found quickly assumes that taking "care factor" multiplied by "skill factor" of each user, and adding them together, reaches a sufficient amount to overcome bugs. But like any statistical effect, there will be cases where it doesn't happen.

This is why Mr. Raymond's fetchmail program crashes for me about once a month. So I type:

   (sleep 2; echo USER rustcorp
    sleep 2; echo PASS password
    sleep 2; echo RETR 1
    sleep 2; echo DELE 1
    sleep 2; echo QUIT
    sleep 2) | telnet mail.camtech.net.au pop-3 > /tmp/mail

and continue as normal. Care factor: v. low. I did submit a bug report once.

Saturday January 30 1999

Signs of Success for Free Software:

The Three Tenors singing ``Join Us Now And Share The Software''.
Successful franchises throughout North America of ``Eric Raymond's Kill and Code Shooting Ranges''.
Latest trend: Geek Chic. Cindy Crawford's date for the Oscars: Richard Stallman.
Alan Cox gets knighted (now Sir Alan).
Third StarWars film has the Empire hunting down ``rebels, traitors, open source advocates and other scum''.
Coke + RedHat. Pepsi + SuSE.
Jolt + Debian.
Small Canadian town has ``World's Largest Tux The Penguin''.
Statue built of Linus in his hometown. [From Dave Miller]
When you say you're into computers, the cab driver says ``So, when's 2.4 gonna be released, ya reckon?''