Originally Published: Wednesday, 20 December 2000 Author: Brian Richardson
Published to: enhance_articles_hardware/Hardware Articles Page: 1/1 - [Printable]

Linux, Windows NT, and the Pentium 4

Brian Richardson investigates exactly why the Linux kernel could not identify the Intel Pentium 4. Read more to find out how a Redmond, Washington software company caused Linus to release a "bug" fix.

   Page 1 of 1  

As you may well know, the Intel Pentium 4 is not getting along well with Linux these days. Unless you run a patch on your distribution, or upgrade to the 2.2.18 kernel, Linux recognizes the Pentium 4 as an "unknown processor." Most Linux users see this as a reason to attack Intel for not working closely with Linux developers. The real reason for this processor problem is an odd tale which has roots in Microsoft Windows NT.

I could tell you now why this processor problem is really a Windows NT problem . . . but that wouldn't be any fun. Right now, it's time for a history lesson.

CPUID: What's My Name, Fool

Prior to the Pentium, software had to jump through elaborate loops to determine exactly what type of CPU was installed on an 80x86 computer. These methods involved checking for illegal opcodes, using known bugs in prior processors, a voodoo doll of Charles Babbage, and a ouija board. Intel fixed some of these problems with CPUID.

The CPUID opcode was introduced to the late models of the Intel 486 (486SL and 486DX4). The Intel Pentium, along with its various clones and successors, have all included this instruction. CPUID allows software to gain information on the CPU type and version. CPUID Function 0 returns an ASCII string, identifying the vendor ("GenuineIntel," "CyrixInstead," "AuthenticAMD," etc.). CPUID Function 1 returns the CPU family, model, and stepping.

An explanation of the full CPUID syntax is left as an exercise to the reader. For way too much information on CPUID, see Page 3-113 of IA-32 Intel Architecture Software Developer's Manual Volume 2: Instruction Set Reference Manual

Keep It In The Family

Intel identifies its various processor using a combination of the family and model codes. Pentium processors are identified by a family code of 5. A family code of 6 covers the PentiumPro and all of its variants. Since the PentiumPro, Pentium II, Pentium III and Celeron are all based on the same processor architecture, they are all part of the P6 family( hence, family code 6). The model code is used to tell the various P6 processors apart, along with the cache size and brand ID, depending on the CPU (it's messy; don't ask).

Intel decided to make a new family code for the Pentium 4. That's where the fun begins.

The Past Comes Alive

The average person would think Intel would just increment the family code, making the Pentium 4 part of 'family 7'. That does make sense, but Intel already has a family code 7 processor : the Itanium (it came before the Pentium 4, even though the P4 hit the market first).

Ok, no problem. Just make the Pentium 4's family code 8 instead of 7. Wrong . . . big problem.

Bit By Bit

Microsoft Windows NT 4.0 ran into a bit of a snag with "family 8." For those not schooled in the ways of binary, the decimal number 8 is "1000" in binary. That's four binary digits . . . four bits.

Four bits. Remember that, because it's important.

When Windows NT 4.0 and its six service packs were released, the largest CPU family code was 6. That's "110" in binary . . . only three bits. So the NT code only looks at the first three bits of the CPU family when configuring the system.

If you haven't figured it out by now, the first three bits of 8 are zero, zero and, you guessed it, zero. Windows NT goes wacko when it sees a CPU family zero. Serious wacko. Jack with an axe at the end of The Shining wacko. Since Windows 2000 wasn't in wide release at the time, and Intel wanted to avoid this tech support issue, the family code had to be changed to avoid a conflict with Windows NT.

So now the family code for the Pentium 4 is 15, or "1111" in binary, so the first three bits look like 'CPU family 7' to Windows NT. Of course, this wasn't revealed to software developers outside of a non-disclosure agreement (NDA) till the official release of the Pentium 4.

So that's why Linux doesn't properly identify the Pentium 4 without a kernel upgrade.

It's Microsoft's fault.

The lesson: it's amazing what one bad assumption can do.

Brian Richardson knows this information is good because a little penguin in a trenchcoat gave it to him one night on the bottom level of a parking deck. Brian isn't sure who the penguin is, but he looked oddly familiar. He said his name was "Deep Herring."

   Page 1 of 1