Julie asked me to answer the Little Green Footballs analysis of the Obama long form forgery issue. And I can’t. I am prone to believe LGF. The faked birth certificate line demands an extremely high burden of proof considering all the people who would have to be in on the deception.
Arpaio’s investigators think they have met that level –LGF says no– but aside from confirming that the PDF does open in layers (which surprised the hell out of me) as I demonstrate in the previous post… I am out of my league, and a little chagrined for having posted it so quickly without consulting the skeptics.
Here is Charles Johnson’s analysis including his gratuitous kick in the teeth at the end suggesting all doubt about Obama’s birth certificate is based in racism.
PDF Documents, OCR, and Conspiracy Theories
TECHNOLOGY • Fri Apr 29, 2011 at 3:20 pm PDT • Views: 28,617
Let’s get all tech-nerdish for a minute, because I’ve seen an inaccurate statement reported several times now, about the latest inane birth certificate conspiracy theory, most recently at TPMDC: With Drudge Report’s Help, Birthers Latch Onto Phony Forgery Theory.
In fact, the effect was not a sign of foul play at all, but a common attribute of PDF files containing text as an image. On many PDFs, a feature called OCR (optical character recognition) recognizes the letters in the image and separates them into their own layer. This explains why you’re able to highlight and copy raw text from some PDF files even though it’s actually not a word processing document.
As I pointed out yesterday, the OCR setting in Adobe Acrobat is actually irrelevant to this issue; OCR (Optical Character Recognition) has nothing to do with the “layers” you see if you open a PDF file with Adobe Illustrator. Even if you scan a document with OCR turned off (which is the case with the birth certificate PDF released by the White House), these “layers” are still created.
In Portable Document Format, they’re not actually “layers” at all. They’re a result of the method Adobe Acrobat uses to compress and optimize scanned images.
When a PDF document is created from a scanner (even with OCR turned off), areas that contain text are recognized, isolated, and compressed differently than background patterns, lines, and other elements, because different compression algorithms work best for these different types of graphics. When the resulting PDF file is opened with Adobe Illustrator, these elements are interpreted as “layers,” but in terms of the PDF file they’re really not like Illustrator layers at all. The reason for breaking down the image in this fashion is to yield the smallest, most efficient PDF file.
And that’s why, in the White House’s PDF file, the “text” elements are separated (imperfectly) from the background pattern, but remain un-searchable images, not text.
The key point: the layers will still exist, even in documents that don’t use the OCR feature or don’t contain a black President’s birth certificate.