Ian (lovingboth) wrote,

  • Mood:

Google & PDFs, partly a note to myself

When converting these to HTML, it doesn't look like it can cope with some ligatures - using a special character to represent particular combinations of two letters: '&' is the most common example (it's 'et', the Latin for 'and') and 'æ' would be another example. Google is ok with these.

But some programs use the ability in some fonts to do this for combinations of letters that otherwise clash visually like fi, fl, ft, tt etc. And google isn't ok with these.

So "This will be the best chance in fifty years to change things for the better" can become, in google's eyes, "This will be the best chance in fi y years to change things for the be er"!

It certainly can't cope with images in PDFs, which is odd. And text at an angle produces some interesting effects...

How often do people use this feature? Is it worth setting up the PDF to be usable in this way, or do people look at / print PDFs directly?

  • Failing as snow again, never wanted to..

    A tiny bit, but gosh, this will have been the snowiest winter in London for ages. The walk to school did help me wake up a bit (although I feel the…

  • One for Bad Science?

    Hmm, a Press Association story saying breastfeeding has almost no benefits to the baby has been picked up by the Mail and the Telegraph and... with…

  • Two recent stories

    I've forgotten to go 'hooray' here over this story from a couple of weeks ago: Food labels advice change over Palestinian territories. I am happy to…

  • Post a new comment


    Anonymous comments are disabled in this journal

    default userpic

    Your reply will be screened

  • 1 comment