Recent Articles
« Against Alleged Goal Conflicts | Main | Introduction »
Tuesday
Jun232009

The Cline from Image to Text

I strongly agree with Audenaert's assertion that text and image form "two ends of a continuum rather than two poles of a dichotomy." Consider this range of possibility:

  • a photograph
  • an image of a painting
  • a photograph containing readable text
  • an image of a painting with some textual elements
  • artwork consisting mainly of textual representation
  • an image of text with illustrated elements
  • an image of text with annotations
  • a PDF containing text and image regions
  • a PDF consisting of a collage of texts by multiple authors
  • an image of text consisting purely of text written by one author
  • a PDF of text consisting purely of text written by one author

Purely in terms of logical analysis of this continuum, there doesn't appear to be any single point at which it is worthwhile demarcating text from image. Moreover, the following discussion will demonstrate that imagery and textuality are inherently related. Even the most plain-text of documents communicate through extra-linguistic semiota: font, spacing, pagination, formatting, relationship of headings with text, etc.

A purely textual manuscript

Consider the requirements for analysing the following, purely textual document. This manuscript would barely meet Audenaet's requirements to qualify as a visually complex document. Yet one would imagine the tools required to analyse the document would be similar to those employed in photographs or paintings (perhaps with an extra affordance here or there). Fortunately, there is no driver from linguistic or semiotic theory for separating the study of images and text: in the book, The Language of Displayed Art, Michael O'Toole describes how to analyse paintings, photographs and sculptures using analytical techniques borrowed from lingustics.

First Epistle of John (begins in right column). From: Codex Sinaiticus. 4th Century Majuscule. Care of CSNTM.org.

Plain text? or Collagic complexity?

Now consider the following image. The right-hand page consists purely of text. Despite the familiarity of the type-setting, and the evident bookishness, this image is arguably more complex than the manuscript above, because it is highly collagic. Consider just one parameter: the variety of authors and contributors to this one page:

  • The introductory material (top of page), and the notes (bottom of page) were written by a committee of ten academics from the Reformed theological tradition during the period 1988-1995.
  • The headings "The Word of Life", "Walking in the Light" and the translation notes at the bottom of the right-hand column were written by the ESV translation committee circa 2001.
  • The cross-references in the middle column were developed by a team of Bible scholars from Oxford and Cambridge Universities in the 19th Century, incorporating a cross-reference system developed by the translators of the 1611 King James Version.
  • The versification was designed by Robert Estienne in 1551.
  • This version of the biblical text was translated by the ESV translation committee circa 2001, from the United Bible Society's 1993 Greek New Testament (4th corrected ed.), which is a critical text derived from comparison of hundreds of manuscripts, including Codex Sinaiticus pictured above.

Despite this obvious complexity, we have referenced only one parameter out of the total parameter-space for analysing complexity. We've said nothing about the graphic layout, the punctuation, the paragraphing, the visual layout of the outline. While this clearly qualifies as a "visually complex document," it consists almost entirely of plain text.

First Epistle of John. From the Reformation Study Bible, English Standard Version. Published by Ligonier Ministries, Orlando, Florida.

Mechanics of document capture

I captured this image by placing my Bible on my CanoScan LiDE 25, and causing the CanoScan software to capture the image to PDF. So this image is actually an image of a PDF document. When I open the PDF document, I can clearly see that the CanoScan software has represented the image as text despite the obvious skew. But when I position my mouse pointer over the text that reads "That which was from the beginning", and then down into the second verse, the software highlights the middle column, the right-hand column and even some of the words from the facing page. The PDF may have recognized letters; it certainly does not recognize the columnar format.

Attempting to highlight an area of text using Apple's Preview software, on Mac OS X 10.5.7. Scanned from a CanoScan LiDE 25 using CanoScan software.
The down-ranking of meaning in PDF is even a challenge in documents that are printed directly to PDF and then transmitted electronically: the headers and footers, callouts, and other extra-linguistic textual features are intermingled within the text stream. PDF was designed to drive printers, not to facilitate textual analysis by knowledge workers.

At all levels of textual analysis, from authorship to graphology, from linguistic to extra-linguistic semiotic, from imagery or textuality, from source or down-stream digital processing tools, the cline between image and text is gradual. The analytical requirements are largely shared. The tool requirements for many types of text will likely draw from techniques developed for image analysis; image analysis will likely benefit from a common framework with a digital textual analysis tool, particularly one that incorporates notions of spatial hypertext.

Next Article: Assembling the Text

PrintView Printer Friendly Version

EmailEmail Article to Friend

Reader Comments (23)

You work very good, I like it very much. Thank you for sharing!Hope you can have more astonishing works great!ps2 game console

11.3.21 | Unregistered CommenterEllier

It is the happies thing that do one’s best for his dreams. Althought at this monent I can not own my dream hanova-----Montblance women’s watch. But I will work hard for mading my dream come true.

11.5.9 | Unregistered Commenteryoyo

One small problem solved just in time for a bigger one to emerge. On Valentine's Day, my fiance called.Raymond Weil replica rolex daytona.

11.5.27 | Unregistered Commenternew

Robert Estienne designed the versification he though about almost all details, but he forgot to explain its relationship with Generic Viagra ... I know somw pwople thinks it isn't important, but they are wrong.

11.6.28 | Unregistered CommenterDwayne 11

Your article is nice, I read your article to learn a lot and hope to see your next article -cheap Montblanc pens ,look forward to your masterpiece.

Althought at this monent I can not own my dream hanova-----Montblance women’s watch. But I will work hard for mading my dream come true. wordpress premium themes

Replica Watch Shops is the best place to buy online Replica watch. We deliver you the best replica watch in cheap prices.

Replica Watch Shops is the best place to buy online Replica watch. We deliver you the best replica watch in cheap prices.

Replica Audemars Piguet can make you feel good, and that may improve your grade. A successfui businessman must have one kind of watch to show his special quality. for a lady the watch also can attract many eyes. that is a amazing feeling for owning a watch. these watches are selected for all kinds of people. you can get your favorite watch freely.

11.8.16 | Unregistered Commenterlisayun

Die G570 kommt mit einem glänzend und hell 15,6-Zoll-Anti-Glare Widescreen-Display mit einer nativen Auflösung von 1366 x 768. Wir sahen schöne Bilder Akku Dell Latitude D610 während der Transformers: Dark of the Moon Trailer; Rosie Huntington-Whiteley die blauen Augen tauchte inmitten all der mutwilligen Zerstörung, ebenso wie Optimus Prime ist rot und blau-Chassis, wie er Akku Dell Latitude D520 Decepticons gehackt, um Bits.

The best way that people can picture a term. it is with a image. Also comic strips can easily make people to understand what it is behind. For example cialis online has been doing a great job with it. Because every post have those two things.

11.9.22 | Unregistered CommenterDan

Thanks for your sharing, this gucci handbags on salearticle is very good, I like it very much, as you learn a lot!

What are the Green Lanterns powers? I have no idea and me and my mate regulary discuss this?

from : omega replica planet ocean

11.10.6 | Unregistered Commentergoodmape

Nice article, i appreciate for putting this gucci handbags on sale together!"This is obviously one great post.

There are a lot of software that they can check those scans and transform them into editable text for PC. There are companies that they like in that. I read that all xlpharmacy documents were transfrom thanks to the program.

11.10.28 | Unregistered CommenterPeter

Buy the things you like not only cheap and high quality, are you happy?then you do not miss the replica christian louboutin shoes , men down coats , louis vuitton Artsy and louis vuitton Insolite , Don't miss opportunities, hurry to buy one!

11.10.31 | Unregistered CommenterYablly Craig

I have been looking for a software to scan all images and get all that text from it into a word processing program but I can't find someone that it can do it properly and faster. i read some recommendation on viagra online but none has worked in my terms

11.11.8 | Unregistered Commenterdan

I don't know that documents need more works. I think that it needs a better scan. It would be better if they do that. There is a software that viagra online used to digitalizes their documents.

11.11.15 | Unregistered Commenterdan

PostPost a New Comment

Enter your information below to add a new comment.

My response is on my own website »
Author Email (optional):
Author URL (optional):
Post:
 
Some HTML allowed: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <code> <em> <i> <strike> <strong>