[jira] [Commented] (TIKA-2224) OneNote formats support - Mime Magic and Parser

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (TIKA-2224) OneNote formats support - Mime Magic and Parser

Markus Jelsma (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16992873#comment-16992873 ]

Tim Allison commented on TIKA-2224:
-----------------------------------

Whoa!  It looks like OneNote is storing OCR'd text in the richedittextunicode.  I extracted embedded images and turned on OCR.  Search for "milk" and you'll see it near some noisy text in 0: and then later OCR'd from an image.

{noformat}
0: cbLegacyExpectedFileLength : 0x0
0: cTransactionsInLog : 0x63
0: ffvNewestCode : 0x2a
0: X-Parsed-By : org.apache.tika.parser.DefaultParser
0: X-Parsed-By : org.apache.tika.parser.microsoft.onenote.OneNoteParser
0: nFileVersionGeneration : 0xd1
0: X-TIKA:content_handler : ToXMLContentHandler
0: rgbPlaceholder : 0x0
0: ffvOldestReader : 0x2a
0: buildNumberOldestWritten : 0xfa929b5
0: grfDebugLogFlags : 0x0
0: cbFreeSpaceInFreeChunkList : 0x3f8
0: crcName : 0xfbeeb230
0: cbLegacyFreeSpaceInFreeChunkList : 0x0
0: buildNumberCreated : 0xfa929b5
0: cbExpectedFileLength : 0x57f58
0: X-TIKA:parse_time_millis : 8409
0: X-TIKA:embedded_depth : 0
0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="cbLegacyExpectedFileLength" content="0x0" />
<meta name="cTransactionsInLog" content="0x63" />
<meta name="ffvNewestCode" content="0x2a" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.microsoft.onenote.OneNoteParser" />
<meta name="nFileVersionGeneration" content="0xd1" />
<meta name="rgbPlaceholder" content="0x0" />
<meta name="ffvOldestReader" content="0x2a" />
<meta name="buildNumberOldestWritten" content="0xfa929b5" />
<meta name="grfDebugLogFlags" content="0x0" />
<meta name="cbFreeSpaceInFreeChunkList" content="0x3f8" />
<meta name="crcName" content="0xfbeeb230" />
<meta name="cbLegacyFreeSpaceInFreeChunkList" content="0x0" />
<meta name="buildNumberCreated" content="0xfa929b5" />
<meta name="cbExpectedFileLength" content="0x57f58" />
<meta name="ffvLastCode" content="0x2a" />
<meta name="buildNumberLastWroteToFile" content="0xfa929b5" />
<meta name="buildNumberNewestWritten" content="0xfa929b5" />
<meta name="Content-Type" content="application/onenote; format=one" />
<title></title>
</head>
<body><p>(jjjU
Share with anyone
on PC, phone, or tablet
Sync to OneDrive</p>
<p>0</p>
<a href="http://o15.officeredir.microsoft.com/r/rlidOneNoteGuideVideo15?clid=1033">2 minute video</a><a href="http://o15.officeredir.microsoft.com/r/rlidOneNoteGuideVideo15?clid=1033">Watch the</a><p>Write your name here</p>
<p>1. Take notes anywhere on the page</p>
<p>You start with "My Notebook" - everything lives in here</p>
<p>Add sections for activities like:</p>
<p>2. Get organized</p>
<p>Trips jf Recipes ¡Shopping
Add sections!</p>
<p>Add pages inside of each section:</p>
<p>Add pages!
Summer vacation
Camping
Family visit
Europe</p>
<p>For more tips, check out 30 second videos</p>
<p>(Pages are over there)</p>
<p>OneNote: one place for all of your notes</p>
<p>Tickets Reservation
*E*E</p>
<a href="http://o15.officeredir.microsoft.com/r/rlidOneNote15Tutorial1?clid=1033">Clip from the web</a><p>Shoppirg list
Books to read
n
Search all notebooks,..</p>
<a href="http://o15.officeredir.microsoft.com/r/rlidOneNote15Tutorial2?clid=1033">Plan a trip with others</a><a href="http://o15.officeredir.microsoft.com/r/rlidOneNote15Tutorial3?clid=1033">Search notes instantly</a><p>gv-e4t’</p>
<a href="http://o15.officeredir.microsoft.com/r/rlidOneNote15Tutorial4?clid=1033">Write notes on slides</a><p>Create your first page</p>
<p>You're in the Quick Notes section - use it for random notes</p>
<p>Quick Notes ________ _____________ Add your first pag&amp;
Add Page</p>
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<div class="embedded" />
<p>TABLE TOOLS
LAYOUT
_ Iji
Shading Sort Convert to Excel
Spreadsheet</p>
<p>HOME —+
To Do
Tag</p>
<p>▹Create your own custom tags</p>
<p>▹Take notes on Outlook or Lync meetings </p>
<p>Integrate with Outlook</p>
<p>HOME —
OneNote</p>
<p>▹Add Outlook tasks from OneNote</p>
<p>Status meeting ;
Conf room 36
John, Felicity
r Follow up with John
I
—</p>
<p>▹Insert meeting details</p>
<p>From Outlook:</p>
<p>HOME —b
Outlook Meeting
Tasks Details</p>
<p>▹They go into your Quick Notes section</p>
<p>Take quick notes</p>
<p>Organize with tables</p>
<p>▹Convert tables to Excel spreadsheets</p>
<p>▹Type, then press TAB to create a table</p>
<p>OneNote Basics</p>
<p>▹Quickly sort and shade tables</p>
<p>in your taskbar
+ N on your keyboard</p>
<p>▹Quickly jot down thoughts and ideas</p>
<p>bu Quarter 1 revenue
Sales Revenue Expenses
Scott 4 5 3
James 2 1 4  .1</p>
<p>▹Preview updates on the page</p>
<p>INSERT —.
Spreadsheet</p>
<p>▹Track finances, budgets, &amp; more </p>
<p>Add Excel spreadsheets</p>
<p>ELi Quick Notes ?
ç
...</p>
<p>in the top comer of the page</p>
<p>▹Hide everything but the essentials</p>
<p>Brainstorm without clutter</p>
<p>▹Extra space to focus on your notes</p>
<p>Don’t forget to buy rìk
on the way home!</p>
<p>▹Annotate with a stylus on your tablet</p>
<p>in your taskbar
+ N on your keyboard</p>
<p>Write notes on slides</p>
<p>i1ie
L0okS gr</p>
<p>▹Highlight and finger-paint</p>
<p>▹Send PowerPoint or Word docs to OneNote</p>
<p>▹Take screenshots of products online</p>
<p>in your taskbar
+ S on your keyboard</p>
<p>▹Save important news articles </p>
<p>�</p>
<p>Sunday retreat
Atenci’q? Cern qh:? Vegetria?
Cris
Molly
Deter
Samuel
Stacy
A
z</p>
<p>▹Accessible from any device</p>
<p>..
Description I
Pricing
$249.99—599.99</p>
<p>L4j
_I</p>
<p>▹Real-Time Sync on the same page</p>
<p>Keep everything in sync</p>
<p>▹People can edit pages at the same time</p>
<p>▹Everything stored in the cloud</p>
<p>�</p>
<p>Share</p>
<p>▹Anyone can edit in a browser</p>
<p>�</p>
<p>Collaborate with others</p>
<p>�</p>
<p>Flight details Sights to see
Transportation Reservation
. Arrive a: airport at Earn Ben Hotel is for the 6:h lOth Tom
. P1aie depars at 8arr Do we need to extend
. Plane lands a: 2pm the reser’a:on oy a day?</p>
<p>▹Share with friends and family </p>
<p>To-do lists
Shopping list Priorities
j Milk D Check messages
E Oranges * Call Dave
El Potatoes ? Follow up with Jim
Ri Bread J Schedule appt.
El Cereal E] Call Janet
Ri Sugar</p>
<p>▹Keep your notebooks on OneDrive</p>
<p>Remember everything </p>
<p>▹Add Tags to any notes</p>
<p>▹Make checklists and to-do lists</p>
<p>Clip from the web</p>
<p>▹Quickly clip anything on your screen</p>
</body></html>
0: ffvLastCode : 0x2a
0: buildNumberLastWroteToFile : 0xfa929b5
0: buildNumberNewestWritten : 0xfa929b5
0: Content-Type : application/onenote; format=one
1: Transparency Alpha : none
1: X-TIKA:content_handler : ToXMLContentHandler
1: tiff:ImageLength : 170
1: Compression CompressionTypeName : deflate
1: Data BitsPerSample : 8 8 8
1: Data PlanarConfiguration : PixelInterleaved
1: Dimension VerticalPixelSize : 0.26462027
1: IHDR : width=220, height=170, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
1: Chroma ColorSpaceType : RGB
1: tiff:BitsPerSample : 8 8 8
1: Content-Type : image/png
1: height : 170
1: gAMA : 45455
1: X-Parsed-By : org.apache.tika.parser.DefaultParser
1: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
1: X-Parsed-By : org.apache.tika.parser.image.ImageParser
1: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
1: Chroma Gamma : 0.45455
1: Dimension PixelAspectRatio : 1.0
1: sRGB : Perceptual
1: Compression NumProgressiveScans : 1
1: Dimension HorizontalPixelSize : 0.26462027
1: Chroma BlackIsZero : true
1: Compression Lossless : true
1: X-TIKA:embedded_depth : 1
1: width : 220
1: X-TIKA:parse_time_millis : 418
1: Dimension ImageOrientation : Normal
1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="170" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=220, height=170, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="170" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="220" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-1" />
<meta name="tiff:ImageWidth" content="220" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr" />
</body></html>
1: X-TIKA:embedded_resource_path : /embedded-1
1: tiff:ImageWidth : 220
1: Chroma NumChannels : 3
1: Data SampleFormat : UnsignedIntegral
2: Transparency Alpha : none
2: X-TIKA:content_handler : ToXMLContentHandler
2: tiff:ImageLength : 254
2: Compression CompressionTypeName : deflate
2: Data BitsPerSample : 8 8 8
2: Data PlanarConfiguration : PixelInterleaved
2: Dimension VerticalPixelSize : 0.26462027
2: IHDR : width=651, height=254, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
2: Chroma ColorSpaceType : RGB
2: tiff:BitsPerSample : 8 8 8
2: Content-Type : image/png
2: height : 254
2: gAMA : 45455
2: X-Parsed-By : org.apache.tika.parser.DefaultParser
2: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
2: X-Parsed-By : org.apache.tika.parser.image.ImageParser
2: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
2: Chroma Gamma : 0.45455
2: Dimension PixelAspectRatio : 1.0
2: sRGB : Perceptual
2: Compression NumProgressiveScans : 1
2: Dimension HorizontalPixelSize : 0.26462027
2: Chroma BlackIsZero : true
2: Compression Lossless : true
2: X-TIKA:embedded_depth : 1
2: width : 651
2: X-TIKA:parse_time_millis : 309
2: Dimension ImageOrientation : Normal
2: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="254" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=651, height=254, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="254" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="651" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-2" />
<meta name="tiff:ImageWidth" content="651" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">sH GID) am

Share with anyone
on PC, phone, or tablet

 

Sync to OneDrive
</div>
</body></html>
2: X-TIKA:embedded_resource_path : /embedded-2
2: tiff:ImageWidth : 651
2: Chroma NumChannels : 3
2: Data SampleFormat : UnsignedIntegral
3: Transparency Alpha : none
3: X-TIKA:content_handler : ToXMLContentHandler
3: tiff:ImageLength : 61
3: Compression CompressionTypeName : deflate
3: Data BitsPerSample : 8 8 8
3: Data PlanarConfiguration : PixelInterleaved
3: Dimension VerticalPixelSize : 0.26462027
3: IHDR : width=59, height=61, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
3: Chroma ColorSpaceType : RGB
3: tiff:BitsPerSample : 8 8 8
3: Content-Type : image/png
3: height : 61
3: gAMA : 45455
3: X-Parsed-By : org.apache.tika.parser.DefaultParser
3: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
3: X-Parsed-By : org.apache.tika.parser.image.ImageParser
3: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
3: Chroma Gamma : 0.45455
3: Dimension PixelAspectRatio : 1.0
3: sRGB : Perceptual
3: Compression NumProgressiveScans : 1
3: Dimension HorizontalPixelSize : 0.26462027
3: Chroma BlackIsZero : true
3: Compression Lossless : true
3: X-TIKA:embedded_depth : 1
3: width : 59
3: X-TIKA:parse_time_millis : 197
3: Dimension ImageOrientation : Normal
3: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="61" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=59, height=61, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="61" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="59" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-3" />
<meta name="tiff:ImageWidth" content="59" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr" />
</body></html>
3: X-TIKA:embedded_resource_path : /embedded-3
3: tiff:ImageWidth : 59
3: Chroma NumChannels : 3
3: Data SampleFormat : UnsignedIntegral
4: Transparency Alpha : none
4: X-TIKA:content_handler : ToXMLContentHandler
4: tiff:ImageLength : 138
4: Compression CompressionTypeName : deflate
4: Data BitsPerSample : 8 8 8
4: Data PlanarConfiguration : PixelInterleaved
4: Dimension VerticalPixelSize : 0.26462027
4: IHDR : width=127, height=138, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
4: Chroma ColorSpaceType : RGB
4: tiff:BitsPerSample : 8 8 8
4: Content-Type : image/png
4: height : 138
4: gAMA : 45455
4: X-Parsed-By : org.apache.tika.parser.DefaultParser
4: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
4: X-Parsed-By : org.apache.tika.parser.image.ImageParser
4: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
4: Chroma Gamma : 0.45455
4: Dimension PixelAspectRatio : 1.0
4: sRGB : Perceptual
4: Compression NumProgressiveScans : 1
4: Dimension HorizontalPixelSize : 0.26462027
4: Chroma BlackIsZero : true
4: Compression Lossless : true
4: X-TIKA:embedded_depth : 1
4: width : 127
4: X-TIKA:parse_time_millis : 226
4: Dimension ImageOrientation : Normal
4: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="138" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=127, height=138, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="138" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="127" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-4" />
<meta name="tiff:ImageWidth" content="127" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">vo
Notebook |
</div>
</body></html>
4: X-TIKA:embedded_resource_path : /embedded-4
4: tiff:ImageWidth : 127
4: Chroma NumChannels : 3
4: Data SampleFormat : UnsignedIntegral
5: Transparency Alpha : none
5: X-TIKA:content_handler : ToXMLContentHandler
5: tiff:ImageLength : 104
5: Compression CompressionTypeName : deflate
5: Data BitsPerSample : 8 8 8
5: Data PlanarConfiguration : PixelInterleaved
5: Dimension VerticalPixelSize : 0.26462027
5: IHDR : width=742, height=104, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
5: Chroma ColorSpaceType : RGB
5: tiff:BitsPerSample : 8 8 8
5: Content-Type : image/png
5: height : 104
5: gAMA : 45455
5: X-Parsed-By : org.apache.tika.parser.DefaultParser
5: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
5: X-Parsed-By : org.apache.tika.parser.image.ImageParser
5: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
5: Chroma Gamma : 0.45455
5: Dimension PixelAspectRatio : 1.0
5: sRGB : Perceptual
5: Compression NumProgressiveScans : 1
5: Dimension HorizontalPixelSize : 0.26462027
5: Chroma BlackIsZero : true
5: Compression Lossless : true
5: X-TIKA:embedded_depth : 1
5: width : 742
5: X-TIKA:parse_time_millis : 238
5: Dimension ImageOrientation : Normal
5: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="104" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=742, height=104, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="104" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="742" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-5" />
<meta name="tiff:ImageWidth" content="742" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">LL My Notebook ~ Sf

Add sections!

 
</div>
</body></html>
5: X-TIKA:embedded_resource_path : /embedded-5
5: tiff:ImageWidth : 742
5: Chroma NumChannels : 3
5: Data SampleFormat : UnsignedIntegral
6: Transparency Alpha : none
6: X-TIKA:content_handler : ToXMLContentHandler
6: tiff:ImageLength : 211
6: Compression CompressionTypeName : deflate
6: Data BitsPerSample : 8 8 8
6: Data PlanarConfiguration : PixelInterleaved
6: Dimension VerticalPixelSize : 0.26462027
6: IHDR : width=563, height=211, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
6: Chroma ColorSpaceType : RGB
6: tiff:BitsPerSample : 8 8 8
6: Content-Type : image/png
6: height : 211
6: gAMA : 45455
6: X-Parsed-By : org.apache.tika.parser.DefaultParser
6: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
6: X-Parsed-By : org.apache.tika.parser.image.ImageParser
6: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
6: Chroma Gamma : 0.45455
6: Dimension PixelAspectRatio : 1.0
6: sRGB : Perceptual
6: Compression NumProgressiveScans : 1
6: Dimension HorizontalPixelSize : 0.26462027
6: Chroma BlackIsZero : true
6: Compression Lossless : true
6: X-TIKA:embedded_depth : 1
6: width : 563
6: X-TIKA:parse_time_millis : 274
6: Dimension ImageOrientation : Normal
6: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="211" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=563, height=211, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="211" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="563" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-6" />
<meta name="tiff:ImageWidth" content="563" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

Summer vacation

Camping
‘Add pages! Family visit
Europe
</div>
</body></html>
6: X-TIKA:embedded_resource_path : /embedded-6
6: tiff:ImageWidth : 563
6: Chroma NumChannels : 3
6: Data SampleFormat : UnsignedIntegral
7: Transparency Alpha : none
7: X-TIKA:content_handler : ToXMLContentHandler
7: tiff:ImageLength : 103
7: Compression CompressionTypeName : deflate
7: Data BitsPerSample : 8 8 8
7: Data PlanarConfiguration : PixelInterleaved
7: Dimension VerticalPixelSize : 0.26462027
7: IHDR : width=123, height=103, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
7: Chroma ColorSpaceType : RGB
7: tiff:BitsPerSample : 8 8 8
7: Content-Type : image/png
7: height : 103
7: gAMA : 45455
7: X-Parsed-By : org.apache.tika.parser.DefaultParser
7: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
7: X-Parsed-By : org.apache.tika.parser.image.ImageParser
7: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
7: Chroma Gamma : 0.45455
7: Dimension PixelAspectRatio : 1.0
7: sRGB : Perceptual
7: Compression NumProgressiveScans : 1
7: Dimension HorizontalPixelSize : 0.26462027
7: Chroma BlackIsZero : true
7: Compression Lossless : true
7: X-TIKA:embedded_depth : 1
7: width : 123
7: X-TIKA:parse_time_millis : 199
7: Dimension ImageOrientation : Normal
7: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="103" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=123, height=103, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="103" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="123" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-7" />
<meta name="tiff:ImageWidth" content="123" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr" />
</body></html>
7: X-TIKA:embedded_resource_path : /embedded-7
7: tiff:ImageWidth : 123
7: Chroma NumChannels : 3
7: Data SampleFormat : UnsignedIntegral
8: Transparency Alpha : none
8: X-TIKA:content_handler : ToXMLContentHandler
8: tiff:ImageLength : 131
8: Compression CompressionTypeName : deflate
8: Data BitsPerSample : 8 8 8
8: Data PlanarConfiguration : PixelInterleaved
8: Dimension VerticalPixelSize : 0.26462027
8: IHDR : width=163, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
8: Chroma ColorSpaceType : RGB
8: tiff:BitsPerSample : 8 8 8
8: Content-Type : image/png
8: height : 131
8: gAMA : 45455
8: X-Parsed-By : org.apache.tika.parser.DefaultParser
8: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
8: X-Parsed-By : org.apache.tika.parser.image.ImageParser
8: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
8: Chroma Gamma : 0.45455
8: Dimension PixelAspectRatio : 1.0
8: sRGB : Perceptual
8: Compression NumProgressiveScans : 1
8: Dimension HorizontalPixelSize : 0.26462027
8: Chroma BlackIsZero : true
8: Compression Lossless : true
8: X-TIKA:embedded_depth : 1
8: width : 163
8: X-TIKA:parse_time_millis : 190
8: Dimension ImageOrientation : Normal
8: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="131" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=163, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="131" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="163" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-8" />
<meta name="tiff:ImageWidth" content="163" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

 
</div>
</body></html>
8: X-TIKA:embedded_resource_path : /embedded-8
8: tiff:ImageWidth : 163
8: Chroma NumChannels : 3
8: Data SampleFormat : UnsignedIntegral
9: Transparency Alpha : none
9: X-TIKA:content_handler : ToXMLContentHandler
9: tiff:ImageLength : 49
9: Compression CompressionTypeName : deflate
9: Data BitsPerSample : 8 8 8
9: Data PlanarConfiguration : PixelInterleaved
9: Dimension VerticalPixelSize : 0.26462027
9: IHDR : width=46, height=49, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
9: Chroma ColorSpaceType : RGB
9: tiff:BitsPerSample : 8 8 8
9: Content-Type : image/png
9: height : 49
9: gAMA : 45455
9: X-Parsed-By : org.apache.tika.parser.DefaultParser
9: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
9: X-Parsed-By : org.apache.tika.parser.image.ImageParser
9: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
9: Chroma Gamma : 0.45455
9: Dimension PixelAspectRatio : 1.0
9: sRGB : Perceptual
9: Compression NumProgressiveScans : 1
9: Dimension HorizontalPixelSize : 0.26462027
9: Chroma BlackIsZero : true
9: Compression Lossless : true
9: X-TIKA:embedded_depth : 1
9: width : 46
9: X-TIKA:parse_time_millis : 179
9: Dimension ImageOrientation : Normal
9: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="49" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=46, height=49, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="49" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="46" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-9" />
<meta name="tiff:ImageWidth" content="46" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr" />
</body></html>
9: X-TIKA:embedded_resource_path : /embedded-9
9: tiff:ImageWidth : 46
9: Chroma NumChannels : 3
9: Data SampleFormat : UnsignedIntegral
10: Transparency Alpha : none
10: X-TIKA:content_handler : ToXMLContentHandler
10: tiff:ImageLength : 126
10: Compression CompressionTypeName : deflate
10: Data BitsPerSample : 8 8 8
10: Data PlanarConfiguration : PixelInterleaved
10: Dimension VerticalPixelSize : 0.26462027
10: IHDR : width=749, height=126, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
10: Chroma ColorSpaceType : RGB
10: tiff:BitsPerSample : 8 8 8
10: Content-Type : image/png
10: height : 126
10: gAMA : 45455
10: X-Parsed-By : org.apache.tika.parser.DefaultParser
10: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
10: X-Parsed-By : org.apache.tika.parser.image.ImageParser
10: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
10: Chroma Gamma : 0.45455
10: Dimension PixelAspectRatio : 1.0
10: sRGB : Perceptual
10: Compression NumProgressiveScans : 1
10: Dimension HorizontalPixelSize : 0.26462027
10: Chroma BlackIsZero : true
10: Compression Lossless : true
10: X-TIKA:embedded_depth : 1
10: width : 749
10: X-TIKA:parse_time_millis : 203
10: Dimension ImageOrientation : Normal
10: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="126" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=749, height=126, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="126" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="749" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-10" />
<meta name="tiff:ImageWidth" content="749" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">—

Add your first page!

 
</div>
</body></html>
10: X-TIKA:embedded_resource_path : /embedded-10
10: tiff:ImageWidth : 749
10: Chroma NumChannels : 3
10: Data SampleFormat : UnsignedIntegral
11: Transparency Alpha : none
11: X-TIKA:content_handler : ToXMLContentHandler
11: tiff:ImageLength : 131
11: Compression CompressionTypeName : deflate
11: Data BitsPerSample : 8 8 8
11: Data PlanarConfiguration : PixelInterleaved
11: Dimension VerticalPixelSize : 0.26462027
11: IHDR : width=165, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
11: Chroma ColorSpaceType : RGB
11: tiff:BitsPerSample : 8 8 8
11: Content-Type : image/png
11: height : 131
11: gAMA : 45455
11: X-Parsed-By : org.apache.tika.parser.DefaultParser
11: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
11: X-Parsed-By : org.apache.tika.parser.image.ImageParser
11: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
11: Chroma Gamma : 0.45455
11: Dimension PixelAspectRatio : 1.0
11: sRGB : Perceptual
11: Compression NumProgressiveScans : 1
11: Dimension HorizontalPixelSize : 0.26462027
11: Chroma BlackIsZero : true
11: Compression Lossless : true
11: X-TIKA:embedded_depth : 1
11: width : 165
11: X-TIKA:parse_time_millis : 205
11: Dimension ImageOrientation : Normal
11: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="131" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=165, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="131" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="165" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-11" />
<meta name="tiff:ImageWidth" content="165" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr" />
</body></html>
11: X-TIKA:embedded_resource_path : /embedded-11
11: tiff:ImageWidth : 165
11: Chroma NumChannels : 3
11: Data SampleFormat : UnsignedIntegral
12: Transparency Alpha : none
12: X-TIKA:content_handler : ToXMLContentHandler
12: tiff:ImageLength : 131
12: Compression CompressionTypeName : deflate
12: Data BitsPerSample : 8 8 8
12: Data PlanarConfiguration : PixelInterleaved
12: Dimension VerticalPixelSize : 0.26462027
12: IHDR : width=165, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
12: Chroma ColorSpaceType : RGB
12: tiff:BitsPerSample : 8 8 8
12: Content-Type : image/png
12: height : 131
12: gAMA : 45455
12: X-Parsed-By : org.apache.tika.parser.DefaultParser
12: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
12: X-Parsed-By : org.apache.tika.parser.image.ImageParser
12: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
12: Chroma Gamma : 0.45455
12: Dimension PixelAspectRatio : 1.0
12: sRGB : Perceptual
12: Compression NumProgressiveScans : 1
12: Dimension HorizontalPixelSize : 0.26462027
12: Chroma BlackIsZero : true
12: Compression Lossless : true
12: X-TIKA:embedded_depth : 1
12: width : 165
12: X-TIKA:parse_time_millis : 254
12: Dimension ImageOrientation : Normal
12: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="131" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=165, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="131" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="165" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-12" />
<meta name="tiff:ImageWidth" content="165" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

‘Shopping list

 

 
 
   

 

Books to reed

 

 

o

 

 

 

Search all notebooks...

 

 

 

 
</div>
</body></html>
12: X-TIKA:embedded_resource_path : /embedded-12
12: tiff:ImageWidth : 165
12: Chroma NumChannels : 3
12: Data SampleFormat : UnsignedIntegral
13: Transparency Alpha : none
13: X-TIKA:content_handler : ToXMLContentHandler
13: tiff:ImageLength : 131
13: Compression CompressionTypeName : deflate
13: Data BitsPerSample : 8 8 8
13: Data PlanarConfiguration : PixelInterleaved
13: Dimension VerticalPixelSize : 0.26462027
13: IHDR : width=167, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
13: Chroma ColorSpaceType : RGB
13: tiff:BitsPerSample : 8 8 8
13: Content-Type : image/png
13: height : 131
13: gAMA : 45455
13: X-Parsed-By : org.apache.tika.parser.DefaultParser
13: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
13: X-Parsed-By : org.apache.tika.parser.image.ImageParser
13: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
13: Chroma Gamma : 0.45455
13: Dimension PixelAspectRatio : 1.0
13: sRGB : Perceptual
13: Compression NumProgressiveScans : 1
13: Dimension HorizontalPixelSize : 0.26462027
13: Chroma BlackIsZero : true
13: Compression Lossless : true
13: X-TIKA:embedded_depth : 1
13: width : 167
13: X-TIKA:parse_time_millis : 195
13: Dimension ImageOrientation : Normal
13: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="131" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=167, height=131, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="131" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="167" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-13" />
<meta name="tiff:ImageWidth" content="167" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

 

 

 

 

 

 

 

 
</div>
</body></html>
13: X-TIKA:embedded_resource_path : /embedded-13
13: tiff:ImageWidth : 167
13: Chroma NumChannels : 3
13: Data SampleFormat : UnsignedIntegral
14: Transparency Alpha : none
14: X-TIKA:content_handler : ToXMLContentHandler
14: tiff:ImageLength : 79
14: Compression CompressionTypeName : deflate
14: Data BitsPerSample : 8 8 8
14: Data PlanarConfiguration : PixelInterleaved
14: Dimension VerticalPixelSize : 0.26462027
14: IHDR : width=340, height=79, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
14: Chroma ColorSpaceType : RGB
14: tiff:BitsPerSample : 8 8 8
14: Content-Type : image/png
14: height : 79
14: gAMA : 45455
14: X-Parsed-By : org.apache.tika.parser.DefaultParser
14: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
14: X-Parsed-By : org.apache.tika.parser.image.ImageParser
14: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
14: Chroma Gamma : 0.45455
14: Dimension PixelAspectRatio : 1.0
14: sRGB : Perceptual
14: Compression NumProgressiveScans : 1
14: Dimension HorizontalPixelSize : 0.26462027
14: Chroma BlackIsZero : true
14: Compression Lossless : true
14: X-TIKA:embedded_depth : 1
14: width : 340
14: X-TIKA:parse_time_millis : 273
14: Dimension ImageOrientation : Normal
14: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="79" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=340, height=79, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="79" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="340" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-14" />
<meta name="tiff:ImageWidth" content="340" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

warms AL

Layout Shading Sort Convert to Excel
. ~ Spreadsheet

 
</div>
</body></html>
14: X-TIKA:embedded_resource_path : /embedded-14
14: tiff:ImageWidth : 340
14: Chroma NumChannels : 3
14: Data SampleFormat : UnsignedIntegral
15: Transparency Alpha : none
15: X-TIKA:content_handler : ToXMLContentHandler
15: tiff:ImageLength : 76
15: Compression CompressionTypeName : deflate
15: Data BitsPerSample : 8 8 8
15: Data PlanarConfiguration : PixelInterleaved
15: Dimension VerticalPixelSize : 0.26462027
15: IHDR : width=185, height=76, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
15: Chroma ColorSpaceType : RGB
15: tiff:BitsPerSample : 8 8 8
15: Content-Type : image/png
15: height : 76
15: gAMA : 45455
15: X-Parsed-By : org.apache.tika.parser.DefaultParser
15: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
15: X-Parsed-By : org.apache.tika.parser.image.ImageParser
15: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
15: Chroma Gamma : 0.45455
15: Dimension PixelAspectRatio : 1.0
15: sRGB : Perceptual
15: Compression NumProgressiveScans : 1
15: Dimension HorizontalPixelSize : 0.26462027
15: Chroma BlackIsZero : true
15: Compression Lossless : true
15: X-TIKA:embedded_depth : 1
15: width : 185
15: X-TIKA:parse_time_millis : 197
15: Dimension ImageOrientation : Normal
15: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="76" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=185, height=76, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="76" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="185" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-15" />
<meta name="tiff:ImageWidth" content="185" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

 

HOME &gt;
ToDo

Tag
</div>
</body></html>
15: X-TIKA:embedded_resource_path : /embedded-15
15: tiff:ImageWidth : 185
15: Chroma NumChannels : 3
15: Data SampleFormat : UnsignedIntegral
16: Transparency Alpha : none
16: X-TIKA:content_handler : ToXMLContentHandler
16: tiff:ImageLength : 278
16: Compression CompressionTypeName : deflate
16: Data BitsPerSample : 8 8 8
16: Data PlanarConfiguration : PixelInterleaved
16: Dimension VerticalPixelSize : 0.26462027
16: IHDR : width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
16: Chroma ColorSpaceType : RGB
16: tiff:BitsPerSample : 8 8 8
16: Content-Type : image/png
16: height : 278
16: gAMA : 45455
16: X-Parsed-By : org.apache.tika.parser.DefaultParser
16: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
16: X-Parsed-By : org.apache.tika.parser.image.ImageParser
16: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
16: Chroma Gamma : 0.45455
16: Dimension PixelAspectRatio : 1.0
16: sRGB : Perceptual
16: Compression NumProgressiveScans : 1
16: Dimension HorizontalPixelSize : 0.26462027
16: Chroma BlackIsZero : true
16: Compression Lossless : true
16: X-TIKA:embedded_depth : 1
16: width : 453
16: X-TIKA:parse_time_millis : 261
16: Dimension ImageOrientation : Normal
16: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="453" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-16" />
<meta name="tiff:ImageWidth" content="453" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 
 

 

Status meeting
Conf room 36
John, Felicity

   
 

Follow up with John

 

 

 
</div>
</body></html>
16: X-TIKA:embedded_resource_path : /embedded-16
16: tiff:ImageWidth : 453
16: Chroma NumChannels : 3
16: Data SampleFormat : UnsignedIntegral
17: Transparency Alpha : none
17: X-TIKA:content_handler : ToXMLContentHandler
17: tiff:ImageLength : 50
17: Compression CompressionTypeName : deflate
17: Data BitsPerSample : 8 8 8
17: Data PlanarConfiguration : PixelInterleaved
17: Dimension VerticalPixelSize : 0.26462027
17: IHDR : width=171, height=50, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
17: Chroma ColorSpaceType : RGB
17: tiff:BitsPerSample : 8 8 8
17: Content-Type : image/png
17: height : 50
17: gAMA : 45455
17: X-Parsed-By : org.apache.tika.parser.DefaultParser
17: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
17: X-Parsed-By : org.apache.tika.parser.image.ImageParser
17: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
17: Chroma Gamma : 0.45455
17: Dimension PixelAspectRatio : 1.0
17: sRGB : Perceptual
17: Compression NumProgressiveScans : 1
17: Dimension HorizontalPixelSize : 0.26462027
17: Chroma BlackIsZero : true
17: Compression Lossless : true
17: X-TIKA:embedded_depth : 1
17: width : 171
17: X-TIKA:parse_time_millis : 191
17: Dimension ImageOrientation : Normal
17: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="50" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=171, height=50, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="50" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="171" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-17" />
<meta name="tiff:ImageWidth" content="171" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">vome 5 OG
</div>
</body></html>
17: X-TIKA:embedded_resource_path : /embedded-17
17: tiff:ImageWidth : 171
17: Chroma NumChannels : 3
17: Data SampleFormat : UnsignedIntegral
18: Transparency Alpha : none
18: X-TIKA:content_handler : ToXMLContentHandler
18: tiff:ImageLength : 89
18: Compression CompressionTypeName : deflate
18: Data BitsPerSample : 8 8 8
18: Data PlanarConfiguration : PixelInterleaved
18: Dimension VerticalPixelSize : 0.26462027
18: IHDR : width=162, height=89, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
18: Chroma ColorSpaceType : RGB
18: tiff:BitsPerSample : 8 8 8
18: Content-Type : image/png
18: height : 89
18: gAMA : 45455
18: X-Parsed-By : org.apache.tika.parser.DefaultParser
18: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
18: X-Parsed-By : org.apache.tika.parser.image.ImageParser
18: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
18: Chroma Gamma : 0.45455
18: Dimension PixelAspectRatio : 1.0
18: sRGB : Perceptual
18: Compression NumProgressiveScans : 1
18: Dimension HorizontalPixelSize : 0.26462027
18: Chroma BlackIsZero : true
18: Compression Lossless : true
18: X-TIKA:embedded_depth : 1
18: width : 162
18: X-TIKA:parse_time_millis : 200
18: Dimension ImageOrientation : Normal
18: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="89" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=162, height=89, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="89" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="162" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-18" />
<meta name="tiff:ImageWidth" content="162" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">in your tas

   

 

+N onyour keyboard
</div>
</body></html>
18: X-TIKA:embedded_resource_path : /embedded-18
18: tiff:ImageWidth : 162
18: Chroma NumChannels : 3
18: Data SampleFormat : UnsignedIntegral
19: Transparency Alpha : none
19: X-TIKA:content_handler : ToXMLContentHandler
19: tiff:ImageLength : 68
19: Compression CompressionTypeName : deflate
19: Data BitsPerSample : 8 8 8
19: Data PlanarConfiguration : PixelInterleaved
19: Dimension VerticalPixelSize : 0.26462027
19: IHDR : width=230, height=68, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
19: Chroma ColorSpaceType : RGB
19: tiff:BitsPerSample : 8 8 8
19: Content-Type : image/png
19: height : 68
19: gAMA : 45455
19: X-Parsed-By : org.apache.tika.parser.DefaultParser
19: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
19: X-Parsed-By : org.apache.tika.parser.image.ImageParser
19: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
19: Chroma Gamma : 0.45455
19: Dimension PixelAspectRatio : 1.0
19: sRGB : Perceptual
19: Compression NumProgressiveScans : 1
19: Dimension HorizontalPixelSize : 0.26462027
19: Chroma BlackIsZero : true
19: Compression Lossless : true
19: X-TIKA:embedded_depth : 1
19: width : 230
19: X-TIKA:parse_time_millis : 240
19: Dimension ImageOrientation : Normal
19: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="68" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=230, height=68, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="68" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="230" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-19" />
<meta name="tiff:ImageWidth" content="230" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">aay,
r

 

HOME = —&gt;
</div>
</body></html>
19: X-TIKA:embedded_resource_path : /embedded-19
19: tiff:ImageWidth : 230
19: Chroma NumChannels : 3
19: Data SampleFormat : UnsignedIntegral
20: Transparency Alpha : none
20: X-TIKA:content_handler : ToXMLContentHandler
20: tiff:ImageLength : 278
20: Compression CompressionTypeName : deflate
20: Data BitsPerSample : 8 8 8
20: Data PlanarConfiguration : PixelInterleaved
20: Dimension VerticalPixelSize : 0.26462027
20: IHDR : width=454, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
20: Chroma ColorSpaceType : RGB
20: tiff:BitsPerSample : 8 8 8
20: Content-Type : image/png
20: height : 278
20: gAMA : 45455
20: X-Parsed-By : org.apache.tika.parser.DefaultParser
20: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
20: X-Parsed-By : org.apache.tika.parser.image.ImageParser
20: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
20: Chroma Gamma : 0.45455
20: Dimension PixelAspectRatio : 1.0
20: sRGB : Perceptual
20: Compression NumProgressiveScans : 1
20: Dimension HorizontalPixelSize : 0.26462027
20: Chroma BlackIsZero : true
20: Compression Lossless : true
20: X-TIKA:embedded_depth : 1
20: width : 454
20: X-TIKA:parse_time_millis : 268
20: Dimension ImageOrientation : Normal
20: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=454, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="454" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-20" />
<meta name="tiff:ImageWidth" content="454" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

Quarter 1 revenue

Sales Revenue Expenses

Scott 4 5 3
James 2 1 4 | I

 

 
</div>
</body></html>
20: X-TIKA:embedded_resource_path : /embedded-20
20: tiff:ImageWidth : 454
20: Chroma NumChannels : 3
20: Data SampleFormat : UnsignedIntegral
21: Transparency Alpha : none
21: X-TIKA:content_handler : ToXMLContentHandler
21: tiff:ImageLength : 77
21: Compression CompressionTypeName : deflate
21: Data BitsPerSample : 8 8 8
21: Data PlanarConfiguration : PixelInterleaved
21: Dimension VerticalPixelSize : 0.26462027
21: IHDR : width=221, height=77, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
21: Chroma ColorSpaceType : RGB
21: tiff:BitsPerSample : 8 8 8
21: Content-Type : image/png
21: height : 77
21: gAMA : 45455
21: X-Parsed-By : org.apache.tika.parser.DefaultParser
21: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
21: X-Parsed-By : org.apache.tika.parser.image.ImageParser
21: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
21: Chroma Gamma : 0.45455
21: Dimension PixelAspectRatio : 1.0
21: sRGB : Perceptual
21: Compression NumProgressiveScans : 1
21: Dimension HorizontalPixelSize : 0.26462027
21: Chroma BlackIsZero : true
21: Compression Lossless : true
21: X-TIKA:embedded_depth : 1
21: width : 221
21: X-TIKA:parse_time_millis : 192
21: Dimension ImageOrientation : Normal
21: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="77" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=221, height=77, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="77" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="221" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-21" />
<meta name="tiff:ImageWidth" content="221" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">osor | AE

Spreadsheet
</div>
</body></html>
21: X-TIKA:embedded_resource_path : /embedded-21
21: tiff:ImageWidth : 221
21: Chroma NumChannels : 3
21: Data SampleFormat : UnsignedIntegral
22: Transparency Alpha : none
22: X-TIKA:content_handler : ToXMLContentHandler
22: tiff:ImageLength : 277
22: Compression CompressionTypeName : deflate
22: Data BitsPerSample : 8 8 8
22: Data PlanarConfiguration : PixelInterleaved
22: Dimension VerticalPixelSize : 0.26462027
22: IHDR : width=452, height=277, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
22: Chroma ColorSpaceType : RGB
22: tiff:BitsPerSample : 8 8 8
22: Content-Type : image/png
22: height : 277
22: gAMA : 45455
22: X-Parsed-By : org.apache.tika.parser.DefaultParser
22: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
22: X-Parsed-By : org.apache.tika.parser.image.ImageParser
22: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
22: Chroma Gamma : 0.45455
22: Dimension PixelAspectRatio : 1.0
22: sRGB : Perceptual
22: Compression NumProgressiveScans : 1
22: Dimension HorizontalPixelSize : 0.26462027
22: Chroma BlackIsZero : true
22: Compression Lossless : true
22: X-TIKA:embedded_depth : 1
22: width : 452
22: X-TIKA:parse_time_millis : 234
22: Dimension ImageOrientation : Normal
22: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="277" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=452, height=277, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="277" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="452" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-22" />
<meta name="tiff:ImageWidth" content="452" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

Z

SS

&lt;2

SZ
Zl
K|

Se

 

INS

|

LL Quick Notes 2

SEP

Ss

 

 

 
</div>
</body></html>
22: X-TIKA:embedded_resource_path : /embedded-22
22: tiff:ImageWidth : 452
22: Chroma NumChannels : 3
22: Data SampleFormat : UnsignedIntegral
23: Transparency Alpha : none
23: X-TIKA:content_handler : ToXMLContentHandler
23: tiff:ImageLength : 50
23: Compression CompressionTypeName : deflate
23: Data BitsPerSample : 8 8 8
23: Data PlanarConfiguration : PixelInterleaved
23: Dimension VerticalPixelSize : 0.26462027
23: IHDR : width=232, height=50, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
23: Chroma ColorSpaceType : RGB
23: tiff:BitsPerSample : 8 8 8
23: Content-Type : image/png
23: height : 50
23: gAMA : 45455
23: X-Parsed-By : org.apache.tika.parser.DefaultParser
23: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
23: X-Parsed-By : org.apache.tika.parser.image.ImageParser
23: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
23: Chroma Gamma : 0.45455
23: Dimension PixelAspectRatio : 1.0
23: sRGB : Perceptual
23: Compression NumProgressiveScans : 1
23: Dimension HorizontalPixelSize : 0.26462027
23: Chroma BlackIsZero : true
23: Compression Lossless : true
23: X-TIKA:embedded_depth : 1
23: width : 232
23: X-TIKA:parse_time_millis : 191
23: Dimension ImageOrientation : Normal
23: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="50" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=232, height=50, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="50" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="232" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-23" />
<meta name="tiff:ImageWidth" content="232" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">] in the top comer of the page
</div>
</body></html>
23: X-TIKA:embedded_resource_path : /embedded-23
23: tiff:ImageWidth : 232
23: Chroma NumChannels : 3
23: Data SampleFormat : UnsignedIntegral
24: Transparency Alpha : none
24: X-TIKA:content_handler : ToXMLContentHandler
24: tiff:ImageLength : 278
24: Compression CompressionTypeName : deflate
24: Data BitsPerSample : 8 8 8
24: Data PlanarConfiguration : PixelInterleaved
24: Dimension VerticalPixelSize : 0.26462027
24: IHDR : width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
24: Chroma ColorSpaceType : RGB
24: tiff:BitsPerSample : 8 8 8
24: Content-Type : image/png
24: height : 278
24: gAMA : 45455
24: X-Parsed-By : org.apache.tika.parser.DefaultParser
24: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
24: X-Parsed-By : org.apache.tika.parser.image.ImageParser
24: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
24: Chroma Gamma : 0.45455
24: Dimension PixelAspectRatio : 1.0
24: sRGB : Perceptual
24: Compression NumProgressiveScans : 1
24: Dimension HorizontalPixelSize : 0.26462027
24: Chroma BlackIsZero : true
24: Compression Lossless : true
24: X-TIKA:embedded_depth : 1
24: width : 453
24: X-TIKA:parse_time_millis : 269
24: Dimension ImageOrientation : Normal
24: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="453" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-24" />
<meta name="tiff:ImageWidth" content="453" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

Don't forget to buy milk
on the way home!

 

 

 

 

 

 

 

 

 

 

Ee
es

 
</div>
</body></html>
24: X-TIKA:embedded_resource_path : /embedded-24
24: tiff:ImageWidth : 453
24: Chroma NumChannels : 3
24: Data SampleFormat : UnsignedIntegral
25: Transparency Alpha : none
25: X-TIKA:content_handler : ToXMLContentHandler
25: tiff:ImageLength : 278
25: Compression CompressionTypeName : deflate
25: Data BitsPerSample : 8 8 8
25: Data PlanarConfiguration : PixelInterleaved
25: Dimension VerticalPixelSize : 0.26462027
25: IHDR : width=454, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
25: Chroma ColorSpaceType : RGB
25: tiff:BitsPerSample : 8 8 8
25: Content-Type : image/png
25: height : 278
25: gAMA : 45455
25: X-Parsed-By : org.apache.tika.parser.DefaultParser
25: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
25: X-Parsed-By : org.apache.tika.parser.image.ImageParser
25: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
25: Chroma Gamma : 0.45455
25: Dimension PixelAspectRatio : 1.0
25: sRGB : Perceptual
25: Compression NumProgressiveScans : 1
25: Dimension HorizontalPixelSize : 0.26462027
25: Chroma BlackIsZero : true
25: Compression Lossless : true
25: X-TIKA:embedded_depth : 1
25: width : 454
25: X-TIKA:parse_time_millis : 205
25: Dimension ImageOrientation : Normal
25: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=454, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="454" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-25" />
<meta name="tiff:ImageWidth" content="454" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

il

 

 

 

 

 

the report
looks gre™ at!

 

 
</div>
</body></html>
25: X-TIKA:embedded_resource_path : /embedded-25
25: tiff:ImageWidth : 454
25: Chroma NumChannels : 3
25: Data SampleFormat : UnsignedIntegral
26: Transparency Alpha : none
26: X-TIKA:content_handler : ToXMLContentHandler
26: tiff:ImageLength : 92
26: Compression CompressionTypeName : deflate
26: Data BitsPerSample : 8 8 8
26: Data PlanarConfiguration : PixelInterleaved
26: Dimension VerticalPixelSize : 0.26462027
26: IHDR : width=167, height=92, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
26: Chroma ColorSpaceType : RGB
26: tiff:BitsPerSample : 8 8 8
26: Content-Type : image/png
26: height : 92
26: gAMA : 45455
26: X-Parsed-By : org.apache.tika.parser.DefaultParser
26: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
26: X-Parsed-By : org.apache.tika.parser.image.ImageParser
26: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
26: Chroma Gamma : 0.45455
26: Dimension PixelAspectRatio : 1.0
26: sRGB : Perceptual
26: Compression NumProgressiveScans : 1
26: Dimension HorizontalPixelSize : 0.26462027
26: Chroma BlackIsZero : true
26: Compression Lossless : true
26: X-TIKA:embedded_depth : 1
26: width : 167
26: X-TIKA:parse_time_millis : 188
26: Dimension ImageOrientation : Normal
26: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="92" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=167, height=92, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="92" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="167" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-26" />
<meta name="tiff:ImageWidth" content="167" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">in your taskbar

 

 

+N on your keyboard
</div>
</body></html>
26: X-TIKA:embedded_resource_path : /embedded-26
26: tiff:ImageWidth : 167
26: Chroma NumChannels : 3
26: Data SampleFormat : UnsignedIntegral
27: Transparency Alpha : none
27: X-TIKA:content_handler : ToXMLContentHandler
27: tiff:ImageLength : 89
27: Compression CompressionTypeName : deflate
27: Data BitsPerSample : 8 8 8
27: Data PlanarConfiguration : PixelInterleaved
27: Dimension VerticalPixelSize : 0.26462027
27: IHDR : width=164, height=89, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
27: Chroma ColorSpaceType : RGB
27: tiff:BitsPerSample : 8 8 8
27: Content-Type : image/png
27: height : 89
27: gAMA : 45455
27: X-Parsed-By : org.apache.tika.parser.DefaultParser
27: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
27: X-Parsed-By : org.apache.tika.parser.image.ImageParser
27: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
27: Chroma Gamma : 0.45455
27: Dimension PixelAspectRatio : 1.0
27: sRGB : Perceptual
27: Compression NumProgressiveScans : 1
27: Dimension HorizontalPixelSize : 0.26462027
27: Chroma BlackIsZero : true
27: Compression Lossless : true
27: X-TIKA:embedded_depth : 1
27: width : 164
27: X-TIKA:parse_time_millis : 192
27: Dimension ImageOrientation : Normal
27: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="89" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=164, height=89, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="89" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="164" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-27" />
<meta name="tiff:ImageWidth" content="164" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">in your taskbar

 

 

+S on your keyboard
</div>
</body></html>
27: X-TIKA:embedded_resource_path : /embedded-27
27: tiff:ImageWidth : 164
27: Chroma NumChannels : 3
27: Data SampleFormat : UnsignedIntegral
28: Transparency Alpha : none
28: X-TIKA:content_handler : ToXMLContentHandler
28: tiff:ImageLength : 278
28: Compression CompressionTypeName : deflate
28: Data BitsPerSample : 8 8 8
28: Data PlanarConfiguration : PixelInterleaved
28: Dimension VerticalPixelSize : 0.26462027
28: IHDR : width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
28: Chroma ColorSpaceType : RGB
28: tiff:BitsPerSample : 8 8 8
28: Content-Type : image/png
28: height : 278
28: gAMA : 45455
28: X-Parsed-By : org.apache.tika.parser.DefaultParser
28: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
28: X-Parsed-By : org.apache.tika.parser.image.ImageParser
28: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
28: Chroma Gamma : 0.45455
28: Dimension PixelAspectRatio : 1.0
28: sRGB : Perceptual
28: Compression NumProgressiveScans : 1
28: Dimension HorizontalPixelSize : 0.26462027
28: Chroma BlackIsZero : true
28: Compression Lossless : true
28: X-TIKA:embedded_depth : 1
28: width : 453
28: X-TIKA:parse_time_millis : 207
28: Dimension ImageOrientation : Normal
28: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="453" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-28" />
<meta name="tiff:ImageWidth" content="453" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

e -

ge Description

Pricing

 

 

$249.99-599.99 |

 

 

 
</div>
</body></html>
28: X-TIKA:embedded_resource_path : /embedded-28
28: tiff:ImageWidth : 453
28: Chroma NumChannels : 3
28: Data SampleFormat : UnsignedIntegral
29: Transparency Alpha : none
29: X-TIKA:content_handler : ToXMLContentHandler
29: tiff:ImageLength : 278
29: Compression CompressionTypeName : deflate
29: Data BitsPerSample : 8 8 8
29: Data PlanarConfiguration : PixelInterleaved
29: Dimension VerticalPixelSize : 0.26462027
29: IHDR : width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
29: Chroma ColorSpaceType : RGB
29: tiff:BitsPerSample : 8 8 8
29: Content-Type : image/png
29: height : 278
29: gAMA : 45455
29: X-Parsed-By : org.apache.tika.parser.DefaultParser
29: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
29: X-Parsed-By : org.apache.tika.parser.image.ImageParser
29: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
29: Chroma Gamma : 0.45455
29: Dimension PixelAspectRatio : 1.0
29: sRGB : Perceptual
29: Compression NumProgressiveScans : 1
29: Dimension HorizontalPixelSize : 0.26462027
29: Chroma BlackIsZero : true
29: Compression Lossless : true
29: X-TIKA:embedded_depth : 1
29: width : 453
29: X-TIKA:parse_time_millis : 282
29: Dimension ImageOrientation : Normal
29: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="453" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-29" />
<meta name="tiff:ImageWidth" content="453" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

EE retreat

Attending? Overnight?__Vegetarian?

 

N &gt;

Stacy No No

 

 

 
</div>
</body></html>
29: X-TIKA:embedded_resource_path : /embedded-29
29: tiff:ImageWidth : 453
29: Chroma NumChannels : 3
29: Data SampleFormat : UnsignedIntegral
30: Transparency Alpha : none
30: X-TIKA:content_handler : ToXMLContentHandler
30: tiff:ImageLength : 85
30: Compression CompressionTypeName : deflate
30: Data BitsPerSample : 8 8 8
30: Data PlanarConfiguration : PixelInterleaved
30: Dimension VerticalPixelSize : 0.26462027
30: IHDR : width=213, height=85, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
30: Chroma ColorSpaceType : RGB
30: tiff:BitsPerSample : 8 8 8
30: Content-Type : image/png
30: height : 85
30: gAMA : 45455
30: X-Parsed-By : org.apache.tika.parser.DefaultParser
30: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
30: X-Parsed-By : org.apache.tika.parser.image.ImageParser
30: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
30: Chroma Gamma : 0.45455
30: Dimension PixelAspectRatio : 1.0
30: sRGB : Perceptual
30: Compression NumProgressiveScans : 1
30: Dimension HorizontalPixelSize : 0.26462027
30: Chroma BlackIsZero : true
30: Compression Lossless : true
30: X-TIKA:embedded_depth : 1
30: width : 213
30: X-TIKA:parse_time_millis : 179
30: Dimension ImageOrientation : Normal
30: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="85" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=213, height=85, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="85" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="213" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-30" />
<meta name="tiff:ImageWidth" content="213" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">
</div>
</body></html>
30: X-TIKA:embedded_resource_path : /embedded-30
30: tiff:ImageWidth : 213
30: Chroma NumChannels : 3
30: Data SampleFormat : UnsignedIntegral
31: Transparency Alpha : none
31: X-TIKA:content_handler : ToXMLContentHandler
31: tiff:ImageLength : 278
31: Compression CompressionTypeName : deflate
31: Data BitsPerSample : 8 8 8
31: Data PlanarConfiguration : PixelInterleaved
31: Dimension VerticalPixelSize : 0.26462027
31: IHDR : width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
31: Chroma ColorSpaceType : RGB
31: tiff:BitsPerSample : 8 8 8
31: Content-Type : image/png
31: height : 278
31: gAMA : 45455
31: X-Parsed-By : org.apache.tika.parser.DefaultParser
31: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
31: X-Parsed-By : org.apache.tika.parser.image.ImageParser
31: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
31: Chroma Gamma : 0.45455
31: Dimension PixelAspectRatio : 1.0
31: sRGB : Perceptual
31: Compression NumProgressiveScans : 1
31: Dimension HorizontalPixelSize : 0.26462027
31: Chroma BlackIsZero : true
31: Compression Lossless : true
31: X-TIKA:embedded_depth : 1
31: width : 453
31: X-TIKA:parse_time_millis : 175
31: Dimension ImageOrientation : Normal
31: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="453" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-31" />
<meta name="tiff:ImageWidth" content="453" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

 

 
</div>
</body></html>
31: X-TIKA:embedded_resource_path : /embedded-31
31: tiff:ImageWidth : 453
31: Chroma NumChannels : 3
31: Data SampleFormat : UnsignedIntegral
32: Transparency Alpha : none
32: X-TIKA:content_handler : ToXMLContentHandler
32: tiff:ImageLength : 278
32: Compression CompressionTypeName : deflate
32: Data BitsPerSample : 8 8 8
32: Data PlanarConfiguration : PixelInterleaved
32: Dimension VerticalPixelSize : 0.26462027
32: IHDR : width=452, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
32: Chroma ColorSpaceType : RGB
32: tiff:BitsPerSample : 8 8 8
32: Content-Type : image/png
32: height : 278
32: gAMA : 45455
32: X-Parsed-By : org.apache.tika.parser.DefaultParser
32: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
32: X-Parsed-By : org.apache.tika.parser.image.ImageParser
32: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
32: Chroma Gamma : 0.45455
32: Dimension PixelAspectRatio : 1.0
32: sRGB : Perceptual
32: Compression NumProgressiveScans : 1
32: Dimension HorizontalPixelSize : 0.26462027
32: Chroma BlackIsZero : true
32: Compression Lossless : true
32: X-TIKA:embedded_depth : 1
32: width : 452
32: X-TIKA:parse_time_millis : 408
32: Dimension ImageOrientation : Normal
32: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=452, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="452" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-32" />
<meta name="tiff:ImageWidth" content="452" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

 

 

Sen

&gt; Transportation fh Reservation

+ Arrive at airport at 6am [Ben + Hotel is for the 6 — 10%
+ Plane departs at 8am + Do we need to extend

+ Plane lands at 20m the reservation by a day?

Tom

 

 
</div>
</body></html>
32: X-TIKA:embedded_resource_path : /embedded-32
32: tiff:ImageWidth : 452
32: Chroma NumChannels : 3
32: Data SampleFormat : UnsignedIntegral
33: Transparency Alpha : none
33: X-TIKA:content_handler : ToXMLContentHandler
33: tiff:ImageLength : 278
33: Compression CompressionTypeName : deflate
33: Data BitsPerSample : 8 8 8
33: Data PlanarConfiguration : PixelInterleaved
33: Dimension VerticalPixelSize : 0.26462027
33: IHDR : width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none
33: Chroma ColorSpaceType : RGB
33: tiff:BitsPerSample : 8 8 8
33: Content-Type : image/png
33: height : 278
33: gAMA : 45455
33: X-Parsed-By : org.apache.tika.parser.DefaultParser
33: X-Parsed-By : org.apache.tika.parser.ocr.TesseractOCRParser
33: X-Parsed-By : org.apache.tika.parser.image.ImageParser
33: pHYs : pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter
33: Chroma Gamma : 0.45455
33: Dimension PixelAspectRatio : 1.0
33: sRGB : Perceptual
33: Compression NumProgressiveScans : 1
33: Dimension HorizontalPixelSize : 0.26462027
33: Chroma BlackIsZero : true
33: Compression Lossless : true
33: X-TIKA:embedded_depth : 1
33: width : 453
33: X-TIKA:parse_time_millis : 430
33: Dimension ImageOrientation : Normal
33: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="Transparency Alpha" content="none" />
<meta name="tiff:ImageLength" content="278" />
<meta name="Compression CompressionTypeName" content="deflate" />
<meta name="Data BitsPerSample" content="8 8 8" />
<meta name="Data PlanarConfiguration" content="PixelInterleaved" />
<meta name="Dimension VerticalPixelSize" content="0.26462027" />
<meta name="IHDR" content="width=453, height=278, bitDepth=8, colorType=RGB, compressionMethod=deflate, filterMethod=adaptive, interlaceMethod=none" />
<meta name="Chroma ColorSpaceType" content="RGB" />
<meta name="tiff:BitsPerSample" content="8 8 8" />
<meta name="Content-Type" content="image/png" />
<meta name="height" content="278" />
<meta name="gAMA" content="45455" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.ocr.TesseractOCRParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.image.ImageParser" />
<meta name="pHYs" content="pixelsPerUnitXAxis=3779, pixelsPerUnitYAxis=3779, unitSpecifier=meter" />
<meta name="Chroma Gamma" content="0.45455" />
<meta name="Dimension PixelAspectRatio" content="1.0" />
<meta name="sRGB" content="Perceptual" />
<meta name="Compression NumProgressiveScans" content="1" />
<meta name="Dimension HorizontalPixelSize" content="0.26462027" />
<meta name="Chroma BlackIsZero" content="true" />
<meta name="Compression Lossless" content="true" />
<meta name="X-TIKA:embedded_depth" content="1" />
<meta name="width" content="453" />
<meta name="Dimension ImageOrientation" content="Normal" />
<meta name="X-TIKA:embedded_resource_path" content="/embedded-33" />
<meta name="tiff:ImageWidth" content="453" />
<meta name="Chroma NumChannels" content="3" />
<meta name="Data SampleFormat" content="UnsignedIntegral" />
<title></title>
</head>
<body><div class="ocr">

2.

Shopping list Priorities
Milk [1 Check messages
(J Oranges * Call Dave
CO Potatoes ? Follow up with Jim
Bread ®) Schedule appt.
O cereal FA call Janet

Sugar

 

 

 
</div>
</body></html>
33: X-TIKA:embedded_resource_path : /embedded-33
33: tiff:ImageWidth : 453
33: Chroma NumChannels : 3
33: Data SampleFormat : UnsignedIntegral
{noformat}

> OneNote formats support - Mime Magic and Parser
> -----------------------------------------------
>
>                 Key: TIKA-2224
>                 URL: https://issues.apache.org/jira/browse/TIKA-2224
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.14
>            Reporter: Nick Burch
>            Priority: Major
>         Attachments: Sample1.json, Sample1.one, note-ssn-test-mmmm.one
>
>
> As raised at http://stackoverflow.com/questions/41272195/onenote-support-for-apache-tika-parsers, we don't have any magic for the OneNote formats. Several years ago we dug out the file format specs (see http://lucene.472066.n3.nabble.com/Tika-OneNote-Support-td4020393.html), but didn't have volunteer energy to implement a parser. However, armed with those specs, we should be able to come up with some mime magic for detection



--
This message was sent by Atlassian Jira
(v8.3.4#803005)