<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    <p>Good to hear. Glad I could give back.<br>
    </p>
    <div class="moz-cite-prefix">On 9/16/2019 11:27 AM, Laura Brody
      wrote:<br>
    </div>
    <blockquote type="cite"
cite="mid:CAMDnY5OB-VdLwvoByJncSyomP=KKehDFxrmzNPSf6yi8DbH1kw@mail.gmail.com">
      <meta http-equiv="content-type" content="text/html; charset=UTF-8">
      <div dir="ltr">
        <div>Just an update.... I got it all working on a Raspberry Pi 3
          B+ with a 32 GB micro SD chip. Debian Linux. I wrote a script
          to upload the processed file to Dropbox automatically. It is
          happily working on files and will probably be done in a week
          or less (it has about 55 files to process, some with 2-6
          pages, but most with 50-130 pages).</div>
        <div><br>
        </div>
        <div>All of the software was free. I already had a few Raspberry
          Pi boards, so my only investment was my time.</div>
        <div><br>
        </div>
        <div>Thank you so much for pointing me in the right direction.
          Left to my own devices, I would still be researching how to
          tackle this project.</div>
        <div><br>
        </div>
        <div>Laura Brody</div>
      </div>
      <br>
      <div class="gmail_quote">
        <div dir="ltr" class="gmail_attr">On Mon, Sep 9, 2019 at 10:44
          PM Laura Brody <<a href="mailto:laura.k.brody@gmail.com"
            moz-do-not-send="true">laura.k.brody@gmail.com</a>>
          wrote:<br>
        </div>
        <blockquote class="gmail_quote" style="margin:0px 0px 0px
          0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
          <div dir="ltr">
            <div>Yes, I see that. Now that I know that PDFsandwhich and
              tesseract will run on the Raspberry Pi and do what I need,
              I have a clear idea what I need to do to get searchable
              PDFs out of the files that I have. Thank you for pointing
              me in the right direction. You saved me a boatload of time
              and aggravation.</div>
            <div><br>
            </div>
            <div>Laura Brody<br>
            </div>
          </div>
          <br>
          <div class="gmail_quote">
            <div dir="ltr" class="gmail_attr">On Mon, Sep 9, 2019 at
              10:38 PM Cesar Baquerizo <<a
                href="mailto:ces@cescom.com" target="_blank"
                moz-do-not-send="true">ces@cescom.com</a>> wrote:<br>
            </div>
            <blockquote class="gmail_quote" style="margin:0px 0px 0px
              0.8ex;border-left:1px solid
              rgb(204,204,204);padding-left:1ex">
              <div>
                <div>
                  <div>
                    <div>
                      <div style="direction:ltr">Yw. You’ll also need
                        tesseract. They are two different Sw. Let me
                        know how it goes.
                      </div>
                    </div>
                    <div><br>
                    </div>
                    <div
class="gmail-m_-84511995742781478gmail-m_6089203737040733023ms-outlook-ios-signature">Get
                      <a href="https://aka.ms/o0ukef" target="_blank"
                        moz-do-not-send="true">Outlook for iOS</a></div>
                  </div>
                  <div> </div>
                  <hr style="display:inline-block;width:98%">
                  <div
                    id="gmail-m_-84511995742781478gmail-m_6089203737040733023divRplyFwdMsg"
                    dir="dir="ltr""><font
                      style="font-size:11pt" face="Calibri, sans-serif"
                      color="#000000"><b>From:</b> Laura Brody <<a
                        href="mailto:laura.k.brody@gmail.com"
                        target="_blank" moz-do-not-send="true">laura.k.brody@gmail.com</a>><br>
                      <b>Sent:</b> Monday, September 9, 2019 10:35 PM<br>
                      <b>To:</b> Cesar Baquerizo; Filepro_List<br>
                      <b>Subject:</b> Re: OT: Help getting PDF to OCR or
                      searchable form
                      <div> </div>
                    </font></div>
                  <div dir="ltr">
                    <div>I found a list of Linux flavors that
                      PDFsandwhich has been ported to and Raspberrian
                      Linux was on the list!</div>
                    <div><br>
                    </div>
                    <div>I will be be working on this project tomorrow.
                      Thank you so much for this lead. I don't think I
                      would have found it by myself.</div>
                    <div><br>
                    </div>
                    <div>Laura Brody<br>
                    </div>
                  </div>
                  <br>
                  <div class="gmail_quote">
                    <div dir="ltr" class="gmail_attr">On Mon, Sep 9,
                      2019 at 10:27 PM Laura Brody <<a
                        href="mailto:laura.k.brody@gmail.com"
                        target="_blank" moz-do-not-send="true">laura.k.brody@gmail.com</a>>
                      wrote:<br>
                    </div>
                    <blockquote class="gmail_quote" style="margin:0px
                      0px 0px 0.8ex;border-left:1px solid
                      rgb(204,204,204);padding-left:1ex">
                      <div dir="ltr">
                        <div>This is very interesting.</div>
                        <div><br>
                        </div>
                        <div>The only Linux box I have running at the
                          moment is Raspberry Pi 3 B+. I have 64GB SD
                          card available, so space isn't an issue. Any
                          idea if it will work on it?</div>
                        <div><br>
                        </div>
                        <div>Laura Brody<br>
                        </div>
                      </div>
                      <br>
                      <div class="gmail_quote">
                        <div dir="ltr" class="gmail_attr">On Mon, Sep 9,
                          2019 at 9:54 PM Cesar Baquerizo <<a
                            href="mailto:ces@cescom.com" target="_blank"
                            moz-do-not-send="true">ces@cescom.com</a>>
                          wrote:<br>
                        </div>
                        <blockquote class="gmail_quote"
                          style="margin:0px 0px 0px
                          0.8ex;border-left:1px solid
                          rgb(204,204,204);padding-left:1ex">
                          <div>
                            <div>
                              <div>
                                <div>
                                  <div style="direction:ltr">Lookup
                                    Tesseract and Pdfsandwich. It may
                                    help you. </div>
                                </div>
                                <div><br>
                                </div>
                                <div
class="gmail-m_-84511995742781478gmail-m_6089203737040733023gmail-m_8760251007502064830gmail-m_2990805021006981543ms-outlook-ios-signature">Get
                                  <a href="https://aka.ms/o0ukef"
                                    target="_blank"
                                    moz-do-not-send="true">Outlook for
                                    iOS</a></div>
                              </div>
                              <div> </div>
                              <hr style="display:inline-block;width:98%">
                              <div
id="gmail-m_-84511995742781478gmail-m_6089203737040733023gmail-m_8760251007502064830gmail-m_2990805021006981543divRplyFwdMsg"
                                dir="dir="ltr"">
                                <font style="font-size:11pt"
                                  face="Calibri, sans-serif"
                                  color="#000000"><b>From:</b>
                                  Filepro-list
                                  <filepro-list-bounces+ces=<a
                                    href="mailto:cescom.com@lists.celestial.com"
                                    target="_blank"
                                    moz-do-not-send="true">cescom.com@lists.celestial.com</a>>
                                  on behalf of Laura Brody via
                                  Filepro-list <<a
                                    href="mailto:filepro-list@lists.celestial.com"
                                    target="_blank"
                                    moz-do-not-send="true">filepro-list@lists.celestial.com</a>><br>
                                  <b>Sent:</b> Monday, September 9, 2019
                                  9:50 PM<br>
                                  <b>To:</b> Filepro_List<br>
                                  <b>Cc:</b> Laura Brody<br>
                                  <b>Subject:</b> Re: OT: Help getting
                                  PDF to OCR or searchable form
                                  <div> </div>
                                </font></div>
                              Additional information.... <br>
                              <br>
                              I talked to the user and got some
                              history... <br>
                              <br>
                              The user scanned in legal documents. Saved
                              the images as pages in a PDF. <br>
                              That is why I can't search on keywords for
                              most of the files. A few files <br>
                              were typed up and then exported as PDF.
                              most are images of the pages. That <br>
                              means that OCR has to be part of the
                              solution. <br>
                              <br>
                              I discovered that Adobe Acobat Reader has
                              a setting to search all PDFs in a <br>
                              directory for keywords. The problem is
                              that these files don't contain text. <br>
                              They contain images of text. Adobe can't
                              search images and find keywords. <br>
                              <br>
                              Laura Brody <br>
                              <br>
                              On Mon, Sep 9, 2019 at 8:03 PM Laura Brody
                              <<a
                                href="mailto:laura.k.brody@gmail.com"
                                target="_blank" moz-do-not-send="true">laura.k.brody@gmail.com</a>>
                              wrote:
                              <br>
                              <br>
                              > I am hoping that one of you has
                              solved this problem before..... <br>
                              > <br>
                              > I have over a thousand pages of text
                              in a dozen or so PDF files. Most <br>
                              > files are "read-only" and I can not
                              do Ctrl-F to search for keywords. I <br>
                              > would like to be able to OCR the
                              files and put everything into one file <br>
                              > that is searchable. Or is there a
                              utility that will search all of the PDFs <br>
                              > in a directory for a keyword? <br>
                              > <br>
                              > Suggestions anyone? <br>
                              > <br>
                              > Laura Brody <br>
                              > <br>
                              -------------- next part -------------- <br>
                              An HTML attachment was scrubbed... <br>
                              URL: <<a
href="http://mailman.celestial.com/pipermail/filepro-list/attachments/20190909/935e0f40/attachment.html"
                                target="_blank" moz-do-not-send="true">http://mailman.celestial.com/pipermail/filepro-list/attachments/20190909/935e0f40/attachment.html</a>>
                              <br>
_______________________________________________ <br>
                              Filepro-list mailing list <br>
                              <a
                                href="mailto:Filepro-list@lists.celestial.com"
                                target="_blank" moz-do-not-send="true">Filepro-list@lists.celestial.com</a>
                              <br>
                              Subscribe/Unsubscribe/Subscription Changes
                              <br>
                              <a
                                href="http://mailman.celestial.com/mailman/listinfo/filepro-list"
                                target="_blank" moz-do-not-send="true">http://mailman.celestial.com/mailman/listinfo/filepro-list</a>
                              <br>
                            </div>
                          </div>
                        </blockquote>
                      </div>
                    </blockquote>
                  </div>
                </div>
              </div>
            </blockquote>
          </div>
        </blockquote>
      </div>
    </blockquote>
    <div class="moz-signature">-- <br>
      <img src="cid:part14.09D40B22.79025816@cescom.com" border="0"></div>
  </body>
</html>