6

Does anybody have a suggestion for a java library that performs automatic cropping and deskewing of images (like those retrieved from a flatbed scanner)?

skaffman
  • 390,936
  • 96
  • 800
  • 764
Chris Rauber
  • 81
  • 1
  • 1
  • 5

5 Answers5

6

Deskewing

Take a look at Tess4j (Java JNA wrapper for Tesseract).

You can combine ImageDeskew.getSkewAngle() with ImageHelper.rotate(BufferedImage image, double angle).

There is an example on how to use it on the test folder of the tess4j project Tesseract1Test.java

public void testDoOCR_SkewedImage() throws Exception {
    logger.info("doOCR on a skewed PNG image");
    File imageFile = new File(this.testResourcesDataPath, "eurotext_deskew.png");
    BufferedImage bi = ImageIO.read(imageFile);
    ImageDeskew id = new ImageDeskew(bi);
    double imageSkewAngle = id.getSkewAngle(); // determine skew angle
    if ((imageSkewAngle > MINIMUM_DESKEW_THRESHOLD || imageSkewAngle < -(MINIMUM_DESKEW_THRESHOLD))) {
        bi = ImageHelper.rotateImage(bi, -imageSkewAngle); // deskew image
    }

    String expResult = "The (quick) [brown] {fox} jumps!\nOver the $43,456.78 <lazy> #90 dog";
    String result = instance.doOCR(bi);
    logger.info(result);
    assertEquals(expResult, result.substring(0, expResult.length()));
}

eurotext_deskew.png

delkant
  • 2,264
  • 1
  • 28
  • 26
4

ImageMagick can do that; you can use the ImageMagick Java bindings. The auto-crop operator is probably what you're looking for. Automatic deskewing is a much harder problem and involves some significant image processing; I'm not sure if ImageMagick can handle that. If you can figure out the skewing parameters using something else, ImageMagick can definitely unskew it for you.

Adam Rosenfield
  • 375,615
  • 96
  • 501
  • 581
3

I wrote a note that simple port of a very good deskewer. It works best if you have some text in the image.

Mahmoud Mabrok
  • 1,238
  • 15
  • 22
anydoby
  • 359
  • 2
  • 8
0

I've written a simple image deskew app, includes source. Available at:

http://www.recognition-software.com/image/deskew/

0

I'd imagine that someone has built a library on top of the Java Advanced Imaging API for doing this. You could try Googling for "Java Advanced Imaging deskew".

Matt Passell
  • 4,217
  • 3
  • 23
  • 37