Entfernen von Hintergrundrauschen aus dem Captcha-Bild mit PYTHON PIL

Question

Mar 10, 2013, 07:17 AM

algorithm captcha python image-processing python-imaging-library

Entfernen von Hintergrundrauschen aus dem Captcha-Bild mit PYTHON PIL

Ich habe ein verarbeitetes Captcha-Bild (vergrößert), das wie folgt aussieht:

Wie Sie sehen, ist die Schriftgröße des "TEXT" etwas größer als die Breite der lauten Linien.
Also brauche ich einen Algorithmus oder Code, um die verrauschten Linien von diesem Bild zu entfernen.

Mit Hilfe der Python PIL Library und des unten genannten Chopping-Algorithmus erhalte ich kein Ausgabebild, das von OCRs leicht gelesen werden kann.

Hier ist der Python-Code, den ich ausprobiert habe:

import PIL.Image
import sys

# python chop.py [chop-factor] [in-file] [out-file]

chop = int(sys.argv[1])
image = PIL.Image.open(sys.argv[2]).convert('1')
width, height = image.size
data = image.load()

# Iterate through the rows.
for y in range(height):
    for x in range(width):

        # Make sure we're on a dark pixel.
        if data[x, y] > 128:
            continue

        # Keep a total of non-white contiguous pixels.
        total = 0

        # Check a sequence ranging from x to image.width.
        for c in range(x, width):

            # If the pixel is dark, add it to the total.
            if data[c, y] < 128:
                total += 1

            # If the pixel is light, stop the sequence.
            else:
                break

        # If the total is less than the chop, replace everything with white.
        if total <= chop:
            for c in range(total):
                data[x + c, y] = 255

        # Skip this sequence we just altered.
        x += total


# Iterate through the columns.
for x in range(width):
    for y in range(height):

        # Make sure we're on a dark pixel.
        if data[x, y] > 128:
            continue

        # Keep a total of non-white contiguous pixels.
        total = 0

        # Check a sequence ranging from y to image.height.
        for c in range(y, height):
            # If the pixel is dark, add it to the total.
            if data[x, c] < 128:
                total += 1

            # If the pixel is light, stop the sequence.
            else:
                break

        # If the total is less than the chop, replace everything with white.
        if total <= chop:
            for c in range(total):
                data[x, y + c] = 255

        # Skip this sequence we just altered.
        y += total

image.save(sys.argv[3])

Im Grunde möchte ich einen besseren Algorithmus / Code kennen, um das Rauschen loszuwerden und so das Bild für die OCR lesbar zu machen (Tesseract oder Pytesser).