【免费】文档歪斜检测方法的论文资源-CSDN文库

文档歪斜

需积分: 0 47 浏览量 2013-01-31 19:51:24 上传评论收藏 506KB PDF 举报

资源详情

资源评论

A DOCUMENT SKEW DETECTION METHOD USING

RUN-LENGTH ENCODING AND THE HOUGH TRANSFORM

Stuart C. Hinds,l James L. Fisher, and Donald

D'Amato

The MITRE CorporationlCivii Systems Division

Systems Engineering and Applied Technology

7525

Colshire Drive

McLean, Virginia 22102-3481,

USA

Abstract

As part of the development of a document image analysis

system, we have devised a method, based on the Hough transform,

for the detection of document skew and interline spacing--necessary

parameters for the automatic segmentation of text from graphics.

Because the Hough transform is computationally expensive, we

reduce the amount of data within a document image through the

computation of its horizontal and vertical black run-lengths.

Histograms of these run-lengths are used

determine whether the

document is in portrait or landscape orientation. A grey scale "burst

image" is created from the black run lengths that are perpendicular

the text lines by placing the length of the

run

the run's bottom-

most pixel. This data reduction procedure decreases the processing

time of the Hough transform and reduces the effects of non-textual

data on the determination of skew and interline spacing.

Introduction

As more and more institutions and federal agencies confront an

overwhelming abundance of paper and microfilm-based information,

it has become increasingly important

convert such information into

an efficiently stored, computer searchable form. Document image

analysis is the process

identifying the components (text and

non-text) of a document's structure and can be divided into three

phases

[l]:

scanning and binarization, 2) segmentation and

labeling of text and figure blocks, and 3) processing of text (usually

by optical character recognition) and figures. Fast top-down

methods, such as the run-length smoothing algorithm (RLSA),

developed by Wong et. al. [2], and projection profile cuts [3], [4],

have been developed for accomplishing phase 2. Unfortunately,

these methods are generally ineffective when applied

skewed

documents. Although other researchers

[5],

(61 have successfully

segmented text from figures in skewed documents, their methods

involve a connected components analysis, which can be

computationally intensive.

We have developed a document image analysis system based

on the approach of Wong et. al. Because Wong's system is limited

aligned documents, we have added a skew detection stage

our

system. While other methods for determining document skew exist

(see

[7]

for a short review), we have based our method on the

Hough transform because of its sensitivity

skew and its

applicability in determining a document's interline spacing--a

necessary parameter for the RLSA. Because the Hough transform

is computationally expensive and slowed by noise, we have

developed a procedure, based on the computation of an image's

vertical black run-lengths,

reduce the amount of data and

minimize the effects of noise and non-textual material within the

original document image.

Current address: University of California at San Diego, Neuropsychology

Research

Lab.,

Childrens Hospital,

8001

Frost

St.,

San Diego,

92123

Review

the Hough Transform and Its

Use

In Document Skew Detection

The Hough transform can be used

detect lines at any

orientation.

consists of mapping points in Cartesian space

(xy)

sinusoidal curves in

ptl

space via the transformation:

xcos(e)

ysin(8).

Each time a sinusoidal curve intersects another at a particular value

and

the likelihood increases that a line corresponding

that

coordinate value is present in the original image. An accumulator

array (consisting of

rows and

columns) is used to count the

number of intersections at various

and

values. Those cells in the

accumulator array with the highest number of counts will correspond

lines in the original image. The basic computational steps in the

Hough transform are as follows:

For (x)

For (Y)

if (pixel is black)

(

For

(8)

(

Calculate

xcos(8)

ysin(8)

increment accumulator array at

p,8

should be noted that the Hough transform is usually applied

binary images; hence the need

check

a pixel is black (i.e., part

the information within the image).

The angular resolution that the Hough transform will detect

depends on how finely the columns of the accumulator array

(corresponding

values) are spaced. The increment size

between

values and the maximum skew angle that should be

detected will in turn affect the computation time of the Hough

transform. In general,

ranges either from

180

-90

degrees in increments of one degree.

The number of rows,

of the accumulator array

(corresponding

values) affects how well the Hough transform

resolves lines. To detect fine lines,

should be such that each xy

point along a straight column can

mapped

a unique row.

Therefore,

detect every point in a rectangular image (with

ranging from

-90

degrees):

(w2

h2)'Q

where w

width and h

height.

Because text lines are actually thick lines of sparse density,

the problem of determining the skew angle of text lines becomes

more difficult than determining the skew angle of fine lines. Two

groups of researchers have developed different methods for

detecting document skew with the Hough transform.

464

CH2898-5/90/0000/0464$01

.OO

1990

IEEE

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余4页未读，立即下载

评论收藏

内容反馈

文档歪斜检测方法的论文

评论0

最新资源

文档歪斜检测方法的论文

评论0

最新资源

相关推荐

基于Shearlet变换的扫描文档图像倾斜检测

解决Win 10与不兼容VirtualBox操作过程文档+（附带软件）.zip

计算机网络知识点总结(谢希仁第八版).pdf

Xshell软件(配色方案&amp;高亮关键字/突出显示集)的相关文件

湖南科技大学《计算机网络》配套课件（PDF版）

windows server 2022镜像ISO

计算机网络第八版（谢希仁）课后习题答案

《计算机网络自顶向下方法第7版》中文PDF+复习题问题中文版答案

scau计算机网络实验

计算机网络自顶向下方法第八版答案

postman 离线登录版

PPT模板（科技风、商务风）

quickping下载

《计算机网络：自顶向下方法》第八版 PPT 第一章 计算机网络和因特网

hevc视频扩展免费2.0.53348.0x64

2023国赛 网络建设与运维正式赛卷

华为Ensp win10 win11 四款软件兼容性安装 解决Ensp 启动设备AR1失败 错误代码40 41

matlab输入到abaqus

BLE蓝牙调试助手，Win10桌面工具，exe

南京邮电大学交换技术与通信网 MPLS基本配置实验报告（最新）

CANoe10.0的安装步骤.pdf

网络攻防原理与技术第3版课后习题参考答案

win10破解多用户远程登录桌面补丁

计算机网络期末复习电子版资料（谢希仁第8版）

网络工程师笔记.pdf

华为ICT实践赛 网络赛道备考资料 题库永久更新

Ansys/Workbench常用材料查询表

计算机网络原理(谢希仁第八版)课后习题答案

Xshell软件(配色方案&高亮关键字/突出显示集)的相关文件

《计算机网络：自顶向下方法》第八版 PPT 第一章计算机网络和因特网

2023国赛网络建设与运维正式赛卷

华为Ensp win10 win11 四款软件兼容性安装解决Ensp 启动设备AR1失败错误代码40 41

华为ICT实践赛网络赛道备考资料题库永久更新