基于选择性搜索（SelectiveSearch）选择候选区域.zip资源-CSDN文库

共3个文件

py：2个

pdf：1个

版权申诉

Pytorch

197 浏览量 2023-08-23 09:06:54 上传评论收藏 5.6MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

基于选择性搜索（Selective Search）选择候选区域.zip （3个子文件）

main.py 2KB

SelectiveSearch.py 13KB

selectiveSearchDraft.pdf 5.66MB

Selective Search for Object Recognition

J.R.R. Uijlings

∗1,2

, K.E.A. van de Sande

†2

, T. Gevers

, and A.W.M. Smeulders

University of Trento, Italy

University of Amsterdam, the Netherlands

Technical Report 2012, submitted to IJCV

Abstract

This paper addresses the problem of generating possible object lo-

cations for use in object recognition. We introduce Selective Search

which combines the strength of both an exhaustive search and seg-

mentation. Like segmentation, we use the image structure to guide

our sampling process. Like exhaustive search, we aim to capture

all possible object locations. Instead of a single technique to gen-

erate possible object locations, we diversify our search and use a

variety of complementary image partitionings to deal with as many

image conditions as possible. Our Selective Search results in a

small set of data-driven, class-independent, high quality locations,

yielding 99% recall and a Mean Average Best Overlap of 0.879 at

10,097 locations. The reduced number of locations compared to

an exhaustive search enables the use of stronger machine learning

techniques and stronger appearance models for object recognition.

In this paper we show that our selective search enables the use of

the powerful Bag-of-Words model for recognition. The Selective

Search software is made publicly available

1 Introduction

For a long time, objects were sought to be delineated before their

identiﬁcation. This gave rise to segmentation, which aims for

a unique partitioning of the image through a generic algorithm,

where there is one part for all object silhouettes in the image. Re-

search on this topic has yielded tremendous progress over the past

years [3, 6, 13, 26]. But images are intrinsically hierarchical: In

Figure 1a the s alad and spoons are inside t he salad bowl, which in

turn stands on the table. Furthermore, depending on the context the

term table in this picture can refer to only the wood or include ev-

erything on the table. Therefore both the nature of images and the

different uses of an object category are hierarchical. This prohibits

the unique partitioning of objects for all but the most speciﬁc pur-

poses. Hence for most tasks multiple scales in a segmentation are a

necessity. This is most naturally addressed by using a hierarchical

partitioning, as done for example by Arbelaez et al. [3].

Besides that a segmentation should be hierarchical, a generic so-

lution for segmentation using a single strategy may not exist at all.

There are many conﬂicting reasons why a region should be grouped

together: In Figure 1b the cats can be separated using colour, but

their texture is the same. Conversely, in Figure 1c the chameleon

∗

[email protected]

†

[email protected]

http://disi.unitn.it/

uijlings/SelectiveSearch.html

(a) (b)

Figure 1: There is a high variety of reasons that an image region

forms an object. In (b) the cats can be distinguished by colour, not

texture. In (c) the chameleon can be distinguished from the sur-

rounding leaves by texture, not colour. In (d) the wheels can be part

of the car because they are enclosed, not because they are similar

in texture or colour. Therefore, to ﬁnd objects in a structured way

it is necessary to use a variety of diverse strategies. Furthermore,

an image is intrinsically hierarchical as there is no single scale for

which the complete table, salad bowl, and salad spoon can be found

in (a).

is similar to its surrounding leaves in terms of colour, yet its tex-

ture differs. Finally, in Figure 1d, the wheels are wildly different

from the car in terms of both colour and texture, yet are enclosed

by the car. Individual visual features therefore cannot resolve the

ambiguity of segmentation.

And, ﬁnally, there is a more fundamental problem. Regions with

very different characteristics, such as a face over a sweater, can

only be combined i nto one object after it has been established that

the object at hand is a human. Hence without prior recognition it is

hard to decide that a face and a sweater are part of one object [29].

This has led to the opposite of the traditional approach: to do

localisation through the identiﬁcation of an object. This recent ap-

proach in object recognition has made enormous progress in less

than a decade [8, 12, 16, 35]. With an appearance model learned

from examples, an exhaustive search is performed where every lo-

cation within the image is examined as to not miss any potential

object location [8, 12, 16, 35].

However, the exhaustive search itself has several drawbacks.

Searching every possible location is computationally infeasible.

The search space has to be reduced by using a regular grid, ﬁxed

scales, and ﬁxed aspect ratios. In most cases the number of lo-

cations to visit remains huge, so much that alternative restrictions

need to be imposed. The classiﬁer is simpliﬁed and the appearance

model needs to be fast. Furthermore, a uniform sampling yields

many boxes for which it is immediately clear that they are not sup-

portive of an object. Rather then sampling locations blindly using

an exhaustive search, a key question is: Can we steer the sampling

by a data-driven analysis?

In this paper, we aim to combine the best of the intuitions of seg-

mentation and exhaustive search and propose a data-driven selec-

tive search. Inspired by bottom-up segmentation, we aim to exploit

the structure of the image to generate object locations. Inspired by

exhaustive search, we aim to capture all possible object locations.

Therefore, instead of using a single sampling technique, we aim

to diversify the sampling techniques to account for as many image

conditions as possible. Speciﬁcally, we use a data-driven grouping-

based strategy where we increase diversity by using a variety of

complementary grouping criteria and a variety of complementary

colour spaces with different invariance properties. The set of lo-

cations is obtained by combining the locations of these comple-

mentary partitionings. Our goal is to generate a class-independent,

data-driven, selective search strategy that generates a small set of

high-quality object locations.

Our application domain of selective search is object recognition.

We therefore evaluate on the most commonly used dataset for this

purpose, the Pascal VOC detection challenge which consists of 20

object classes. The size of this dataset yields computational con-

straints for our selective search. Furthermore, the use of this dataset

means that the quality of locations is mainly evaluated in terms of

bounding boxes. However, our selective search applies to regions

as well and is also applicable to concepts such as “grass”.

In this paper we propose selective search for object recognition.

Our main research questions are: (1) What are good diversiﬁcation

strategies for adapting segmentation as a selective search strategy?

(2) How effective is selective search in creating a small set of high-

quality locations within an image? (3) Can we use selective search

to employ more powerful classiﬁers and appearance models for ob-

ject recognition?

2 Related Work

We conﬁne the related work to the domain of object recognition

and divide it into three categories: Exhaustive search, segmenta-

tion, and other sampling strategies that do not fall in either cate-

gory.

2.1 Exhaustive Search

As an object can be located at any position and scale in the image,

it is natural to search everywhere [8, 16, 36]. However, the visual

search space is huge, making an exhaustive search computationally

expensive. This imposes constraints on the evaluation cost per lo-

cation and/or the number of locations considered. Hence most of

these sliding window techniques use a coarse search grid and ﬁxed

aspect ratios, using weak classiﬁers and economic image features

such as HOG [8, 16, 36]. This method is often used as a preselec-

tion step in a cascade of classiﬁers [16, 36].

Related to the sliding window technique is the highly successful

part-based object localisation method of Felzenszwalb et al. [12].

Their method also performs an exhaustive search using a linear

SVM and HOG features. However, they search for objects and

object parts, whose combination results in an impressive object de-

tection performance.

Lampert et al. [17] proposed using the appearance model to

guide the search. This both alleviates the constraints of using a

regular grid, ﬁxed scales, and ﬁxed aspect ratio, while at the same

time reduces the number of locations visited. This is done by di-

rectly searching for the optimal window within the image using a

branch and bound technique. While they obtain impressive results

for linear classiﬁers, [1] found that for non-linear classiﬁers the

method in practice still visits over a 100,000 windows per image.

Instead of a blind exhaustive search or a branch and bound

search, we propose selective search. We use t he underlying im-

age structure to generate object locations. In contrast to the dis-

cussed methods, this yields a completely class-independent set of

locations. Furthermore, because we do not use a ﬁxed aspect ra-

tio, our method is not limited to objects but should be able to ﬁnd

stuff like “grass” and “sand” as well (this also holds for [17]). Fi-

nally, we hope to generate fewer locations, which should make the

problem easier as the variability of samples becomes lower. And

more importantly, it frees up computational power which can be

used for stronger machine learning techniques and more powerful

appearance models.

2.2 Segmentation

Both Carreira and Sminchisescu [4] and Endres and Hoiem [9] pro-

pose to generate a set of class independent object hypotheses using

segmentation. Both methods generate multiple foreground/back-

ground segmentations, learn to predict the likelihood that a fore-

ground segment is a complete object, and use this to rank the seg-

ments. Both algorithms show a promising ability to accurately

delineate objects within images, conﬁrmed by [19] who achieve

state-of-the-art results on pixel-wise image classiﬁcation using [4].

As common in segmentation, both methods rely on a single strong

algorithm for identifying good regions. They obtain a variety of

locations by using many randomly initialised foreground and back-

ground seeds. In contrast, we explicitly deal with a variety of image

conditions by using different grouping criteria and different repre-

sentations. This means a lower computational investment as we do

not have to invest in the single best segmentation strategy, such as

using the excellent yet expensive contour detector of [3]. Further-

more, as we deal with different image conditions separately, we

expect our locations to have a more consistent quality. Finally, our

selective search paradigm dictates that the most interesting ques-

tion is not how our regions compare to [4, 9], but rather how they

can complement each other.

Gu et al. [15] address the problem of carefully segmenting and

recognizing objects based on their parts. They ﬁrst generate a set

of part hypotheses using a grouping method based on Arbelaez et

al. [3]. Each part hypothesis is described by both appearance and

shape features. Then, an object is recognized and carefully delin-

eated by using its parts, achieving good results for shape recogni-

tion. In their work, the segmentation is hierarchical and yields seg-

ments at all scales. However, they use a single grouping strategy

Figure 2: Two examples of our selective search showing the necessity of different scales. On the left we ﬁnd many objects at different

scales. On the right we necessarily ﬁnd the objects at different scales as the girl is contained by the tv.

whose power of discovering parts or objects is left unevaluated. In

this work, we use multiple complementary strategies to deal with

as many image conditions as possible. We include the locations

generated using [3] in our evaluation.

2.3 Other Sampling Strategies

Alexe et al. [2] address the problem of the large sampling space

of an exhaustive search by proposing to search for any object, in-

dependent of its class. In their method they train a classiﬁer on the

object windows of those objects which have a well-deﬁned shape

(as opposed to stuff like “grass” and “sand”). Then instead of a full

exhaustive search they randomly sample boxes to which they apply

their classiﬁer. The boxes with the highest “objectness” measure

serve as a set of object hypotheses. This set is then used to greatly

reduce the number of windows evaluated by class-speciﬁc object

detectors. We compare our method with their work.

Another strategy is to use visual words of the Bag-of-Words

model to predict the object location. Vedaldi et al. [34] use jumping

windows [5], in which the relation between individual visual words

and the object location is learned to predict the object location in

new images. Maji and Malik [23] combine multiple of these rela-

tions to predict the object location using a Hough-transform, after

which they randomly sample windows close to the Hough maxi-

mum. In contrast to learning, we use the image st ructure to sample

a set of class-independent object hypotheses.

To summarize, our novelty is as follows. Instead of an exhaus-

tive search [8, 12, 16, 36] we use segmentation as selective search

yielding a small set of class independent object locations. In con-

trast to the segmentation of [4, 9], instead of focusing on the best

segmentation algorithm [3], we use a variety of strategies to deal

with as many image conditions as possible, thereby severely reduc-

ing computational costs while potentially capturing more objects

accurately. Instead of learning an objectness measure on randomly

sampled boxes [2], we use a bottom-up grouping procedure to gen-

erate good object locations.

3 Selective Search

In this section we detail our s elective search algorithm for object

recognition and present a variety of diversiﬁcation strategies to deal

with as many image conditions as possible. A selective search al-

gorithm is subject to the following design considerations:

Capture All Scales. Objects can occur at any scale within the im-

age. Furthermore, some objects have less clear boundaries

then other objects. Therefore, in selective search all object

scales have to be taken into account, as illustrated in Figure

2. This is most naturally achieved by using an hierarchical

algorithm.

Diversiﬁcation. There is no single optimal strategy to group re-

gions together. As observed earlier in Figure 1, regions may

form an object because of only colour, only texture, or because

parts are enclosed. Furthermore, lighting conditions such as

shading and the colour of the light may inﬂuence how regions

form an object. Therefore instead of a single strategy which

works well in most cases, we want to have a diverse set of

strategies to deal with all cases.

Fast to Compute. The goal of selective search is to yield a set of

possible object locations for use in a practical object recogni-

tion framework. The creation of this set should not become a

computational bottleneck, hence our algorithm should be rea-

sonably fast.

3.1 Selective Search by Hierarchical Grouping

We take a hierarchical grouping algorithm to form the basis of our

selective search. Bottom-up grouping is a popular approach to seg-

mentation [6, 13], hence we adapt it for selective search. Because

the process of grouping itself is hierarchical, we can naturally gen-

erate locations at all scales by continuing the grouping process until

the whole image becomes a single region. This satisﬁes the condi-

tion of capturing all scales.

As regions can yield richer information than pixels, we want to

use region-based features whenever possible. To get a set of small

starting regions which ideally do not span multiple objects, we use

the fast method of Felzenszwalb and Huttenlocher [13], which [3]

found well-suited for such purpose.

Our grouping procedure now works as follows. We ﬁrst use [13]

to create initial regions. Then we use a greedy algorithm to iter-

atively group regions together: First the similarities between all

neighbouring regions are calculated. The two most similar regions

are grouped together, and new similarities are calculated between

the resulting region and its neighbours. The process of grouping

the most s imilar regions is repeated until the whole image becomes

a single region. The general method is detailed in Algorithm 1.

Algorithm 1: Hierarchical Grouping Algorithm

Input: (colour) image

Output: Set of object location hypotheses L

Obtain initial regions R = {r

,··· ,r

} using [13]

Initialise similarity set S = /0

foreach Neighbouring region pair (r

) do

Calculate similarity s(r

)

S = S ∪ s(r

)

while S 6= /0 do

Get highest similarity s(r

) = max(S)

Merge corresponding regions r

= r

∪ r

Remove similarities regarding r

: S = S \ s(r

∗

)

Remove similarities regarding r

: S = S \ s(r

∗

)

Calculate similarity set S

between r

and its neighbours

S = S ∪ S

R = R ∪ r

Extract object location boxes L from all regions in R

For the similarity s(r

) between region r

and r

we want a va-

riety of complementary measures under the constraint that they are

fast to compute. In effect, this means that the similarities should be

based on features that can be propagated through the hierarchy, i.e.

when merging region r

and r

into r

, the features of region r

need

to be calculated from the features of r

and r

without accessing the

image pixels.

3.2 Diversiﬁcation Strategies

The second design criterion for selective search is to diversify the

sampling and create a set of complementary strategies whose loca-

tions are combined afterwards. We diversify our selective search

(1) by using a variety of colour spaces with different invariance

properties, (2) by using different similarity measures s

i j

, and (3)

by varying our starting regions.

Complementary Colour Spaces. We want to account for dif-

ferent scene and lighting conditions. Therefore we perform our

hierarchical grouping algorithm in a variety of colour spaces with

a range of invariance properties. Speciﬁcally, we the following

colour spaces with an increasing degree of invariance: (1) RGB,

(2) the intensity (grey-scale image) I, (3) Lab, (4) the rg chan-

nels of normalized RGB plus intensity denoted as rgI, (5) HSV, (6)

normalized RGB denoted as rgb, (7) C [14] which is an opponent

colour space where intensity is divided out, and ﬁnally (8) the Hue

channel H from HSV . The speciﬁc invariance properties are li sted

in Table 1.

Of course, for images that are black and white a change of colour

space has little impact on the ﬁnal outcome of the algorithm. For

colour channels R G B I V L a b S r g C H

Light Intensity - - - - - - +/- +/- + + + + +

Shadows/shading - - - - - - +/- +/- + + + + +

Highlights - - - - - - - - - - - +/- +

colour spaces RGB I Lab rgI HSV rgb C H

Light Intensity - - +/-

+ + +

Shadows/shading - - +/-

+ + +

Highlights - - - -

- +/- +

Table 1: The invariance properties of both the individual colour

channels and the colour spaces used in this paper, sorted by de-

gree of invariance. A “+/-” means partial invariance. A fraction

/3 means that one of the three colour channels is invariant to said

property.

these images we rely on the other diversiﬁcation methods for en-

suring good object locations.

In this paper we always use a single colour space throughout

the algorithm, meaning that both the initial grouping algorithm of

[13] and our subsequent grouping algorithm are performed in this

colour space.

Complementary Similarity Measures. We deﬁne four comple-

mentary, fast-to-compute similarity measures. These measures are

all in range [0,1] which facilitates combinations of these measures.

colour

) measures colour similarity. Speciﬁcally, for each re-

gion we obtain one-dimensional colour histograms for each

colour channel using 25 bins, which we found to work well.

This leads to a colour histogram C

= {c

,··· ,c

} for each

region r

with dimensionality n = 75 when three colour chan-

nels are used. The colour histograms are normalised using the

norm. Similarity is measured using the histogram intersec-

tion:

colour

) =

∑

k=1

min(c

). (1)

The colour histograms can be efﬁciently propagated through

the hierarchy by

size(r

) ×C

+ size(r

) ×C

size(r

) + size(r

)

. (2)

The size of a resulting region is s imply the sum of its con-

stituents: size(r

) = size(r

) + size(r

texture

) measures texture similarity. We represent texture us-

ing fast SIFT-like measurements as SIFT itself works well for

material recognition [20]. We take Gaussian derivatives in

eight orientations using

= 1 for each colour channel. For

each orientation for each colour channel we extract a his-

togram using a bin size of 10. This leads to a texture his-

togram T

= {t

,··· ,t

} for each region r

with dimension-

ality n = 240 when three colour channels are used. Texture

histograms are normalised using the L

norm. Simil arity is

measured using histogram intersection:

texture

) =

∑

k=1

min(t

). (3)

Texture histograms are efﬁciently propagated through the hi-

erarchy in the same way as the colour histograms.

size

) encourages small regions to merge early. This forces

regions in S, i.e. regions which have not yet been merged, to

be of similar sizes throughout the algorithm. This is desir-

able because it ensures that object locations at all scales are

created at all parts of the image. For example, it prevents a

single region from gobbling up all other regions one by one,

yielding all scales only at the location of this growing region

and nowhere els e. s

size

) is deﬁned as the fraction of the

image that r

and r

jointly occupy:

size

) = 1 −

size(r

) + size(r

)

size(im)

, (4)

where size(im) denotes the size of the image in pixels.

ﬁll

) measures how well region r

and r

ﬁt into each other.

The idea is to ﬁll gaps: if r

is contained in r

it is logical to

merge these ﬁrst in order to avoid any holes. On the other

hand, if r

and r

are hardly t ouching each other they will

likely form a strange region and should not be merged. To

keep the measure fast, we use only the size of the regions and

of the containing boxes. Speciﬁcally, we deﬁne BB

i j

to be the

tight bounding box around r

and r

. Now s

ﬁll

) is the

fraction of the image contained in BB

i j

which is not covered

by the regions of r

and r

ﬁll(r

) = 1 −

size(BB

i j

) − size(r

)

size(im)

(5)

We divide by size(im) for consistency with Equation 4. Note

that this measure can be efﬁciently calculated by keeping track

of the bounding boxes around each region, as the bounding

box around two regions can be easily derived from these.

In this paper, our ﬁnal similarity measure is a combination of the

above four:

s(r

) = a

colour

) + a

texture

) +

size

) + a

f ill

), (6)

where a

∈ {0,1} denotes if the similarity measure is used or

not. As we aim to diversify our strategies, we do not consider any

weighted similarities.

Complementary Starting Regions. A third diversiﬁcation

strategy is varying the complementary starting regions. To the

best of our knowledge, the method of [13] is the fastest, publicly

available algorithm that yields high quality starting locations. We

could not ﬁnd any other algorithm with similar computational efﬁ-

ciency so we use only this oversegmentation in this paper. But note

that different starting regions are (already) obtained by varying the

colour spaces, each which has different invariance properties. Ad-

ditionally, we vary the threshold parameter k in [13].

3.3 Combining Locations

In this paper, we combine the object hypotheses of several varia-

tions of our hierarchical grouping algorithm. Ideally, we want to

order the object hypotheses in such a way that the locations which

are most likely to be an object come ﬁrst. This enables one to ﬁnd

a good trade-off between the quality and quantity of the resulting

object hypothesis set, depending on the computational efﬁciency of

the subsequent feature extraction and classiﬁcation method.

We choose to order the combined object hypotheses set based

on the order in which the hypotheses were generated in each in-

dividual grouping strategy. However, as we combine results from

up to 80 different strategies, such order would too heavily empha-

size large regions. To prevent this, we include some randomness

as follows. Given a grouping strategy j, let r

be the region which

is created at position i in the hierarchy, where i = 1 represents the

top of the hierarchy (whose corresponding region covers the com-

plete image). We now calculate the position value v

as RND × i,

where RND is a random number in range [0,1]. The ﬁnal ranking

is obtained by ordering the regions using v

When we use locations in terms of bounding boxes, we ﬁrst rank

all the locations as detailed above. Only afterwards we ﬁlter out

lower ranked duplicates. This ensures that duplicate boxes have a

better chance of obtaining a high rank. This is desirable because

if multiple grouping strategies suggest the same box location, it is

likely to come from a visually coherent part of the image.

4 Object Recognition using Selective

This paper uses the locations generated by our selective search for

object recognition. This section details our f ramework for object

recognition.

Two types of features are dominant in object recognition: his-

tograms of oriented gradients (HOG) [8] and bag-of-words [7, 27].

HOG has been shown to be successful in combination with the part-

based model by Felzenszwalb et al. [12]. However, as they use an

exhaustive search, HOG features in combination with a linear clas-

siﬁer is the only feasible choice from a computational perspective.

In contrast, our selective search enables the use of more expensive

and potentially more powerful features. Therefore we use bag-of-

words for object recognition [16, 17, 34]. However, we use a more

powerful (and expensive) implementation than [16, 17, 34] by em-

ploying a variety of colour-SIFT descriptors [32] and a ﬁner spatial

pyramid division [18].

Speciﬁcally we sample descriptors at each pixel on a single scale

(

= 1.2). Using software from [32], we extract SIFT [21] and two

colour SIFTs which were found to be the most sensitive for de-

tecting image structures, Extended OpponentSIFT [31] and RGB-

SIFT [32]. We use a visual codebook of size 4,000 and a spatial

pyramid with 4 levels using a 1x1, 2x2, 3x3. and 4x4 division.

This gives a total feature vector length of 360,000. In image clas-

siﬁcation, features of this size are already used [25, 37] . Because

a spatial pyramid results in a coarser spatial subdivision than the

cells which make up a HOG descriptor, our features contain less

information about the speciﬁc spatial layout of the object. There-

fore, HOG is better suited for rigid objects and our features are

better suited for deformable object types.

As classiﬁer we employ a Support Vector Machine with a his-

togram intersection kernel using the Shogun Toolbox [28]. To ap-

ply the trained classiﬁer, we use t he fast, approximate classiﬁcation

strategy of [22], which was shown to work well for Bag-of-Words

in [30].

Our training procedure is illustrated in Figure 3. The initial posi-

tive examples consist of all ground truth object windows. As initial

negative examples we select from all object locations generated

评论收藏

内容反馈

版权申诉

sjx_alo

粉丝: 1w+
资源: 1206

基于选择性搜索（Selective Search）选择候选区域.zip

Selective Search算法---候选区域搜索.zip

Selective-Search-边框生成.7z

论文研究-基于投影的改进Selective Search图像分割算法 .pdf

Selective Search for Object Recognition

selectivesearch.zip

Python+OpenCV实现Selective Search算法

matlab数组排序代码-selective_search:选择性搜索的Python实现

selectiveSearch C++实现

基于Selective-Mapping选择性映射的OFDM系统PAPR抑制算法matlab仿真+matlab操作视频

selective-search:Python 3的选择性搜索实现

Frequency Selective Surfaces Based Planar Microwave Absorber.pdf

基于selective_search对手写数字串进行分割，并基于tensorflow在mnist训练好的模型进行识别

selective-search-cpp:选择性搜索的C ++实现

选择性符号执行S2E.zip

Selective Thresholding in Wavelet Image Compression.pdf

Simple Python script to compute Selective Search proposals in

DSC.zip_Radio_churchgz9_dsc_selective_对讲

PyPI 官网下载 | selective_linter-0.2.0.tar.gz

selectivesearch.py

YOLOv8-deepsort 实现智能车辆目标检测+车辆跟踪+车辆计数

YOLOv8网络结构图，自制visio文件，yolov8.vsds，需要的自取，在原有的基础上直接改就行了

yolov8(2023年8月版本),已经下好yolov8s.pt和yolov8n.pt

Transformer模型实现长期预测并可视化结果（附代码+数据集+原理介绍）

社交平台上经济类话题的文章热度信息，数据是真实的，但不是真实日期

行人跌倒数据集（VOC格式）

YOLOV5 + 双目相机实现三维测距（新版本）

Unet眼底血管图像分割数据集+代码+模型+系统界面+教学视频.zip

全新的SOTA模型YOLOv9

YOLOV5口罩检测数据集+代码+模型 2000张标注好的数据+教学视频.zip

最新资源