[Free] Display all CVPR 2019 accepted papers in an easy-to-parse way, code for study and reference only (.zip resource) - CSDN Library

19 files in total

py: 9

txt: 4

html: 2

Points required: 0 · 64 views · Uploaded 2023-05-06 21:56:03 · 28.5 MB ZIP

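The phrase "display all CVPR 2019 accepted papers in an easy-to-parse way" in the title refers to a program or tool that organizes and clearly presents the list of papers accepted to the 2019 Conference on Computer Vision and Pattern Recognition (CVPR). CVPR is one of the top conferences in computer vision and attracts a large number of submissions every year. The tool may present the papers as structured data, with key fields such as authors, title, abstract, and citation count, to make study and research easier.

The phrase "code for study and reference only" in the description indicates that this is a teaching resource for scholars and students who want to learn how to process academic data or who are interested in CVPR papers. The code may include algorithms, data structures, and visualization techniques for parsing paper data, intended to help users understand how to obtain and process similar information.

The tag "graduation project software/plugin" suggests that the archive may be part of a graduation project and may contain a piece of software or a plugin that implements the functionality above. Graduation projects usually require students to apply what they have learned to a practical problem, so this project may involve a programming language (such as Python), data-processing libraries (such as Pandas or BeautifulSoup), and possibly front-end technologies (such as HTML, CSS, and JavaScript). The "software/plugin" part may mean that users can install and run the code to view the details of CVPR papers.

The archive's only top-level entry, named after the resource itself, is a folder that holds the source code, most of it Python scripts. The code may be divided into the following main parts:

1. Data acquisition: fetches or downloads paper information from the official CVPR website or other public sources, possibly using web-scraping techniques.
2. Data parsing: converts the raw data into a format that can be processed, possibly involving XML, JSON, or HTML parsing.
3. Data storage: the processed data may be stored in a database or in CSV files for later analysis.
4. Data analysis: may include some simple statistics, such as the number of papers or the distribution of authors.
5. Data presentation: builds a user interface or generates a report that shows the organized paper information, possibly using visualization libraries such as Matplotlib, Seaborn, or Plotly.

For learners, this code example can show how to process large-scale academic data, how to build a web scraper, and how to use programming for data visualization. It is also a good way to explore research trends and hot topics in computer vision: analyzing these papers reveals the popular research directions of the time and the latest technical advances.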
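The acquisition and parsing stages described above could look roughly like the following minimal sketch, which is not the code shipped in the archive. It assumes the saved listing page marks each paper title as a link inside `<dt class="ptitle">`; that markup, the `TitleParser` class, and the `parse_titles` helper are illustrative assumptions and should be checked against the actual HTML file.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collect the link text and href found inside <dt class="ptitle"> elements.

    The "ptitle" class name is an assumption about the saved page's markup.
    """

    def __init__(self):
        super().__init__()
        self.in_title = False      # True while inside <dt class="ptitle">
        self.current_href = None   # href of the <a> currently being read
        self.current_text = []     # text fragments of that <a>
        self.papers = []           # collected {"title": ..., "url": ...} dicts

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "dt" and attrs.get("class") == "ptitle":
            self.in_title = True
        elif tag == "a" and self.in_title:
            self.current_href = attrs.get("href")
            self.current_text = []

    def handle_data(self, data):
        if self.in_title and self.current_href is not None:
            self.current_text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self.in_title and self.current_href is not None:
            # Collapse runs of whitespace inside the title.
            title = " ".join("".join(self.current_text).split())
            self.papers.append({"title": title, "url": self.current_href})
            self.current_href = None
        elif tag == "dt":
            self.in_title = False


def parse_titles(html_text):
    """Return a list of {"title", "url"} dicts, in page order."""
    parser = TitleParser()
    parser.feed(html_text)
    return parser.papers
```

To try it on a locally saved page, read the file into a string (for example with `open(path, encoding="utf-8").read()`) and pass it to `parse_titles`. Only the standard library is used here, so the sketch runs without installing a third-party HTML parser.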

以易于解析的方式显示所有2019年CVPR接受论文仅供学习参考用代码.zip (19 files)

以易于解析的方式显示所有2019年CVPR接受论文仅供学习参考用代码/
  generatenicelda.py (6 KB)
  lda.py (6 KB)
  jquery-1.8.3.min.js (91 KB)
  scrape.py (2 KB)
  doc/
    static/
      oar_preview.png (601 KB)
      site.gif (28.17 MB)
  abstracts/
    dummy.txt (0 B)
  makecorpus.py (1 KB)
  download_pdfs.py (538 B)
  vocabulary.py (6 KB)
  cvpr2019oar.html (3.21 MB)
  getabstracts.py (978 B)
  pdftowordcloud.py (2 KB)
  thumbs/
    dummy.txt (0 B)
  pdftothumbs.py (1 KB)
  stopwords.txt (4 KB)
  Readme.md (3 KB)
  content/
    dummy.txt (0 B)
  cvprnice_template.html (14 KB)

**Updated repository at https://github.com/mattdeitke/CVPR-Accepted-Papers-Viewer**

---

# CVPR 2019 Accepted Papers

The main goal of these scripts is to build a page that displays the accepted papers for CVPR 2019 in a way that is easier for humans to parse (see: https://mattdeitke.com/CVPR-2019). Below is an example of what this repository will display, followed by what CVPR open access currently shows.

<img src="doc/static/site.gif" style="width:100%"/>

<div style="text-align:center">
  <img src="doc/static/oar_preview.png" style="width:60%;"/>
</div>

In particular, there is functionality to cluster papers based on latent Dirichlet allocation (LDA) topics, create thumbnail images from the first 8 pages of each PDF, find the abstracts, copy a BibTeX entry, view the paper and supplementary material, and more.

The scripts use Python 3.7 and should work for any past or future CVPR conference (unless the way the pages are displayed changes). Modifications can be made to adapt the format to another conference.

#### Installation

0. Clone this repository: `git clone https://github.com/mattdeitke/CVPR2019`
1. Save the HTML of the page where the accepted papers are displayed. For CVPR this year, that is `http://openaccess.thecvf.com/CVPR2019.py`.
2. Install ImageMagick, which can be done using `sudo apt-get install imagemagick` or another supported method such as `brew install imagemagick`.
3. Run `pdftowordcloud.py` to generate the top words for each paper. The output is saved in `topwords.p`.
4. Run `pdftothumbs.py` to generate tiny thumbnails for all papers. The outputs are saved in the `thumbs/` folder.
5. Run `scrape.py` to generate the paper ID, title, and author list of each paper by scraping the `cvpr2019oar.html` page.
6. Run `makecorpus.py` to create the `allpapers.txt` file, which contains all papers (one per row).
7. Run `python lda.py -f allpapers.txt -k 7 --alpha=0.5 --beta=0.5 -i 100`. This generates a pickle file called `ldaphi.p` that contains the LDA word distribution matrix.

   Thanks to this [nice LDA code](https://github.com/shuyo/iir/blob/master/lda/lda.py) by [@shuyo](https://github.com/shuyo)! It requires the nltk library and numpy. In this example we are using 7 categories. You would need to change the `cvprnice_template.html` file a bit if you wanted to try a different number of categories.
8. Generate the abstract files inside the `abstracts/` folder using `getabstracts.py`.
9. Finally, run `generatenicelda.py` to create the `index.html` page.

### Acknowledgements

Big thanks to [@karpathy](https://github.com/karpathy) for his [NeurIPS preview](https://github.com/karpathy/nipspreview) and [arXiv Sanity Preserver](https://github.com/karpathy/arxiv-sanity-preserver), which this repository builds on! Thanks also to [@tholman](https://github.com/tholman) for creating a more modern [GitHub Corners](https://github.com/tholman/github-corners) and to [@shuyo](https://github.com/shuyo) for the [LDA code](https://github.com/shuyo/iir/blob/master/lda/lda.py)! Finally, more thanks go to the people at CVPR for [openly publishing all of their accepted papers](http://openaccess.thecvf.com/CVPR2019.py)!

#### Licence

MIT License
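#### Sketch: what the LDA step computes

To make step 7 of the installation list more concrete, here is a minimal pure-Python sketch of LDA fitted by collapsed Gibbs sampling. It is not the `lda.py` script from the repository; the function name `lda_gibbs` and its return format are illustrative assumptions. Its arguments `k`, `alpha`, `beta`, and `iterations` are meant to play the roles of the `-k`, `--alpha`, `--beta`, and `-i` flags in the command, and the returned `phi` is a topic-by-word distribution matrix of the kind the README says is stored in `ldaphi.p`.

```python
import random


def lda_gibbs(docs, k, alpha, beta, iterations, seed=0):
    """Fit LDA by collapsed Gibbs sampling (illustrative sketch).

    docs is a list of token lists. Returns (vocab, phi), where phi[t][w]
    is the estimated probability of word vocab[w] under topic t.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    v = len(vocab)

    n_dk = [[0] * k for _ in docs]      # topic counts per document
    n_kw = [[0] * v for _ in range(k)]  # word counts per topic
    n_k = [0] * k                       # total tokens assigned to each topic
    z = []                              # topic assignment of every token

    # Start from a uniformly random topic assignment.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            t = rng.randrange(k)
            z_d.append(t)
            n_dk[d][t] += 1
            n_kw[t][word_id[w]] += 1
            n_k[t] += 1
        z.append(z_d)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wi = word_id[w]
                # Remove the token's current assignment from the counts.
                t = z[d][i]
                n_dk[d][t] -= 1
                n_kw[t][wi] -= 1
                n_k[t] -= 1
                # Conditional distribution over topics for this token.
                weights = [
                    (n_dk[d][j] + alpha) * (n_kw[j][wi] + beta) / (n_k[j] + v * beta)
                    for j in range(k)
                ]
                t = rng.choices(range(k), weights=weights)[0]
                # Record the newly sampled assignment.
                z[d][i] = t
                n_dk[d][t] += 1
                n_kw[t][wi] += 1
                n_k[t] += 1

    # Smoothed word distribution of each topic.
    phi = [
        [(n_kw[t][w] + beta) / (n_k[t] + v * beta) for w in range(v)]
        for t in range(k)
    ]
    return vocab, phi
```

With the values from the README command, a call would look like `lda_gibbs(docs, k=7, alpha=0.5, beta=0.5, iterations=100)`, where `docs` holds one token list per paper. Sorting each row of `phi` in descending order gives the most characteristic words of that topic.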
