<div align="center">
<img src="./docs/doccano.png">
</div>
# doccano
[](https://app.codacy.com/app/Hironsan/doccano?utm_source=github.com&utm_medium=referral&utm_content=doccano/doccano&utm_campaign=Badge_Grade_Dashboard)
[](https://travis-ci.com/doccano/doccano)
doccano is an open source text annotation tool for humans. It provides annotation features for text classification, sequence labeling and sequence to sequence tasks. So, you can create labeled data for sentiment analysis, named entity recognition, text summarization and so on. Just create a project, upload data and start annotating. You can build a dataset in hours.
## Demo
You can try the [annotation demo](http://doccano.herokuapp.com).

## Features
- Collaborative annotation
- Multi-language support
- Mobile support
- Emoji :smile: support
- Dark theme
- RESTful API
## Usage
Two options to run doccano:
- (Recommended) Docker Compose
- Docker
### Docker Compose
```bash
$ git clone https://github.com/doccano/doccano.git
$ cd doccano
$ docker-compose -f docker-compose.prod.yml up
```
Go to <http://0.0.0.0/>.
_Note the superuser account credentials located in the `docker-compose.prod.yml` file:_
```yml
ADMIN_USERNAME: "admin"
ADMIN_PASSWORD: "password"
```
> Note: If you want to add annotators, see [Frequently Asked Questions](https://github.com/doccano/doccano/wiki/Frequently-Asked-Questions#i-want-to-add-annotators)
_Note for Windows developers: Be sure to configure git to correctly handle line endings or you may encounter `status code 127` errors while running the services in future steps. Running with the git config options below will ensure your git directory correctly handles line endings._
```bash
git clone https://github.com/doccano/doccano.git --config core.autocrlf=input
```
### Docker
As a one-time setup, create a Docker container for Doccano:
```bash
docker pull doccano/doccano
docker container create --name doccano \
-e "ADMIN_USERNAME=admin" \
-e "ADMIN_EMAIL=admin@example.com" \
-e "ADMIN_PASSWORD=password" \
-p 8000:8000 doccano/doccano \
-v /data:/doccano
docker container create -v /data:/doccano --name doccano \
-e "ADMIN_USERNAME=admin" \
-e "ADMIN_EMAIL=admin@example.com" \
-e "ADMIN_PASSWORD=password" \
-p 8000:8000 doccano/doccano
docker container create --name doccano1 \
-e "ADMIN_USERNAME=admin" \
-e "ADMIN_EMAIL=admin@example.com" \
-e "ADMIN_PASSWORD=password" \
-p 8001:8000 doccano/doccano \
-v /data:/doccano
```
Next, start Doccano by running the container:
```bash
docker container start doccano
docker exec -it doccano /bin/bash
docker container start doccano1
docker exec -it doccano1 /bin/bash
```
To stop the container, run `docker container stop doccano -t 5`.
All data created in the container will persist across restarts.
Go to <http://127.0.0.1:8000/>.
### For Developers
You can setup local development environment as follows:
```bash
$ git clone https://github.com/doccano/doccano.git
$ cd doccano
$ docker-compose -f docker-compose.dev.yml up
```
Go to <http://127.0.0.1:3000/>.
## One-click Deployment
| Service | Button |
|---------|---|
| AWS[^1] | [](https://console.aws.amazon.com/cloudformation/home?#/stacks/create/review?stackName=doccano&templateURL=https://s3-external-1.amazonaws.com/cf-templates-10vry9l3mp71r-us-east-1/2019290i9t-AppSGl1poo4j8qpq) |
| Azure | [](https://portal.azure.com/#create/Microsoft.Template/uri/https%3A%2F%2Fraw.githubusercontent.com%2Fdoccano%2Fdoccano%2Fmaster%2Fazuredeploy.json) |
| GCP[^2] | [](https://console.cloud.google.com/cloudshell/editor?shellonly=true&cloudshell_image=gcr.io/cloudrun/doccano&cloudshell_git_repo=https://github.com/doccano/doccano.git) |
| Heroku | [](https://heroku.com/deploy) |
> [^1]: (1) EC2 KeyPair cannot be created automatically, so make sure you have an existing EC2 KeyPair in one region. Or [create one yourself](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-key-pairs.html#having-ec2-create-your-key-pair). (2) If you want to access doccano via HTTPS in AWS, here is an [instruction](https://github.com/doccano/doccano/wiki/HTTPS-setting-for-doccano-in-AWS).
> [^2]: Although this is a very cheap option, it is only suitable for very small teams (up to 80 concurrent requests). Read more on [Cloud Run docs](https://cloud.google.com/run/docs/concepts).
## Contribution
As with any software, doccano is under continuous development. If you have requests for features, please file an issue describing your request. Also, if you want to see work towards a specific feature, feel free to contribute by working towards it. The standard procedure is to fork the repository, add a feature, fix a bug, then file a pull request that your changes are to be merged into the main repository and included in the next release.
Here are some tips might be helpful. [How to Contribute to Doccano Project](https://github.com/doccano/doccano/wiki/How-to-Contribute-to-Doccano-Project)
## Citation
```
@misc{doccano,
title={{doccano}: Text Annotation Tool for Human},
url={https://github.com/doccano/doccano},
note={Software available from https://github.com/doccano/doccano},
author={
Hiroki Nakayama and
Takahiro Kubo and
Junya Kamura and
Yasufumi Taniguchi and
Xu Liang},
year={2018},
}
```
## Contact
For help and feedback, please feel free to contact [the author](https://github.com/Hironsan).

涂衮衮
- 粉丝: 3
- 资源: 4
最新资源
- 2023-04-06-项目笔记 - 第四百一十四阶段 - 4.4.2.412全局变量的作用域-412 -2025.02.19
- 基于STM32的智能环境系统设计20250211
- 基于元胞自动机法的枝晶生长模拟:任意角度偏心正方算法结合流体动力学LBM研究,基于元胞自动机法的枝晶生长模拟:任意角度偏心正方算法结合流体动力学LBM分析,C++程序,基于元胞自动机法模拟枝晶生长,能
- 单相全波可控整流电路:变压器中心抽头下的电流与波形特性研究,深入理解单相全波可控整流电路:原理、应用与参数计算,单相全波可控整流电路 此电路变压器是带中心抽头的,在u2正半周T1工作,变压器二次绕组
- terraform solace provider plugins
- MATLAB中蚁群算法在TSP问题上的应用研究,MATLAB实现蚁群算法求解旅行商TSP问题的优化策略,MATLAB 用蚁群算法解决旅行商TSP问题 ,MATLAB; 蚁群算法; 旅行商TSP问题
- 路径规划算法仿真:A星算法改进版,高效搜索与路径优化,带梯度下降及S-G滤波器处理,Matlab实现,可定量比较不同算法效果,改进A*算法:权重系数提升搜索效率、冗余拐角优化及路径平滑处理(Matla
- 基于C51单片机的花椒智能烘干设备的设计20250218
- 转速电流双闭环无传感器无刷直流电机Simulink模型解析:探究转速、转矩、反向电动势与三相电流之间的关系,转速电流双闭环无传感器无刷直流电机Simulink模型解析:探究转速、转矩、反向电动势与三相
- LaTeX-OCR-PRO图像识别项目实战
- Malab Simulink MW级直驱风机模型解析及参考文献资源分享,基于Malab Simulink构建的MW级直驱风机模型及其相关参考文献,Malab Simulink MW级直驱风机模型,附赠
- 风力永磁同步发电机设计及仿真分析:Maxwell电磁仿真在1.5兆瓦风力发电机中的应用与工况研究,基于Maxwell电磁仿真的1.5兆瓦风力永磁同步发电机设计与工况分析,1.5兆瓦风力发电机 maxw
- SCI计算复现系列:Pandat代算与自操作实践下的共晶成分设计-以Al-Cu-Si三元合金共晶点成分寻找为例,SCI计算复现:共晶成分设计实例-Al-Cu-Si三元合金共晶点成分的相图计算与Pa
- DeepSeek:本地搭建
- 基于双馈发电机与电池储能系统的频率调节控制策略Simulink模型研究,基于双馈发电机与电池储能系统的频率调节控制策略Simulink模型研究,风储调频 使用双馈发电机(DFIG)相关的电池储能系统
- 永磁同步电机PMSM模糊PI控制策略:详细搭建过程、仿真效果及与传统PI对比资料全套打包介绍,永磁同步电机PMSM模糊PI控制策略详解:搭建过程、参考资料、与传统PI对比及仿真效果展示,永磁同步电机P
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈



评论0