<!--
Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
-->
# TsFile-Spark-Connector User Guide
## 1. About TsFile-Spark-Connector
TsFile-Spark-Connector adds Spark support for external data sources of the TsFile type, enabling users to read, write, and query TsFiles with Spark.
With this connector, you can
* load a single TsFile, from either the local file system or hdfs, into Spark
* load all files in a specific directory, from either the local file system or hdfs, into Spark
* write data from Spark into TsFile
## 2. System Requirements
|Spark Version | Scala Version | Java Version | TsFile |
|------------- | ------------- | ------------ |------------ |
| `>= 2.2` | `2.11` | `1.8` | `0.10.0`|
> Note: For more information about how to download and use TsFile, please see the following link: https://github.com/apache/incubator-iotdb/tree/master/tsfile.
## 3. Quick Start
### Local Mode
Start Spark with TsFile-Spark-Connector in local mode:
```
./<spark-shell-path> --jars tsfile-spark-connector.jar,tsfile-0.10.0-jar-with-dependencies.jar
```
Note:
* \<spark-shell-path> is the real path of your spark-shell.
* Multiple jar packages are separated by commas without any spaces.
* See https://github.com/apache/iotdb/tree/master/tsfile for how to get TsFile.
### Distributed Mode
Start Spark with TsFile-Spark-Connector in distributed mode (that is, spark-shell connects to a Spark cluster):
```
./<spark-shell-path> --jars tsfile-spark-connector.jar,tsfile-0.10.0-jar-with-dependencies.jar --master spark://ip:7077
```
Note:
* \<spark-shell-path> is the real path of your spark-shell.
* Multiple jar packages are separated by commas without any spaces.
* See https://github.com/apache/iotdb/tree/master/tsfile for how to get TsFile.
## 4. Data Type Correspondence
| TsFile data type | SparkSQL data type|
| --------------| -------------- |
| BOOLEAN | BooleanType |
| INT32 | IntegerType |
| INT64 | LongType |
| FLOAT | FloatType |
| DOUBLE | DoubleType |
| TEXT | StringType |
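The correspondence above is a fixed one-to-one lookup. The sketch below (plain Python, purely illustrative and not part of the connector's API) expresses the table as a mapping so the conversion can be checked programmatically:

```python
# Illustrative sketch: the TsFile -> SparkSQL data type mapping from the
# table above, expressed as a lookup. Not part of the connector's API.
TSFILE_TO_SPARKSQL = {
    "BOOLEAN": "BooleanType",
    "INT32": "IntegerType",
    "INT64": "LongType",
    "FLOAT": "FloatType",
    "DOUBLE": "DoubleType",
    "TEXT": "StringType",
}

def spark_type_for(tsfile_type: str) -> str:
    """Return the SparkSQL type name for a TsFile data type."""
    return TSFILE_TO_SPARKSQL[tsfile_type.upper()]

print(spark_type_for("INT64"))
```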
## 5. Schema Inference
The way a TsFile is displayed depends on its schema. Take the following TsFile structure as an example: there are three measurements in the TsFile schema: status, temperature, and hardware. Their basic information is as follows:

| name | type | encoding |
|------|------|----------|
| status | Boolean | PLAIN |
| temperature | Float | RLE |
| hardware | Text | PLAIN |
The existing data in the TsFile is as follows (each measurement stores its own time/value series):

| root.ln.wf01.wt01.status | | root.ln.wf01.wt01.temperature | | root.ln.wf02.wt02.hardware | | root.ln.wf02.wt02.status | |
|------|-------|------|-------|------|-------|------|-------|
| time | value | time | value | time | value | time | value |
| 1 | True | 1 | 2.2 | 2 | "aaa" | 1 | True |
| 3 | True | 2 | 2.2 | 4 | "bbb" | 2 | False |
| 5 | False | 3 | 2.1 | 6 | "ccc" | 4 | True |
The corresponding SparkSQL table is as follows:
| time | root.ln.wf02.wt02.temperature | root.ln.wf02.wt02.status | root.ln.wf02.wt02.hardware | root.ln.wf01.wt01.temperature | root.ln.wf01.wt01.status | root.ln.wf01.wt01.hardware |
|------|-------------------------------|--------------------------|----------------------------|-------------------------------|--------------------------|----------------------------|
| 1 | null | true | null | 2.2 | true | null |
| 2 | null | false | aaa | 2.2 | null | null |
| 3 | null | null | null | 2.1 | true | null |
| 4 | null | true | bbb | null | null | null |
| 5 | null | null | null | null | false | null |
| 6 | null | null | ccc | null | null | null |
You can also use the narrow table form, shown below (see Section 6 for how to obtain it):
| time | device_name | status | hardware | temperature |
|------|-------------------|--------|----------|-------------|
| 1 | root.ln.wf01.wt01 | true | null | 2.2 |
| 1 | root.ln.wf02.wt02 | true | null | null |
| 2 | root.ln.wf01.wt01 | null | null | 2.2 |
| 2 | root.ln.wf02.wt02 | false | aaa | null |
| 3 | root.ln.wf01.wt01 | true | null | 2.1 |
| 4 | root.ln.wf02.wt02 | true | bbb | null |
| 5 | root.ln.wf01.wt01 | false | null | null |
| 6 | root.ln.wf02.wt02 | null | ccc | null |
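To make the relationship between the two forms concrete, the sketch below (plain Python, independent of Spark and purely illustrative, not connector code) pivots wide-form rows into narrow-form rows by splitting each column name into a device prefix and a measurement suffix:

```python
# Illustrative sketch (not connector code): pivot wide-form rows into
# narrow form. Column names have the shape "device.path.measurement";
# rpartition(".") splits off the last segment as the measurement name.
from collections import defaultdict

def wide_to_narrow(rows, measurements):
    """rows: dicts with a 'time' key plus 'device.measurement' keys."""
    narrow = []
    for row in rows:
        per_device = defaultdict(dict)
        for col, value in row.items():
            if col == "time":
                continue
            device, _, measurement = col.rpartition(".")
            per_device[device][measurement] = value
        for device, values in sorted(per_device.items()):
            # Emit a narrow row only if the device has any non-null value
            # at this timestamp, mirroring the narrow table above.
            if any(v is not None for v in values.values()):
                narrow.append({"time": row["time"], "device_name": device,
                               **{m: values.get(m) for m in measurements}})
    return narrow

# One wide-form row (time = 1) from the example above:
wide = [{"time": 1,
         "root.ln.wf01.wt01.status": True,
         "root.ln.wf01.wt01.temperature": 2.2,
         "root.ln.wf02.wt02.status": True,
         "root.ln.wf02.wt02.hardware": None}]
print(wide_to_narrow(wide, ["status", "hardware", "temperature"]))
```

Running this on the time-1 wide row yields one narrow row per device, matching the first two rows of the narrow table.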
## 6. Scala API
NOTE: Remember to assign necessary read and write permissions in advance.
### Example 1: read from the local file system
```scala
import org.apache.iotdb.tsfile._
val wide_df = spark.read.tsfile("test.tsfile")
wide_df.show
val narrow_df = spark.read.tsfile("test.tsfile", true)
narrow_df.show
```
### Example 2: read from the Hadoop file system
```scala
import org.apache.iotdb.tsfile._
val wide_df = spark.read.tsfile("hdfs://localhost:9000/test.tsfile")
wide_df.show
val narrow_df = spark.read.tsfile("hdfs://localhost:9000/test.tsfile", true)
narrow_df.show
```
### Example 3: read from a specific directory
```scala
import org.apache.iotdb.tsfile._
val df = spark.read.tsfile("hdfs://localhost:9000/usr/hadoop")
df.show
```
Note 1: Global time ordering of all TsFiles in a directory is not currently supported.
Note 2: Measurements with the same name should have the same schema.
### Example 4: query in wide form
```scala
import org.apache.iotdb.tsfile._
val df = spark.read.tsfile("hdfs://localhost:9000/test.tsfile")
df.createOrReplaceTempView("tsfile_table")
val newDf = spark.sql("select * from tsfile_table where `device_1.sensor_1` > 0 and `device_1.sensor_2` < 22")
newDf.show
```