spark-atlas-connector:一个Spark Atlas连接器,用于跟踪Apache Atlas中的数据沿袭

上传者: 42109639 | 上传时间: 2023-02-17 16:41:24 | 文件大小: 318KB | 文件类型: ZIP
Spark Atlas连接器 用于跟踪Spark SQL / DataFrame转换并将元数据更改推送到Apache Atlas的连接器。 此连接器支持跟踪: SQL DDL,例如“创建/删除/更改数据库”,“创建/删除/更改表”。 SQL DML,例如“ CREATE TABLE tbl AS SELECT”,“ INSERT INTO ...”,“ LOAD DATA [LOCAL] INPATH”,“ INSERT OVERWRITE [LOCAL] DIRECTORY”等。 具有输入和输出的DataFrame转换 机器学习管道。 该连接器将与Hive,HDFS等其他系统关联,以跟踪Atlas中数据的生命周期。 如何建造 要使用此连接器,您将需要最新版本的Spark(Spark 2.3+),因为大多数功能仅在Spark 2.3.0+中存在。 要构建此项目,请执行: mv

文件下载

资源详情

[{"title":"( 90 个子文件 318KB ) spark-atlas-connector:一个Spark Atlas连接器,用于跟踪Apache Atlas中的数据沿袭","children":[{"title":"spark-atlas-connector-master","children":[{"title":".github","children":[{"title":"PULL_REQUEST_TEMPLATE <span style='color:#111;'> 224B </span>","children":null,"spread":false}],"spread":true},{"title":"patch","children":[{"title":"Spark_ML_Listener_2.3.patch <span style='color:#111;'> 9.04KB </span>","children":null,"spread":false},{"title":"Spark_ML_Listener_2.4.patch <span style='color:#111;'> 15.41KB </span>","children":null,"spread":false},{"title":"1100-spark_model.json <span style='color:#111;'> 11.46KB </span>","children":null,"spread":false},{"title":"Spark_ML_Listener_2.4.diff <span style='color:#111;'> 14.80KB </span>","children":null,"spread":false},{"title":"Spark_ML_Listener_2.3.diff <span style='color:#111;'> 8.46KB </span>","children":null,"spread":false}],"spread":true},{"title":"examples","children":[{"title":"zeepline_notebook","children":[{"title":"Spark ML_Lineage.png <span style='color:#111;'> 48.93KB </span>","children":null,"spread":false},{"title":"SAC_ Spark Streaming and ML.json <span style='color:#111;'> 4.57KB </span>","children":null,"spread":false},{"title":"SAC_ Spark SQL.json <span style='color:#111;'> 4.18KB </span>","children":null,"spread":false},{"title":"SAC_ Spark ML.json <span style='color:#111;'> 5.00KB </span>","children":null,"spread":false},{"title":"Spark_ML_Streaming_Lineage.png <span style='color:#111;'> 60.31KB </span>","children":null,"spread":false},{"title":"Spark ETL_Lineage.png <span style='color:#111;'> 51.94KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"spark-atlas-connector","children":[{"title":"src","children":[{"title":"test","children":[{"title":"scala","children":[{"title":"com","children":[{"title":"hotels","children":[{"title":"beeju","children":[{"title":"ThriftHiveMetaStoreTestUtil.scala <span style='color:#111;'> 1.02KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"hortonworks","children":[{"title":"spark","children":[{"title":"atlas","children":[{"title":"sql","children":[{"title":"SparkExecutionPlanProcessorForViewSuite.scala <span style='color:#111;'> 4.10KB </span>","children":null,"spread":false},{"title":"testhelper","children":[{"title":"KafkaTopicEntityValidator.scala <span style='color:#111;'> 2.83KB </span>","children":null,"spread":false},{"title":"AtlasStreamingQueryProgressListener.scala <span style='color:#111;'> 2.37KB </span>","children":null,"spread":false},{"title":"AtlasQueryExecutionListener.scala <span style='color:#111;'> 1.62KB </span>","children":null,"spread":false},{"title":"BaseHarvesterSuite.scala <span style='color:#111;'> 2.42KB </span>","children":null,"spread":false},{"title":"ProcessEntityValidator.scala <span style='color:#111;'> 4.42KB </span>","children":null,"spread":false},{"title":"CreateEntitiesTrackingAtlasClient.scala <span style='color:#111;'> 1.95KB </span>","children":null,"spread":false},{"title":"DirectProcessSparkExecutionPlanProcessor.scala <span style='color:#111;'> 1.23KB </span>","children":null,"spread":false},{"title":"FsEntityValidator.scala <span style='color:#111;'> 2.45KB </span>","children":null,"spread":false},{"title":"TableEntityValidator.scala <span style='color:#111;'> 4.11KB </span>","children":null,"spread":false}],"spread":false},{"title":"LoadDataHarvesterSuite.scala <span style='color:#111;'> 3.64KB </span>","children":null,"spread":false},{"title":"InsertIntoHarvesterSuite.scala <span style='color:#111;'> 12.28KB </span>","children":null,"spread":false},{"title":"CreateViewHarvesterSuite.scala <span style='color:#111;'> 4.04KB </span>","children":null,"spread":false},{"title":"SparkExecutionPlanProcessorWithRemoteHiveMetastoreServiceSuite.scala <span style='color:#111;'> 5.93KB </span>","children":null,"spread":false},{"title":"SparkExecutionPlanProcessorForComplicatedQuerySuite.scala <span style='color:#111;'> 4.55KB </span>","children":null,"spread":false},{"title":"CatalogEventToAtlasIT.scala <span style='color:#111;'> 5.83KB </span>","children":null,"spread":false},{"title":"CreateDataSourceTableAsSelectHarvesterSuite.scala <span style='color:#111;'> 4.11KB </span>","children":null,"spread":false},{"title":"InsertIntoHiveDirHarvesterSuite.scala <span style='color:#111;'> 3.55KB </span>","children":null,"spread":false},{"title":"SparkExecutionPlanProcessForRdbmsQuerySuite.scala <span style='color:#111;'> 4.22KB </span>","children":null,"spread":false},{"title":"SparkExecutionPlanProcessorForStreamingQuerySuite.scala <span style='color:#111;'> 26.78KB </span>","children":null,"spread":false},{"title":"SparkExecutionPlanProcessorForBatchQuerySuite.scala <span style='color:#111;'> 16.42KB </span>","children":null,"spread":false},{"title":"SparkCatalogEventProcessorSuite.scala <span style='color:#111;'> 8.97KB </span>","children":null,"spread":false},{"title":"CreateHiveTableAsSelectHarvesterSuite.scala <span style='color:#111;'> 7.82KB </span>","children":null,"spread":false}],"spread":false},{"title":"WithHiveSupport.scala <span style='color:#111;'> 2.44KB </span>","children":null,"spread":false},{"title":"ml","children":[{"title":"MLPipelineTrackerIT.scala <span style='color:#111;'> 3.32KB </span>","children":null,"spread":false},{"title":"MLPipelineWithSaveIntoSuite.scala <span style='color:#111;'> 9.04KB </span>","children":null,"spread":false}],"spread":false},{"title":"utils","children":[{"title":"JdbcUtilsTest.scala <span style='color:#111;'> 3.42KB </span>","children":null,"spread":false},{"title":"SparkUtilsSuite.scala <span style='color:#111;'> 2.76KB </span>","children":null,"spread":false}],"spread":false},{"title":"AtlasEntityCreationRequestHelperSuite.scala <span style='color:#111;'> 4.80KB </span>","children":null,"spread":false},{"title":"WithRemoteHiveMetastoreServiceSupport.scala <span style='color:#111;'> 2.81KB </span>","children":null,"spread":false},{"title":"KafkaClientIT.scala <span style='color:#111;'> 4.76KB </span>","children":null,"spread":false},{"title":"TestUtils.scala <span style='color:#111;'> 2.64KB </span>","children":null,"spread":false},{"title":"types","children":[{"title":"AtlasExternalEntityUtilsSuite.scala <span style='color:#111;'> 6.48KB </span>","children":null,"spread":false},{"title":"MLAtlasEntityUtilsSuite.scala <span style='color:#111;'> 4.33KB </span>","children":null,"spread":false},{"title":"SparkAtlasEntityUtilsSuite.scala <span style='color:#111;'> 4.63KB </span>","children":null,"spread":false}],"spread":false},{"title":"BaseResourceIT.scala <span style='color:#111;'> 3.90KB </span>","children":null,"spread":false},{"title":"WithHDFSSupport.scala <span style='color:#111;'> 2.63KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}],"spread":true}],"spread":true}],"spread":true},{"title":"resources","children":[{"title":"atlas-application.properties <span style='color:#111;'> 10.25KB </span>","children":null,"spread":false},{"title":"users.parquet <span style='color:#111;'> 615B </span>","children":null,"spread":false},{"title":"log4j.properties <span style='color:#111;'> 1.33KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"main","children":[{"title":"scala","children":[{"title":"com","children":[{"title":"hortonworks","children":[{"title":"spark","children":[{"title":"atlas","children":[{"title":"sql","children":[{"title":"CommandsHarvester.scala <span style='color:#111;'> 25.26KB </span>","children":null,"spread":false},{"title":"SparkCatalogEventProcessor.scala <span style='color:#111;'> 5.71KB </span>","children":null,"spread":false},{"title":"Harvester.scala <span style='color:#111;'> 992B </span>","children":null,"spread":false},{"title":"KafkaTopicInformation.scala <span style='color:#111;'> 1.12KB </span>","children":null,"spread":false},{"title":"SparkExecutionPlanProcessor.scala <span style='color:#111;'> 7.33KB </span>","children":null,"spread":false},{"title":"SparkExtension.scala <span style='color:#111;'> 2.27KB </span>","children":null,"spread":false}],"spread":true},{"title":"KafkaAtlasClient.scala <span style='color:#111;'> 3.69KB </span>","children":null,"spread":false},{"title":"ml","children":[{"title":"MLPipelineEventProcessor.scala <span style='color:#111;'> 6.81KB </span>","children":null,"spread":false}],"spread":false},{"title":"utils","children":[{"title":"ReflectionHelper.scala <span style='color:#111;'> 3.94KB </span>","children":null,"spread":false},{"title":"CatalogUtils.scala <span style='color:#111;'> 1.98KB </span>","children":null,"spread":false},{"title":"Logging.scala <span style='color:#111;'> 1.64KB </span>","children":null,"spread":false},{"title":"JdbcUtils.scala <span style='color:#111;'> 3.87KB </span>","children":null,"spread":false},{"title":"SparkUtils.scala <span style='color:#111;'> 8.62KB </span>","children":null,"spread":false}],"spread":false},{"title":"SACAtlasEntity.scala <span style='color:#111;'> 2.29KB </span>","children":null,"spread":false},{"title":"AtlasClientConf.scala <span style='color:#111;'> 2.45KB </span>","children":null,"spread":false},{"title":"AtlasUtils.scala <span style='color:#111;'> 1.78KB </span>","children":null,"spread":false},{"title":"types","children":[{"title":"metadata.scala <span style='color:#111;'> 1.19KB </span>","children":null,"spread":false},{"title":"internal.scala <span style='color:#111;'> 9.64KB </span>","children":null,"spread":false},{"title":"external.scala <span style='color:#111;'> 10.13KB </span>","children":null,"spread":false},{"title":"AtlasEntityUtils.scala <span style='color:#111;'> 3.86KB </span>","children":null,"spread":false}],"spread":false},{"title":"AtlasEntityCreationRequestHelper.scala <span style='color:#111;'> 4.34KB </span>","children":null,"spread":false},{"title":"AtlasClient.scala <span style='color:#111;'> 4.33KB </span>","children":null,"spread":false},{"title":"AbstractEventProcessor.scala <span style='color:#111;'> 2.66KB </span>","children":null,"spread":false},{"title":"AtlasEntityReadHelper.scala <span style='color:#111;'> 3.30KB </span>","children":null,"spread":false},{"title":"RestAtlasClient.scala <span style='color:#111;'> 3.47KB </span>","children":null,"spread":false},{"title":"SparkAtlasStreamingQueryEventTracker.scala <span style='color:#111;'> 3.13KB </span>","children":null,"spread":false},{"title":"SparkAtlasEventTracker.scala <span style='color:#111;'> 3.14KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}],"spread":true}],"spread":true},{"title":"org","children":[{"title":"apache","children":[{"title":"spark","children":[{"title":"sql","children":[{"title":"kafka010","children":[{"title":"atlas","children":[{"title":"ExtractFromDataSource.scala <span style='color:#111;'> 14.36KB </span>","children":null,"spread":false}],"spread":false}],"spread":false}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true}],"spread":true},{"title":"pom.xml <span style='color:#111;'> 1.39KB </span>","children":null,"spread":false}],"spread":true},{"title":".travis.yml <span style='color:#111;'> 986B </span>","children":null,"spread":false},{"title":"LICENSE <span style='color:#111;'> 11.09KB </span>","children":null,"spread":false},{"title":"build","children":[{"title":"mvn <span style='color:#111;'> 3.84KB </span>","children":null,"spread":false}],"spread":true},{"title":"README.md <span style='color:#111;'> 8.58KB </span>","children":null,"spread":false},{"title":"spark-atlas-connector-assembly","children":[{"title":"pom.xml <span style='color:#111;'> 5.13KB </span>","children":null,"spread":false}],"spread":true},{"title":"pom.xml <span style='color:#111;'> 20.92KB </span>","children":null,"spread":false},{"title":"scalastyle-config.xml <span style='color:#111;'> 16.89KB </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 77B </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明