Member since 07-25-2016
61 Posts
32 Kudos Received
10 Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 1475 | 11-14-2018 07:51 PM
 | 5118 | 09-20-2018 03:48 AM
 | 4543 | 09-13-2017 05:38 PM
 | 4229 | 07-27-2017 06:54 PM
 | 1129 | 06-30-2017 05:21 AM
09-20-2018
03:48 AM
1 Kudo
@Gray Pickney Filtering in the lineage graph is not currently available. This feature has been requested many times, and we will work toward including it in our next Atlas release.
09-19-2018
10:23 PM
There are a few issues with reusing the same entities after a delete:
1. If the previously deleted entity has tags (say, PII) associated with it, does the newly created table inherit these tags?
2. If the new table is created with more or fewer columns than the original, reusing the table will not help.
If lineage cluttering is the issue, we can work toward adding filtering in the lineage graph, such as excluding deleted entities from rendering. Updating an entity's status from DELETED to ACTIVE doesn't help, because only entity attribute updates are honored; status, createTime, and updateTime are all treated as system attributes of the entity.
09-13-2017
05:47 PM
@Laura Ngo, this is a work in progress, and we will be adding functionality to restrict certain tags from propagation using relationships. For more information on relationships, see ATLAS-1690.
09-13-2017
05:38 PM
The following GET requests list all type definitions in Atlas:
http://localhost:21000/api/atlas/v2/types/typedefs
http://localhost:21000/api/atlas/v2/types/typedefs/headers
Please refer to the v2 REST API documentation: http://atlas.apache.org/api/v2/resource_TypesREST.html
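For anyone calling these endpoints from a script, here is a minimal Python sketch that builds the two GET requests without sending them. The host and port come from the URLs above. The `admin`/`admin` credentials, the use of basic authentication, and the helper name `typedefs_request` are assumptions for illustration; adjust them to your cluster.

```python
import base64
import urllib.request

# Base URL taken from the post; change it for your Atlas server.
ATLAS_BASE = "http://localhost:21000/api/atlas/v2"


def typedefs_request(headers_only=False, user="admin", password="admin"):
    """Build (but do not send) a GET request for the Atlas type definitions.

    headers_only=True targets /types/typedefs/headers instead of the
    full /types/typedefs listing. Credentials are placeholder assumptions.
    """
    url = ATLAS_BASE + "/types/typedefs"
    if headers_only:
        url += "/headers"
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return urllib.request.Request(
        url,
        headers={
            "Authorization": "Basic " + token,
            "Accept": "application/json",
        },
        method="GET",
    )
```

To actually run the query, pass the result to `urllib.request.urlopen(...)` and parse the response body with `json.load`.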
07-27-2017
06:54 PM
3 Kudos
@Smart Data
Kafka messaging still uses v1 structured Atlas messages. You supplied object id references for the inputs and outputs attribute values; if you change them to the Id format (v1 structure), it works. We are currently migrating the v1 Kafka messages to the v2 structure. More information on the Kafka message format supported by Atlas can be found here: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.1/bk_data-governance/content/atlas_messaging_integration.html
The following command works. Please let us know if you still see the issue.
echo '{"message":{"entities":[{"id":{"id":"-1467290565135246000","jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Id","state":"ACTIVE","typeName":"Process","version":0},"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Reference","traitNames":[],"traits":{},"typeName":"Process","values":{"qualifiedName":"qn_process_cp","processor_name":"file_copy_","name":"n_process_cp","inputs":[{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Reference","id":{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Id","id":"b385b0ba-a806-44ae-b551-1a210c0e4c8a","version":0,"typeName":"hdfs_path","state":"ACTIVE"},"typeName":"hdfs_path","values":{"name":"hdfs_path_a","qualifiedName":"hdfs_path_a","path":"hdfs_path_a"},"traitNames":[],"traits":{},"systemAttributes":{}}],"outputs":[{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Reference","id":{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Id","id":"746dee24-0820-40e2-922c-759693085a0c","version":0,"typeName":"hdfs_path","state":"ACTIVE"},"typeName":"hdfs_path","values":{"name":"hdfs_path_b","qualifiedName":"hdfs_path_b","path":"hdfs_path_b"},"traitNames":[],"traits":{},"systemAttributes":{}}]}}],"type":"ENTITY_CREATE","user":"admin"},"version":{"version":"1.0.0"}}' | /usr/hdp/2.6.2.0-98/kafka/bin/kafka-console-producer.sh --broker-list localhost:6667 --topic ATLAS_HOOK
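If you already have v2-style object id references (a `guid` plus a `typeName`), the conversion to the v1 Id format used in the message above could be sketched in Python as follows. The `jsonClass` strings and field names are copied from that message. The helper names are hypothetical, and sending an empty `values` map when no attributes are supplied is an assumption, not something the message format is confirmed to allow.

```python
# jsonClass markers copied from the working message in this post.
ID_CLASS = "org.apache.atlas.typesystem.json.InstanceSerialization$_Id"
REF_CLASS = "org.apache.atlas.typesystem.json.InstanceSerialization$_Reference"


def to_v1_reference(guid, type_name, values=None, state="ACTIVE"):
    """Wrap a guid/typeName pair in the v1 Reference + Id structure."""
    return {
        "jsonClass": REF_CLASS,
        "id": {
            "jsonClass": ID_CLASS,
            "id": guid,
            "version": 0,
            "typeName": type_name,
            "state": state,
        },
        "typeName": type_name,
        # Assumption: an empty map is used when no attributes are given.
        "values": values or {},
        "traitNames": [],
        "traits": {},
        "systemAttributes": {},
    }


def to_v1_references(object_ids):
    """Convert a list of v2 object ids ({"guid", "typeName"}) to v1 form."""
    return [to_v1_reference(o["guid"], o["typeName"]) for o in object_ids]
```

The resulting lists can be used as the `inputs` and `outputs` values of the Process entity before serializing the whole message with `json.dumps`.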
07-25-2017
03:57 PM
@Smart Data, can you share more details on how you create the process entity using Kafka?
07-24-2017
06:37 AM
@Carlos Guerra, here are some sample DSL queries that use like:
hive_table where name like "sa?es*"
hive_db where name like "R*"
hive_db where hive_db.name like "R???rt?*" or hive_db.name like "S?l?s" or hive_db.name like "Log*"
hive_db where hive_db.name like "R???rt?*" and hive_db.name like "S?l?s" and hive_db.name like "Log*"
hive_table where name like 'sales*', db where name like 'Sa?es'
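Because these queries contain spaces, quotes, and wildcard characters, they need URL encoding when sent over REST. The sketch below builds a search URL in Python. It assumes the v2 DSL search endpoint is `/api/atlas/v2/search/dsl` with `query` and `limit` parameters, and that the server runs at the localhost address used elsewhere in this thread; the function name is hypothetical.

```python
from urllib.parse import urlencode

# Assumed server address; replace with your Atlas host.
ATLAS_BASE = "http://localhost:21000/api/atlas/v2"


def dsl_search_url(query, limit=25, base=ATLAS_BASE):
    """Return a URL for a DSL search, with the query safely encoded."""
    params = urlencode({"query": query, "limit": limit})
    return f"{base}/search/dsl?{params}"


# Example: one of the sample queries from this post.
url = dsl_search_url('hive_table where name like "sa?es*"')
```

Using `urlencode` keeps characters such as `?` and `"` inside the query value instead of letting them be read as URL syntax.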
07-21-2017
05:40 PM
@Carlos Guerra, the 'entity' attributes should contain only object id references. More information about each referred entity should be added in the 'referredEntities' section. For example, when creating a hive_table entity, the hive_column attributes are referred to by their object ids in the 'entity' section, and the detailed information about each hive column (such as its own attributes) is defined in the 'referredEntities' section. If this answers your question, please accept and upvote.
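To illustrate that split, here is a small Python sketch that assembles such a payload: the table's `columns` attribute holds only object ids, while the full column definitions go under `referredEntities`. All helper names are hypothetical, the negative temporary guids are generated by a simple counter, and the attribute set is deliberately reduced, so a real hive_table may need more attributes (for example a storage descriptor).

```python
import itertools

# Counter used to hand out temporary (negative) guids: "-1", "-2", ...
_temp_ids = itertools.count(1)


def temp_guid():
    """Return a new negative guid string for a not-yet-created entity."""
    return str(-next(_temp_ids))


def object_id(entity):
    """Reduce a full entity to the object id form: guid plus typeName."""
    return {"guid": entity["guid"], "typeName": entity["typeName"]}


def build_table_payload(db_guid, qualified_prefix, table_name, columns):
    """columns is a list of (name, type) pairs; db_guid is an existing db."""
    table = {
        "guid": temp_guid(),
        "typeName": "hive_table",
        "attributes": {
            "name": table_name,
            "qualifiedName": f"{qualified_prefix}.{table_name}",
            "db": {"guid": db_guid, "typeName": "hive_db"},
        },
    }
    column_entities = [
        {
            "guid": temp_guid(),
            "typeName": "hive_column",
            "attributes": {
                "name": name,
                "qualifiedName": f"{qualified_prefix}.{table_name}.{name}",
                "type": col_type,
                "position": position,
                "table": object_id(table),
            },
        }
        for position, (name, col_type) in enumerate(columns)
    ]
    # The 'entity' section carries only references to the columns...
    table["attributes"]["columns"] = [object_id(c) for c in column_entities]
    # ...and the full definitions live in 'referredEntities', keyed by guid.
    return {
        "entity": table,
        "referredEntities": {c["guid"]: c for c in column_entities},
    }
```

Serializing the returned dictionary with `json.dumps` gives a request body in the same overall shape as the one described above.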
07-11-2017
06:14 PM
1 Kudo
@Carlos Guerra, you can use the sample curl script below to create a hive_table entity using the v2 APIs. Please note that all the referred entities, such as hive_column and hive_storagedesc, are referenced using negative long guids, and that a valid db guid is specified for the db attribute. Hope this helps.
curl -X POST \
  http://localhost:21000/api/atlas/v2/entity \
  -H 'authorization: Basic YWRtaW46YWRtaW4=' \
  -H 'content-type: application/json' \
  -d '{
  "entity": {
    "guid": "-208942807557404",
    "status": "ACTIVE",
    "version": 0,
    "typeName": "hive_table",
    "attributes": {
      "owner": "hive",
      "temporary": false,
      "name": "test_table_0001",
      "qualifiedName": "my_db.test_table_0001@hdp_cluster",
      "columns": [
        {
          "guid": "-208942807557402",
          "typeName": "hive_column"
        },
        {
          "guid": "-208942807557401",
          "typeName": "hive_column"
        },
        {
          "guid": "-208942807557400",
          "typeName": "hive_column"
        }
      ],
      "tableType": "MANAGED_TABLE",
      "sd": {
        "guid": "-208942807557403",
        "typeName": "hive_storagedesc"
      },
      "partitionKeys": [
        {
          "guid": "-208942807557405",
          "typeName": "hive_column"
        }
      ],
      "parameters": {
        "totalSize": "0",
        "rawDataSize": "0",
        "numRows": "0",
        "COLUMN_STATS_ACCURATE": "{\"BASIC_STATS\":\"true\"}",
        "numFiles": "0",
        "transient_lastDdlTime": "1497764442"
      },
      "db": {
        "guid": "90a7d3af-873a-4c10-a815-069f2d47d490",
        "typeName": "hive_db"
      },
      "retention": 0
    }
  },
  "referredEntities": {
    "-208942807557403": {
      "guid": "-208942807557403",
      "status": "ACTIVE",
      "version": 0,
      "typeName": "hive_storagedesc",
      "attributes": {
        "qualifiedName": "my_db.test_table_0001@hdp_cluster_storage",
        "storedAsSubDirectories": false,
        "location": "hdfs://localhost.localdomain:8020/apps/hive/warehouse/my_db.db/test_table_0001",
        "compressed": false,
        "inputFormat": "org.apache.hadoop.mapred.TextInputFormat",
        "outputFormat": "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat",
        "parameters": {},
        "serdeInfo": {
          "typeName": "hive_serde",
          "attributes": {
            "serializationLib": "org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe",
            "parameters": {
              "serialization.format": "1"
            }
          }
        },
        "table": {
          "guid": "-208942807557404",
          "typeName": "hive_table"
        },
        "numBuckets": -1
      }
    },
    "-208942807557402": {
      "guid": "-208942807557402",
      "status": "ACTIVE",
      "version": 0,
      "typeName": "hive_column",
      "attributes": {
        "owner": "hive",
        "qualifiedName": "my_db.test_table_0001.col1@hdp_cluster",
        "name": "col1",
        "position": 0,
        "type": "int",
        "table": {
          "guid": "-208942807557404",
          "typeName": "hive_table"
        }
      }
    },
    "-208942807557401": {
      "guid": "-208942807557401",
      "status": "ACTIVE",
      "version": 0,
      "typeName": "hive_column",
      "attributes": {
        "owner": "hive",
        "qualifiedName": "my_db.test_table_0001.col2@hdp_cluster",
        "name": "col2",
        "position": 1,
        "type": "string",
        "table": {
          "guid": "-208942807557404",
          "typeName": "hive_table"
        }
      }
    },
    "-208942807557400": {
      "guid": "-208942807557400",
      "status": "ACTIVE",
      "version": 0,
      "typeName": "hive_column",
      "attributes": {
        "owner": "hive",
        "qualifiedName": "my_db.test_table_0001.col3@hdp_cluster",
        "name": "col3",
        "position": 2,
        "type": "string",
        "table": {
          "guid": "-208942807557404",
          "typeName": "hive_table"
        }
      }
    },
    "-208942807557405": {
      "guid": "-208942807557405",
      "status": "ACTIVE",
      "version": 0,
      "typeName": "hive_column",
      "attributes": {
        "owner": "hive",
        "qualifiedName": "my_db.test_table_0001.col4@hdp_cluster",
        "name": "col4",
        "position": 5,
        "type": "string",
        "table": {
          "guid": "-208942807557404",
          "typeName": "hive_table"
        }
      }
    }
  }
}'
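With this many negative guids in one payload, it is easy to reference a temporary guid that is never defined. The following Python sketch is one possible pre-flight check, not part of Atlas itself: it assumes that any guid starting with "-" is a temporary one that must appear either as the top-level entity or as a key in `referredEntities`, and the function names are hypothetical.

```python
def referenced_guids(value):
    """Yield the guid of every object id reference nested inside value."""
    if isinstance(value, dict):
        # An object id reference has a guid and typeName but no attributes.
        if "guid" in value and "typeName" in value and "attributes" not in value:
            yield value["guid"]
        else:
            for nested in value.values():
                yield from referenced_guids(nested)
    elif isinstance(value, list):
        for item in value:
            yield from referenced_guids(item)


def unresolved_references(payload):
    """Return temporary guids that are referenced but never defined."""
    referred = payload.get("referredEntities", {})
    defined = {payload["entity"]["guid"], *referred}
    missing = set()
    for entity in [payload["entity"], *referred.values()]:
        for guid in referenced_guids(entity.get("attributes", {})):
            # Assumption: a leading "-" marks a temporary guid; other guids
            # (like the db guid) point at entities that already exist.
            if guid.startswith("-") and guid not in defined:
                missing.add(guid)
    return sorted(missing)
```

Loading the JSON body with `json.loads` and calling `unresolved_references` on it should return an empty list when every temporary reference is defined.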
07-11-2017
08:14 AM
@Saba Baig, you can search for any entities tagged with a particular tag name by enclosing the name in single quotes. For example, `Log Data` searches for all entities in Atlas tagged as Log Data. Only the classification name is supported in single quotes.