# dqmm-dsp-python-sample
## Description
This sample demonstrates how data in Datasphere can be enriched with geo-coordinates using Data Quality Management microservices and Python.
To use this sample Python script, a view needs to be created over the data to be enriched with geo-coordinates. The view projects the columns to names recognized by the Data Quality Management microservice and includes an ID column that uniquely identifies each row. Optionally, the view can also filter to only those rows desired for the geo analysis.
The resultant geo-coordinates are written to a table in the Datasphere database user's Open SQL schema. The value from the ID column in the view is included with the results so that the geo-coordinates can be joined to the original data.
The results are written to a table whose name is the technical name of the input view appended with “_GEO”. This table is created within the database user’s schema and contains the following columns:
| Column name | Description |
| ----------- | ----------- |
| ID | The ID provided in the view. This column is used to join the results back to the original data.<br>The data type will be the data type of the ID column in the view. |
| LATLONG | A geometry column used to store the latitude and longitude. This column is included to make it easy to use with a geo map in SAP Analytics Cloud.<br>The data type is ST_GEOMETRY and is populated with ST_POINT. |
| LATITUDE | The latitude. This column is included as a convenience to understand the results.<br>The data type is DOUBLE. |
| LONGITUDE | The longitude. This column is included as a convenience to understand the results.<br>The data type is DOUBLE. |
| GEO_ASMT_LEVEL | The level of assignment of the geo-coordinates. This can be used to filter the results to only include those with sufficient accuracy. The possible values can be found at https://help.sap.com/docs/data-quality-services/data-quality-services/geo-location-coordinates.<br>The data type is NVARCHAR(4). |
If a geo-coordinate cannot be assigned for a row, the row is still added to the output table with the appropriate ID value and all remaining columns set to NULL.
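For illustration, the results can be joined back to the original data through the ID column. The following is a minimal sketch, assuming a hypothetical view named ADDRESSES in a space named MY_SPACE, a database user named MY_SPACE#DQM_USER, and the db-user.json credentials file described under Requirements below; the actual names in your tenant will differ.
```python
# Minimal sketch (hypothetical names): join the generated geo-coordinates
# back to the original rows via the ID column.
import json
from hdbcli import dbapi

cfg = json.load(open("db-user.json"))
# In Datasphere, the database user name matches its Open SQL schema name.
conn = dbapi.connect(address=cfg["hostName"], port=int(cfg["port"]),
                     user=cfg["schema"], password=cfg["password"], encrypt=True)
cursor = conn.cursor()
cursor.execute(
    'SELECT v."ID", g."LATITUDE", g."LONGITUDE", g."GEO_ASMT_LEVEL" '
    'FROM "MY_SPACE"."ADDRESSES" v '
    'LEFT JOIN "MY_SPACE#DQM_USER"."ADDRESSES_GEO" g ON v."ID" = g."ID"')
for row in cursor.fetchall():
    print(row)
conn.close()
```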
## Requirements
A Datasphere tenant is required. While the script could be modified to work with any SAP HANA database instance, it assumes certain Datasphere naming conventions.
A subscription to Data Quality Management microservices is required to generate the geo-coordinates. To try the service for free, please see https://community.sap.com/t5/technology-blogs-by-sap/getting-started-with-sap-data-quality-management-microservices-for-location/ba-p/13527838.
A Python runtime is required to run the script.
The sample script uses the hdbcli library to access Datasphere. Use the following command to install the library…
>pip install hdbcli
Please see https://pypi.org/project/hdbcli/ for more information.
The requests library is used to make calls to Data Quality Management microservices. Use the following command to install the library…
>pip install requests
The script expects two configuration files in the folder from which the script is being run.
| File name | Description |
| --------- | ----------- |
| db-user.json | Contains the database user credentials used to connect to the Datasphere instance. A template for this file is included with the sample. |
| service-key.json | Contains the service key JSON from the service instance used to access Data Quality Management microservices. An empty file with this name is included in the sample. |
>**Note:** The script requires Datasphere database user and BTP service key credentials to be stored in the db-user.json and service-key.json files respectively. These files should be marked with the appropriate permissions to protect their contents.
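For example, on a POSIX system the credential files can be restricted to the owner with a few lines of Python (on Windows, use file ACLs instead):
```python
# Restrict both credential files to owner read/write only (POSIX permissions).
import os

for name in ("db-user.json", "service-key.json"):
    os.chmod(name, 0o600)
```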
### Architecture
![Alt text](resources/dqmm-dsp-python.png?raw=true "Architecture")
<br>
## Configuration
<br>
### Create a Datasphere database user
A Datasphere database user needs to be created within the space containing the data to be enriched. Go to Datasphere Space Management, create a database user, and enable **<em>Read Access</em>** and **<em>Write Access</em>** so that the data to be enriched can be read and the results can be written into the database user’s schema.
![Alt text](resources/dqmm-dsp-python-sample-db-user.png?raw=true "Database User")
For more information, please see https://help.sap.com/docs/SAP_DATASPHERE/be5967d099974c69b77f4549425ca4c0/798e3fd6707940c3bd2219b2d1ebaac2.html.
<br>
### Update db-user.json with the database user credentials
The sample provides a db-user.json file with a JSON structure that needs to be populated with the database user information.
```json
{
    "hostName": "",
    "port": "443",
    "schema": "",
    "password": ""
}
```
Open the Database User Details for the database user, and copy the values for Host Name, Port, Open SQL Schema, and Password into the db-user.json file.
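As a sketch of how these values are used (the script's actual connection code may differ), hdbcli can connect with them as follows; note that in Datasphere the database user name is the same as its Open SQL schema name:
```python
# Minimal sketch: connect to Datasphere with the values from db-user.json.
import json
from hdbcli import dbapi

with open("db-user.json") as f:
    cfg = json.load(f)

conn = dbapi.connect(
    address=cfg["hostName"],   # Host Name from the Database User Details
    port=int(cfg["port"]),     # Port, typically 443
    user=cfg["schema"],        # database user name = Open SQL schema name
    password=cfg["password"],  # Password
    encrypt=True,              # Datasphere connections require TLS
)
```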
<br>
### Create a service key for your Data Quality Management microservices service instance
A service key is used to expose the credentials needed to send requests to the service.
This sample script works with service keys created for OAuth authentication, which is the default for service keys associated with Data Quality Management microservices.
For more information, please see https://help.sap.com/docs/data-quality-services/data-quality-services/enabling-dqm-microservices-687506470c6f4becbd64334d1965b476.
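As an illustration of how such a key is used, the following is a minimal sketch of requesting an OAuth token with the client-credentials grant. It assumes an XSUAA-style service key containing a uaa section with clientid, clientsecret, and url; the exact layout of your key may differ.
```python
# Minimal sketch: fetch an OAuth token using the client-credentials grant.
import json
import requests

with open("service-key.json") as f:
    key = json.load(f)

resp = requests.post(
    key["uaa"]["url"] + "/oauth/token",        # assumed XSUAA token endpoint
    data={"grant_type": "client_credentials"},
    auth=(key["uaa"]["clientid"], key["uaa"]["clientsecret"]),
)
resp.raise_for_status()
token = resp.json()["access_token"]
headers = {"Authorization": "Bearer " + token}  # sent with each service call
```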
<br>
### Update service-key.json with the service key JSON
The sample provides an empty file named service-key.json. The service key JSON needs to be copied into this file.
The service key can be viewed in the BTP cockpit. When viewing the service key, the JSON can be copied to the clipboard. Paste the JSON into the service-key.json file.
Alternatively, you can export the service key JSON into a file, rename it to service-key.json, and then copy it into the folder from which you are running the Python script.
<br>
## Process Data
Perform the following steps to process your data.
### Create a view in your space to project the data to service-related input fields
The view accomplishes two things. The first is to make the data accessible outside of the space. When creating a view, enable **<em>Expose for Consumption</em>** so that the view can be accessed using the database user.
The second is to map column names to service-related input field names. The allowable input field names can be found at https://help.sap.com/docs/data-quality-services/data-quality-services/input-fields.
In addition to the columns required by the service, an additional column named **<em>ID</em>** needs to be included in the view. This column is used to join the output back to the original input.
![Alt text](resources/dqmm-dsp-python-sample-view.png?raw=true "View")
The view can also be used to filter records so that only those meeting specified criteria are processed, for example, only records from a specific country or region.
The script assumes that the view is created in the space associated with the database user. The name of the database user has the following format:
**<em>space-name</em>**#**<em>database-user-name-suffix</em>**
The script will access the view in the **<em>space-name</em>** schema.
When creating the view, note the technical name of the view since that is the name that will need to be provided to the script.
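As a small sketch (hypothetical names), the space schema can be derived from the database user name:
```python
# Minimal sketch: derive the space schema from the database user name,
# which has the form space-name#database-user-name-suffix.
user = "MY_SPACE#DQM_USER"              # hypothetical database user name
space_schema = user.split("#")[0]       # -> "MY_SPACE"
view = f'"{space_schema}"."ADDRESSES"'  # fully qualified view name
```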
<br>
### Run the script
The script is run using the Python runtime. The script accepts one argument, which is the name of the view to be processed.
The script can be run using the following command…
python3 dqmm-dsp.py **<em>technical-view-name</em>**
where **<em>technical-view-name</em>** is the technical name of the view to be processed.
The script reads records from the view in groups of 100 and displays a progress message after each group.
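A minimal sketch of this batching pattern (the actual internals of the script may differ):
```python
# Minimal sketch: read the view in groups of 100 and report progress.
def process_view(cursor, view_name: str) -> None:
    cursor.execute(f'SELECT * FROM {view_name}')  # hdbcli cursor on the view
    processed = 0
    while True:
        batch = cursor.fetchmany(100)  # next group of 100 records
        if not batch:
            break
        # ...send the batch to Data Quality Management microservices and
        # write the returned geo-coordinates to the output table...
        processed += len(batch)
        print(f"{processed} records processed")
```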
The script creates a table in the database user's Open SQL schema, named after the view with "_GEO" appended as described above, and writes the geo-coordinate results to it.
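For reference, a minimal sketch of DDL and insert statements the output table could correspond to (the DDL actually issued by the script may differ; the ID type and the sample values are purely illustrative):
```python
# Minimal sketch: create the output table and insert one result row.
ddl = ('CREATE TABLE "ADDRESSES_GEO" ('
       '"ID" NVARCHAR(36), "LATLONG" ST_GEOMETRY(4326), '
       '"LATITUDE" DOUBLE, "LONGITUDE" DOUBLE, "GEO_ASMT_LEVEL" NVARCHAR(4))')
ins = ('INSERT INTO "ADDRESSES_GEO" VALUES '
       "(?, ST_GeomFromText(?, 4326), ?, ?, ?)")  # point built from WKT
row = ("42",                      # ID from the view
       "POINT(8.6404 49.2935)",   # WKT: longitude latitude
       49.2935, 8.6404,           # LATITUDE, LONGITUDE
       "PRE1")                    # hypothetical assignment-level code
# cursor.execute(ddl); cursor.execute(ins, row)  # given an hdbcli cursor
```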