
Uploading CSV files to BICS Visual Analyzer and Data Visualization Cloud Service


Introduction

This post details a new feature introduced in Version 2.1 of the Oracle BICS Data Sync tool.

Currently BICS Visual Analyzer (VA) and Data Visualization Cloud Service (DVCS) users may upload Microsoft Excel Workbooks (.XLSX) but not Comma Separated Values (CSV) files. This is an issue for use cases that use a CSV file produced from a data extraction utility, particularly if the CSV file is updated regularly.

This post provides an easy way to upload CSV files to BICS or DVCS as a Data Set for the above use case.

The Data Sync tool also provides the following advantages:

* The ability to schedule periodic loads of the file(s)
* The ability for the Data Sync Load to be triggered by the successful completion of an event.

Prerequisites

If necessary, download and install the Oracle BICS Data Sync utility from the Oracle Technology Network (http://www.oracle.com/technetwork/middleware/bicloud/downloads/index.html), along with the accompanying installation documentation, the BICS Data Sync Getting Started Guide.

Other A-Team Chronicles Blogs detail how to perform the installation, for example: Configuring the Data Sync Tool for BI Cloud Service (BICS)

Steps

Create a BICS/DVCS Target Connection

If you already have a BICS or DVCS connection, proceed to Create a Project and/or a Job below.

The Data Sync installation may have created a connection named Target with a connection type of Oracle (BICS). If so, edit this one or create a new one. As shown in the figure below, enter the User and Password for the BICS or DVCS you want to upload to. Enter the URL of the service and click on Test Connection.

Note: The connection type of Oracle (BICS) is the correct type for DVCS also.


Note: The URL is the URL shown in your browser minus the “/va” and everything following. An example is shown in the figure below.


Create a Project and/or a Job

If you already have a project that contains a job whose primary target is the BICS or DVCS connection, proceed to Create a File Data Task below.

From the Menu Bar, select File > Projects > Create a New Project, enter a name and click OK as shown below.


Create a File Data Task

Under the Menu Bar, select the Project group, select the new project name, select the File Data tab below the Project group and click New as shown below.


Select the CSV File Location, accept the File Name, assign a Logical Name (with no spaces) and click Next as shown below.


Edit or accept the Import Options and click Next as shown below.

Note: This step imports only the column metadata in the file (data type, length, etc.) and not the actual data. The sampling size is usually sufficient.


Check the Create New box, enter a Data Set name, select Data Set for the Output Option and click Next as shown below.


Edit or accept the Map Columns settings and click OK as shown below.


Update and Run the Data Sync Job and Review the Results

Under the Menu Bar, select the Jobs group, select the Jobs tab below the Jobs group, right-click on the job name and click Update Job as shown below.


To the right of the Jobs group, click on Run Job as shown below.


The job should run quickly. Select the History tab and the job will show completed. Click on the Tasks tab below the job status line and the task will show the number of records uploaded as shown below.


View the Cloud Service Data Set

Log into the BICS VA or DVCS, click on New Project, select Data Sets as the Source and the uploaded Data Set created from the CSV file will be displayed as shown below.


Summary

This post describes a method of using the Oracle Data Sync utility to upload a CSV file to either BICS or DVCS as a Data Set that may be used in VA / DVCS projects.

Additional information on Data Sync, including scheduling and triggering Data Sync jobs, may be found on OTN at http://www.oracle.com/technetwork/middleware/bicloud/downloads/index.html.

For more BICS best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

 

 


Installing Data Sync in Compute for Cloud to Cloud Loading into BICS


For other A-Team articles about BICS and Data Sync, click here

Introduction

The Data Sync tool provides the ability to extract from both on-premises and cloud data sources, and to load that data into BI Cloud Service (BICS) and other relational databases.  In some use cases, both the source databases and the target may be in ‘the Cloud’.  Rather than run the Data Sync tool ‘On-Premise’ to extract data down from the cloud, only to load it back up again, this article outlines an approach where the Data Sync tool is installed and run in an Oracle Compute Instance in the Cloud.  In this way all data movement and processing happens in ‘the cloud’ and no on-premise install is required.

 

Main Article

In this example Data Sync will be installed into its own Instance in Oracle Compute.

In theory you could install into any existing compute instance, for example JCS, DBCS, etc., although then the Data Sync tool would share the same file system as other applications.  This could be a problem, for example, in the case of a restore where files may be overwritten.  Where possible, it is therefore recommended that a separate Compute Instance is created for Data Sync.

Create Compute Instance

1. In Compute, choose a suitable Image, Shape and Storage for the planned workload.  It is recommended to give Data Sync at least 8 GB of memory.  It is suggested NOT to select the ‘minimal’ image as that will require additional packages to be loaded later.

2. In this example the OL-6.6-20GB-x11-RD image was used, along with a general purpose oc4 shape with 15 GB of memory and 20 GB of storage:


3. Once created, obtain the Public IP from the instance.


 

Create SSH Session and Install VNC

We will set up an SSH connection and a VNC session on the Compute Instance for Data Sync to run in. When the user disconnects from the session, Data Sync will continue to operate.  It will also allow multiple developers to connect to VNC and share the same session from anywhere in the world.

There are many SSH tools; in this case the free Windows tool Putty will be used, although other tools can be configured in a similar manner.  Putty can be downloaded from here.

1. Open Putty and set up a connection using the Public IP of the Instance obtained in the previous section and port 22.


2. Expand the ‘Connection’ / ‘SSH’ / ‘Auth’ menu item.  Browse in the ‘Private key file for authentication’ section to the Private Key companion to the Public Key used in the creation of the Compute Instance in the previous section.


3. Return to the ‘Session’ section, give the session a name and save it.  Then hit ‘Open’ to start the connection to the Compute Instance.


4. For the ‘Login as’ user, enter ‘opc’ and when prompted for the ‘Passphrase’, use the passphrase for the SSH Key.

If the connection is successful, then a command prompt should appear after these have been entered:


5. As the opc user, edit sshd_config.

sudo vi /etc/ssh/sshd_config

Uncomment all instances of X11Forwarding and change the following word to be ‘yes’


6. Save the file, and then restart sshd by running the following command:

sudo /etc/init.d/sshd restart

7. Switch to the Oracle user

sudo -su oracle

8. Run the following command to prevent the Window Manager from displaying a lock screen:

gconftool-2 -s -t bool /apps/gnome-screensaver/lock_enabled false

9. Start VNC server with the following command:

vncserver :1 -depth 16 -alwaysshared -geometry 1200x750 -s off

10. Figure out which port VNC is using

We’re going to use SSH port forwarding.  To do this, we need to confirm the port that is being used by VNC.

Typically the port is 5900 + N, where N is the display number.

In the screenshot below when VNC was started, it shows the screen is number 1 (the value after the ‘:’ in “d32f4d : 1” ) so in this case the port is 5901.  This will typically be the port number, but if other VNC sessions are already running, then it may be different.

To test this, run this command:

netstat -anp | grep 5901

This should confirm the process listening on that port – in this case, VNC:


11. Type ‘exit’ and press return once to exit the oracle user, then type ‘exit’ and press return again to close the putty session.

 

Create SSH Tunnel and Start VNC Session

1. Create the SSH Tunnel

Open putty again and load the saved session from earlier.  Open the ‘Connection’ / ‘SSH’ / ‘Tunnel’ menu item.

We need to create an SSH tunnel to forward VNC traffic from the local host to port 5901 on the Compute Instance.

In this example we enter the Local Port also as 5901, and then in the Destination, the IP address of the Compute Instance, followed by a ‘:’ and then 5901.  Select ‘Add’ to set up the tunnel.

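If you prefer a command-line SSH client to Putty, an equivalent tunnel can be opened with OpenSSH. This is only a sketch; the key path and IP address below are placeholders for your own values:

ssh -i /path/to/private_key -L 5901:localhost:5901 opc@<compute_instance_public_ip>

Leave that session open while the VNC viewer is in use, just as with the Putty session.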

2. Return back to the top ‘Session’ menu and ‘Save’ the session again to capture the changes, then Open the session again and connect as ‘opc’ and enter the passphrase.


3.  If a VNC client is not installed on the user’s machine, download one.  In this case the free viewer from RealVNC, which can be downloaded from here, is being used.

4. Open VNC viewer and for the target, enter ‘localhost:5901’.  VNC will attempt to connect to the local port 5901, which will then be redirected by SSH to port 5901 on the target.


Anytime a VNC session is going to be used, the putty session must be open (although some VNC tools will also set up the SSH session for you, in which case you can use that if preferred).

5. Enter the VNC password and the session will be connected.  If there is an error message within the VNC session stating ‘Authentication is Required to set the network proxy used for downloading packages’, then click ‘Cancel’ to ignore it.

 

Install Data Sync Software in Compute Instance

1. Within the connected VNC session, open a Terminal session


2. To turn on copy and paste between the client and the VNC session, enter:

vncconfig -nowin &

 

3. Download the Data Sync and JDK Software

Open Firefox within the VNC session and download the required software.

Data Sync can be found here:  http://www.oracle.com/technetwork/middleware/bicloud/downloads/index.html

JDK downloads can be found here:  http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html

For the JDK, select one of the Linux x64 versions.

4. Plan where to install the software.

Take a look at the file system and see which location makes the most sense in your scenario.  In this example we are using the /home/oracle directory, in a sub-directory we created called ‘datasync’.  Depending on the configuration of the Compute Instance and its storage, there may be better choices.

5. Extract both the JDK and Data Sync software to that directory.


6. Edit the ‘config.sh’ file to point to the location of the JDK

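As a rough sketch of steps 5 and 6 (the archive names below are placeholders for whatever versions were actually downloaded, and the JAVA_HOME variable name should be confirmed against your copy of config.sh):

cd /home/oracle/datasync
# extract the JDK and the Data Sync distribution (archive names are examples only)
tar -xzf /home/oracle/Downloads/jdk-8uNNN-linux-x64.tar.gz
unzip /home/oracle/Downloads/BICSDataSync.zip
# then edit config.sh so that it points at the extracted JDK, for example:
#   JAVA_HOME=/home/oracle/datasync/jdk1.8.0_NNN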

7. Start Data Sync by running

./datasync.sh

 

Then go through the standard steps for setting up and configuring the Data Sync tool.

For more information on setting up Data Sync, see this article.

For information on setting up Data Sync to source from Cloud OTBI environments, see this article.

Other Data Sync documentation can be found here.

 

Once the VNC session has been set up, then other users can also connect.  They will just need to complete the following steps from above:

Create SSH Session and Install VNC, Steps 1, 2 & 3

Create SSH Tunnel and Start VNC Session, Steps 1 & 2

 

Summary

This article walked through the steps to create a Compute Instance, accessible through VNC over SSH, and then to install Data Sync into it, for loading scenarios where an on-premise footprint is not required.

For other A-Team articles about BICS and Data Sync, click here.

Oracle GoldenGate: Testing the Extract’s maximum read performance in extreme environments


MOS Doc ID 2193584.1

Version 1.2 10/14/16

If you have a requirement to process over a Terabyte of redo per hour, you may want to first run a simple test to figure out if your system is capable of reading that much data before you spend a large amount of time trying to tune your Oracle GoldenGate (OGG) configuration.

This method is a simple way to test your system’s ability to read a very large amount of data.  This test only covers the read and not the rest of the processing. Once you determine that the extract can keep up with the read rate, you can move down the chain to the next step in the process.  That will be covered in another paper.

In order to test the extract’s maximum read rate, all you need to do is create the built-in heartbeat table and create an extract with no map statements in it.  In the latest version of OGG, when you create the heartbeat table in GGSCI, OGG will create the heartbeat table and add a job to update the heartbeat every minute.  The extract process automatically adds the heartbeat to the OGG processes; no map statement is required.

First step: install OGG.  The install process is detailed in the OGG documentation.  This process will assume that the OGG software has been installed according to the documentation.

Second step: Check that you have the minimal setup configured.  You need to check that the database privileges, supplemental logging, and Streams pool size are set correctly.  If any of these values are incorrect or not set to the minimum required, please review the OGG install guide for correct settings.  To check this, issue the following commands in SQLPLUS –

Check Supplemental Logging –

col supplemental_log_data_min format A15 heading 'Minimum|supplemental|log data'
col force_logging format A10 heading 'force|logging'
SQL> SELECT supplemental_log_data_min, force_logging FROM v$database;

Minimum
supplemental    force
log data        logging
--------------- ----------
YES             NO

If the result is NO for either or both properties, refer to the OGG install documentation to set them.  Note: you can also enable force logging at a tablespace level.  See the OGG install documentation for more details.
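For reference, these settings are typically enabled with commands along the following lines; confirm the exact requirements for your release against the OGG install guide:

SQL> ALTER DATABASE ADD SUPPLEMENTAL LOG DATA;
SQL> ALTER DATABASE FORCE LOGGING;
-- or, to force logging for a single tablespace instead:
SQL> ALTER TABLESPACE users FORCE LOGGING;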

Check OGG user privileges –

col USERNAME format A10 heading 'User Name'
col PRIVILEGE_TYPE format A10 heading 'PRIVILEGE|TYPE'
col GRANT_SELECT_PRIVILEGES format A10 heading 'GRANT|SELECT|PRIVILEGES'
col CREATE_TIME format A30 heading 'CREATE TIME'
SQL>
SQL> select * from dba_goldengate_privileges;
                      GRANT
           PRIVILEGE  SELECT
User Name  TYPE       PRIVILEGES CREATE TIME
---------- ---------- ---------- ------------------------------
GGADMIN    *          YES        08-JUL-16 11.56.51.792606 AM

Check to make sure Streams Pool size and GoldenGate replication parameters are set –

set linesize 130
col name format a30
col value format a10
col description format a40

select name, value, description, ISSYS_MODIFIABLE
from v$parameter
where
        name like 'enable_goldengate_replication'
        or name like 'streams_pool_size';

NAME                           VALUE      DESCRIPTION                              ISSYS_MOD
------------------------------ ---------- ---------------------------------------- ---------
streams_pool_size              2147483648 size in bytes of the streams pool        IMMEDIATE
enable_goldengate_replication  TRUE       goldengate replication enabled           IMMEDIATE

If enable_goldengate_replication is not set to true you will not be able to start OGG.  If the streams pool is not sized to at least the minimum recommended size, it can cause performance issues.   Please check OGG documentation for recommended streams_pool_size.
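If either value needs to be changed, it can be set with ALTER SYSTEM; the 2G figure below is purely an example and should be sized per the OGG documentation for your redo volume:

SQL> ALTER SYSTEM SET enable_goldengate_replication = TRUE SCOPE=BOTH;
SQL> ALTER SYSTEM SET streams_pool_size = 2G SCOPE=BOTH;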

 

Switch the log files.

SQL> ALTER SYSTEM SWITCH LOGFILE;

Third Step:   Add the heartbeat table.  This feature is part of OGG 12.2 functionality.

Verify that GGSCHEMA name is in the GLOBALS file –

$ (slc09ujv)[a11204s] /scratch/oracle/OGG12.2\> more GLOBALS
GGSCHEMA ggadmin

If GGSCHEMA is not in the GLOBALS file, please add it.

Enable the Heartbeat functionality by executing the GGSCI command ‘ADD HEARTBEATTABLE’.

GGSCI (slc09ujv) 1> dblogin userid ggadmin password ggs
Successfully logged into database.

GGSCI (slc09ujv as ggadmin@a11204s) 2> add heartbeattable
2016-10-04 12:05:02  INFO    OGG-14001  Successfully created heartbeat seed table ["GG_HEARTBEAT_SEED"].
2016-10-04 12:05:02  INFO    OGG-14032  Successfully added supplemental logging for heartbeat seed table ["GG_HEARTBEAT_SEED"].
2016-10-04 12:05:02  INFO    OGG-14000  Successfully created heartbeat table ["GG_HEARTBEAT"].
2016-10-04 12:05:02  INFO    OGG-14033  Successfully added supplemental logging for heartbeat table ["GG_HEARTBEAT"].
2016-10-04 12:05:02  INFO    OGG-14016  Successfully created heartbeat history table ["GG_HEARTBEAT_HISTORY"].
2016-10-04 12:05:02  INFO    OGG-14023  Successfully created heartbeat lag view ["GG_LAG"].
2016-10-04 12:05:02  INFO    OGG-14024  Successfully created heartbeat lag history view ["GG_LAG_HISTORY"].
2016-10-04 12:05:02  INFO    OGG-14003  Successfully populated heartbeat seed table with [A11204S].
2016-10-04 12:05:02  INFO    OGG-14004  Successfully created procedure ["GG_UPDATE_HB_TAB"] to update the heartbeat tables.
2016-10-04 12:05:02  INFO    OGG-14017  Successfully created procedure ["GG_PURGE_HB_TAB"] to purge the heartbeat history table.
2016-10-04 12:05:02  INFO    OGG-14005  Successfully created scheduler job ["GG_UPDATE_HEARTBEATS"] to update the heartbeat tables.
2016-10-04 12:05:02  INFO    OGG-14018  Successfully created scheduler job ["GG_PURGE_HEARTBEATS"] to purge the heartbeat history table.

Fourth Step: Configure a Manager process-

 

If you don’t already have a manager process set up, you will need to create one.  For the purposes of this test a very simple one-line parameter file is all that is needed –

GGSCI> edit params mgr
port 7809

Start the manager process –

GGSCI> start mgr

Fifth step: Create an extract

Add the extract process –

./ggsci
ADD EXTRACT ext_test INTEGRATED TRANLOG, BEGIN NOW

Add the trail –

./ggsci
ADD EXTTRAIL ./dirdat/ET, EXTRACT ext_test

 

Register the Extract –

./ggsci
DBLOGIN USERID ggadmin PASSWORD ggs
register extract ext_test database

Create an extract parameter file –

GGSCI> edit params ext_test

extract ext_test
userid ggadmin, password ggs
LOGALLSUPCOLS
UPDATERECORDFORMAT COMPACT
TRANLOGOPTIONS INTEGRATEDPARAMS (MAX_SGA_SIZE 200)
exttrail ./dirdat/ET

Start the extract process –

GGSCI> start extract ext_test

Sending START request to MANAGER ...
EXTRACT EXT_TEST starting

 

Monitoring performance

 

Once you have started the extract you can monitor the performance using the following query –

set verify off
set linesize 200
set pagesize 80
col extract_name format a8 heading 'Extract|Name'
col Run_time_HR format 99,999.99 heading 'Run Time'
col mined_GB format 999,999.99 heading 'Total GB|mined'
col sent_GB format 999,999.99 heading 'Total GB|sent'
col Sent_GB_Per_HR format 999,999.99 heading 'Total GB|Per HR'
col capture_lag Heading 'Capture|Lag|seconds'
col Current_time Heading 'Current|Time'
col extract_name format a8 heading 'Extract|Name'
col GB_Per_HR format 999,999.99 heading 'GB Mined|Per HR'
alter session set nls_date_format='YYYY-MM-DD HH24:Mi:SS';

select
        EXTRACT_NAME,
        TO_CHAR(sysdate, 'HH24:MI:SS MM/DD/YY') Current_time,
        ((SYSDATE-STARTUP_TIME)*24) Run_time_HR ,
        (SYSDATE- capture_message_create_time)*86400 capture_lag,
        BYTES_OF_REDO_MINED/1024/1024/1024 mined_GB,
        (BYTES_OF_REDO_MINED/1024/1024/1024)/((SYSDATE-STARTUP_TIME)*24) GB_Per_HR,
        BYTES_SENT/1024/1024/1024 sent_GB,
        (BYTES_SENT/1024/1024/1024)/((SYSDATE-STARTUP_TIME)*24) Sent_GB_Per_HR
   from gv$goldengate_capture;
                                       Capture
Extract  Current                           Lag    Total GB    GB Mined    Total GB    Total GB
Name     Time                Run Time  seconds       mined      Per HR        sent      Per HR
-------- ----------------- ---------- -------- ----------- ----------- ----------- -----------
EXT_TEST 09:30:52 10/06/16        .56        2         .00         .01         .00         .00

The output is in GB per hour.  If the columns “SENT_GB” and “SENT_GB_PER_HR” are blank, then the process is not running.

Clean up of the heartbeat table and the heartbeat scheduler

In order to clean up your environment after you are done with the test, you may want to remove the heartbeat table and the DBMS Scheduler job that updates the heartbeat table.  Please note that it is a best practice to use the heartbeat table in an OGG environment.

To remove the Heartbeat table and the DBMS scheduler, issue the following commands in GGSCI –

GGSCI (slc09ujv) 1> dblogin userid ggadmin password ggs
Successfully logged into database.

GGSCI (slc09ujv as ggadmin@a11204s) 2> delete heartbeattable

2016-10-06 14:13:57  INFO    OGG-14007  Heartbeat seed table ["GG_HEARTBEAT_SEED"] dropped.
2016-10-06 14:13:57  INFO    OGG-14009  Heartbeat table ["GG_HEARTBEAT"] dropped.
2016-10-06 14:13:57  INFO    OGG-14011  Heartbeat history table ["GG_HEARTBEAT_HISTORY"] dropped.
2016-10-06 14:13:57  INFO    OGG-14026  Heartbeat lag view ["GG_LAG"] dropped.
2016-10-06 14:13:57  INFO    OGG-14028  Heartbeat lag history view ["GG_LAG_HISTORY"] dropped.
2016-10-06 14:13:57  INFO    OGG-14013  Procedure ["GG_UPDATE_HB_TAB"] dropped.
2016-10-06 14:13:57  INFO    OGG-14020  Procedure ["GG_PURGE_HB_TAB"] dropped.
2016-10-06 14:13:57  INFO    OGG-14015  Scheduler job ["GG_UPDATE_HEARTBEATS"] dropped.
2016-10-06 14:13:57  INFO    OGG-14022  Scheduler job ["GG_PURGE_HEARTBEATS"] dropped.

GGSCI (slc09ujv as ggadmin@a11204s) 3>

At this point the heartbeat table and DBMS scheduler job have been removed from the database.

 

Oracle GoldenGate: Apply to Apache Flume File Roll Sink


Introduction

This is part 1 of a two-part article demonstrating how to replicate data in near-real time from an on-premises database to Oracle Storage Cloud Service.

In this article we shall demonstrate Oracle GoldenGate functionality to capture transactional data in near real-time from an on-premises Oracle Database and apply the records as delimited text data to an Apache Flume File Roll Sink. The Flume Sink may be on-premises or located in the Cloud; however, we will be using an on-premises server for this demonstration. A subsequent article will demonstrate Oracle Storage Cloud Service functionality to store the Flume data files in the Cloud for later use and analysis by Oracle Big Data tools.

We used the Oracle Big Data Lite Virtual Machine as the test bed for this article. The VM image is available for download on the Oracle Technology Network website.

Main Article

Simply put, Apache Flume is a data ingestion mechanism for collecting, aggregating, and transporting large amounts of streaming data from various sources to a centralized data store. The basic architecture of a Flume Agent is depicted below.

[Figure: basic architecture of a Flume Agent]

Data Generator

A data generator is any data feed; such as Twitter, Facebook, or in our case the Oracle GoldenGate Big Data Adapter, that creates data to be collected by the Flume Agent.

Flume Agent

The Flume Agent is a JVM daemon process that receives events from data generator clients or other agents and forwards them to a destination sink or agent. The Flume Agent is comprised of:

Source

The source component receives data from the data generators and transfers it to one or more channels in the form of Flume events.

Channel

A channel is a transient store which receives Flume events from the source and buffers them until they are consumed by sinks.

Sink

A sink consumes Flume events from the channels and delivers them to the destination. The destination of the sink may be another agent or an external repository; such as HDFS, HBase, or in our case text files in a Linux file system.

Oracle GoldenGate

Now that we have an understanding of Apache Flume, we can begin setting up Oracle GoldenGate for data capture and delivery. The first step is to install Oracle GoldenGate for my source database, Oracle 12c in this case, alter the database settings to enable data capture, and create a GoldenGate Change Data Capture Extract that will retrieve near real-time transactional data from Oracle Redo.

The Oracle GoldenGate architecture we’ll be configuring is depicted below.

[Figure: Oracle GoldenGate architecture for this configuration]

Oracle GoldenGate Source

The Oracle database was altered to support Oracle GoldenGate data capture, and the Container Database user c##ggadmin was created as part of my setup procedures. These requirements are covered in detail in the various Oracle GoldenGate Installation Guides, so I am not covering the steps in detail. To set up the database:

1. Start SQL*Plus as sysdba via the command:
   a) sqlplus / as sysdba
2. Make sure you are in the root container:
   a) SELECT SYS_CONTEXT ('USERENV', 'CON_NAME') FROM DUAL;
3. Execute the following commands:
   a) create user c##ggadmin identified by Oracle1 container=all;
   b) grant connect, resource, dba to c##ggadmin;
   c) exec dbms_goldengate_auth.grant_admin_privilege('C##GGADMIN',container=>'all');
   d) grant dba to c##ggadmin container=all;
   e) alter database force logging;
   f) alter database add supplemental log data;
   g) shutdown immediate;
   h) startup mount;
   i) alter database archivelog;
   j) alter database open;
   k) alter system set enable_goldengate_replication=true scope=both;

By default Oracle Database only logs the changed column information for update operations. Since we are applying data to Flume for further downstream analysis, I want to force Oracle to log all table data when an update occurs. This is done via the GGSCI add schematrandata command.

In my Big Data Lite virtual machine, GoldenGate for Oracle Database is installed at /u01/ogg. I go to that directory, start the GGSCI command interpreter, establish a database connection, and execute the command add schematrandata allcols.

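The screenshot is not reproduced here; the GGSCI commands are along these lines, assuming the pluggable database is named orcl and the source schema is moviedemo:

GGSCI> dblogin userid c##ggadmin password Oracle1
GGSCI> add schematrandata orcl.moviedemo allcols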

Execute the command edit param emov to configure the CDC Extract. The configuration file should look like the one below (for more information on the listed parameter settings, refer to the Oracle GoldenGate Reference Guide at Oracle Technology Network).

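The parameter file itself appears only as an image in the original post; a minimal sketch of an integrated CDC Extract for this setup might look like the following (the PDB name orcl and the single MOVIE table are illustrative assumptions):

EXTRACT emov
USERID c##ggadmin, PASSWORD Oracle1
-- capture full before/after images for downstream formatting
LOGALLSUPCOLS
UPDATERECORDFORMAT COMPACT
-- local trail read by the pflume data pump
EXTTRAIL ./dirdat/tm
-- PDB name and table list are assumptions for illustration
TABLE orcl.moviedemo.movie;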

Execute the ggsci status all command. If the EMOV Extract does not exist, create it and register it with the database as shown below (you will need to start the GoldenGate Manager first).

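Sketched in GGSCI, under the same assumptions, the creation and registration might look like:

GGSCI> dblogin userid c##ggadmin password Oracle1
GGSCI> add extract emov, integrated tranlog, begin now
GGSCI> register extract emov database container (orcl)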

Next create the ./dirdat/tm Extract Trail. This series of disk files will contain data retrieved from Oracle Redo by the EMOV CDC Extract.

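As a sketch, the GGSCI command is:

GGSCI> add exttrail ./dirdat/tm, extract emov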

To create the Extract Data Pump, execute the GGSCI command edit param pflume. The configuration file should look like the one below.

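The screenshot is not reproduced; a minimal sketch of a pass-through Data Pump for this setup might be (the target Manager port 7801 is an assumption):

EXTRACT pflume
-- loopback to the Manager of the ogg-bd instance; port 7801 is an assumption
RMTHOST localhost, MGRPORT 7801
RMTTRAIL ./dirdat/pf
-- pass records through without database lookups
PASSTHRU
TABLE orcl.moviedemo.movie;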

The rmthost option defines the DNS name or IP address of a target server running Oracle GoldenGate and the port where the Oracle GoldenGate Manager is listening for incoming connections. In this case, I am defining a loopback for the Oracle GoldenGate Big Data Adapter instance installed on Big Data Lite machine, so I could have excluded the Extract Data Pump. However, Extract Data Pumps are required whenever data is to be transmitted over a network, so it is a good practice to always configure them in your test systems.

Save and close the file, then we can register the Extract Data Pump and its associated Trail file in our Oracle GoldenGate instance.

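Sketched in GGSCI:

GGSCI> add extract pflume, exttrailsource ./dirdat/tm
GGSCI> add rmttrail ./dirdat/pf, extract pflume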

 

Now we can configure the Oracle GoldenGate target instance.

Oracle GoldenGate Target

In my Big Data Lite virtual machine, GoldenGate Generic with the Oracle GoldenGate Big Data Adapter is installed at /u01/ogg-bd. I go to that directory, start the GGSCI command interpreter, and start the Oracle GoldenGate Manager.


Execute the command edit param rflume to configure the Flume Apply Replicat. The configuration file should look like the one below.

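The screenshot is not reproduced; a minimal sketch of this Replicat parameter file, built around the two settings explained next plus an assumed MAP statement, might be:

REPLICAT rflume
-- initialize the Java module using the properties file described later
TARGETDB LIBFILE libggjava.so SET property=dirprm/flume_frs.properties
-- small transaction groups so the flume channel is not overwhelmed
GROUPTRANSOPS 100
-- MAP statement is an assumption for illustration
MAP orcl.moviedemo.*, TARGET moviedemo.*;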

These parameter settings specify the runtime options for this Replicat process; however, you may be unfamiliar with these two:

TARGETDB LIBFILE libggjava.so SET property=dirprm/flume_frs.properties

This parameter serves as a trigger to initialize the Java module. The SET clause specifies the location and name of a Java properties file for the Java module. The Java properties file location may be specified as either an absolute path, or path relative to the Replicat executable location. In the configuration above, I used a relative path for the Oracle GoldenGate instance installed at /u01/ogg-bd.

GROUPTRANSOPS 100

This parameter controls the number of SQL operations that are contained in a Replicat transaction. The default setting is 1000, which is the best practice setting for a production environment. However, my test server is very small and I do not want to overwhelm my flume channel; so I elected to reduce the number of operations the Replicat will apply in a transaction.

Save and close the file. Register the Replicat with the Oracle GoldenGate instance being sure to specify that it will read from the Trail file ./dirdat/pf.

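In GGSCI this might look like:

GGSCI> add replicat rflume, exttrail ./dirdat/pf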

To create the flume_frs.properties file, exit GGSCI, change to the dirprm directory, and edit the file with vi or your favorite text editor. The file contents should look similar to the following:

 

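The file appears only as an image in the original post; assembled from the settings explained below, its contents would read roughly as follows:

gg.handlerlist=flumehandler
gg.handler.flumehandler.type=flume
gg.handler.flumehandler.RpcClientPropertiesFile=custom-flume-rpc.properties
gg.handler.flumehandler.format=delimitedtext
gg.handler.flumehandler.format.fieldDelimiter=|
gg.handler.flumehandler.mode=tx
gg.handler.flumehandler.EventMapsTo=tx
gg.handler.flumehandler.PropagateSchema=true
gg.handler.flumehandler.includeTokens=false
gg.classpath=dirprm/:/usr/lib/flume-ng/lib/*:
javawriter.bootoptions=-Xmx512m -Xms32m -Djava.class.path=ggjava/ggjava.jar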

 

The flume_frs.properties file defines the Java Virtual Machine settings for the Oracle Big Data Flume Adapter:

gg.handlerlist=flumehandler

Defines the name of the handler configuration properties in this file.

gg.handler.flumehandler.type=flume

Defines the type of this handler.

gg.handler.flumehandler.RpcClientPropertiesFile=custom-flume-rpc.properties

Specifies the name of the Flume Agent configuration file. This file defines the Flume connection information for Replicat to perform remote procedure calls.

gg.handler.flumehandler.format=delimitedtext

Sets the format of the output. Supported formatters are: avro_row, avro_op, delimitedtext, xml, and json.

gg.handler.flumehandler.format.fieldDelimiter=|

The default delimiter is an unprintable character, I changed it to be a vertical bar via this setting.

gg.handler.flumehandler.mode=tx

Sets the operating mode of the Java Adapter. In tx mode, output is written in transactional groups defined by the Replicat GROUPTRANSOPS setting.

gg.handler.flumehandler.EventMapsTo=tx

Defines whether each flume event would represent an operation or a transaction, based upon the setting of gg.handler.flumehandler.mode.

gg.handler.flumehandler.PropagateSchema=true

Defines whether, or not, the Flume handler publishes schema events.

gg.handler.flumehandler.includeTokens=false

When set to true, includes token data from the source trail files in the output. When set to false excludes the token data from the source trail files in the output.

gg.classpath=dirprm/:/usr/lib/flume-ng/lib/*:

Specifies user-defined Java classes and packages used by the Java Virtual Machine to connect to Flume and run. The classpath setting must include (1) the directory location containing the Flume Agent configuration file and (2) a list of the Flume client jars required for the Big Data Adapter to work with Flume. In this example, the Flume client jars are installed at /usr/lib/flume-ng/lib and we use a wildcard, *, to load all of the jars.

The Flume client library versions must match the version of Flume to which the Flume Handler is connecting.

javawriter.bootoptions=-Xmx512m -Xms32m -Djava.class.path=ggjava/ggjava.jar

Sets the JVM runtime memory allocation and the location of the Oracle GoldenGate Java Adapter dependencies (ggjava.jar) file.

Save and close the file.

To create the Flume Agent configuration file, custom-flume-rpc.properties, edit the file with vi or your favorite text editor. The file contents should look similar to the following:

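The image is not reproduced; using the standard Flume RPC client property names, a file matching the description below might look like this sketch:

# default (Avro) RPC client pointing at the local Flume Agent
client.type=default
hosts=h1
hosts.h1=localhost:41414
# send events to Flume in batches of 100
batch-size=100
# connection and request time-outs in milliseconds
connect-timeout=2000
request-timeout=2000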

This file contains the settings Oracle GoldenGate will use to connect to the Flume Agent. In this example, the Oracle GoldenGate Big Data Adapter will attempt an Avro connection to a Flume Agent running on the local machine and listening for connections on port 41414. Data will be sent to Flume in batches of 100 events. Connection and requests to the Flume Agent will time-out and fail if there is no response within 2000 ms.

Flume Agent

As previously shown, the Oracle GoldenGate Big Data Adapter is configured to connect to a Flume Agent on the local machine, listening on port 41414. To configure the Flume Agent, I created the file /home/oracle/flume/flume_frs.conf; with the following settings:

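The configuration file appears only as an image; based on the description that follows, a sketch of flume_frs.conf might look like this (the component names r1, c1, k1 and the channel capacity are illustrative choices, and the sink directory is the output directory created later in the article):

oggfrs.sources = r1
oggfrs.channels = c1
oggfrs.sinks = k1

# Avro RPC source listening on port 41414 of the local host
oggfrs.sources.r1.type = avro
oggfrs.sources.r1.bind = 0.0.0.0
oggfrs.sources.r1.port = 41414
oggfrs.sources.r1.channels = c1

# in-memory channel buffering the events
oggfrs.channels.c1.type = memory
oggfrs.channels.c1.capacity = 1000

# file roll sink writing to the local file system, rolling every 120 seconds
oggfrs.sinks.k1.type = file_roll
oggfrs.sinks.k1.channel = c1
oggfrs.sinks.k1.sink.directory = /u01/ogg-db/flumeOut/movidemo/movie
oggfrs.sinks.k1.sink.rollInterval = 120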

In this file, we create a single Flume Agent named oggfrs. The oggfrs Agent will consist of an Avro RPC source that listens on port 41414 of our local host, a channel that buffers event data in memory, and a sink that writes events to files in a directory of the local file system. The sink will close the existing file and create a new one every 120 seconds.

You will notice that I set the sink directory to include the Oracle source schema and table name. This is a personal preference as the File Roll Sink does not provide a mechanism for naming the output files. Currently, the File Roll Sink names the files based upon the current timestamp when the file is created. The Flume source code sets the file name as follows:

public class PathManager {

  private long seriesTimestamp;
  private File baseDirectory;
  private AtomicInteger fileIndex;

  private File currentFile;

  public PathManager() {
    seriesTimestamp = System.currentTimeMillis();
    fileIndex = new AtomicInteger();
  }

  public File nextFile() {
    currentFile = new File(baseDirectory, seriesTimestamp + "-"
        + fileIndex.incrementAndGet());

    return currentFile;
  }

  public File getCurrentFile() {
    if (currentFile == null) {
      return nextFile();
    }

    return currentFile;
  }

  // remaining methods of the class are omitted here
}

We could write a Java module to override the Flume PathManager class; however, that is beyond the scope of this article.

For more information on setting up Apache Flume Agents, refer to the Apache documentation.

Save and close the file.

We are now ready to start the Flume Agent. In my /home/oracle/flume directory, I created a shell script to start the agent, with the following commands:

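The script appears only as an image; a sketch of such a start script, assuming the paths used in this article, might be:

#!/bin/bash
# start the oggfrs agent defined in flume_frs.conf
flume-ng agent --name oggfrs \
  --conf /home/oracle/flume \
  --conf-file /home/oracle/flume/flume_frs.conf \
  -Dflume.root.logger=INFO,console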

After creating the output directory /u01/ogg-db/flumeOut/movidemo/movie, execute the script to start the agent. Upon a successful agent startup, you will see output similar to the following on the terminal screen:


You will also see the files created by the File Roll Sink in the output directory.


The File Roll Sink will create new files at the interval specified in the Flume Agent configuration file, even when there is no Oracle GoldenGate activity.

Start Oracle GoldenGate and Test

To start and test our configuration, go to the target Oracle GoldenGate environment, /u01/ogg-bd, and make sure the Oracle GoldenGate Manager is in the RUNNING state.


Go to the source Oracle GoldenGate environment, /u01/ogg, start GGSCI and start the emov and pflume Extract Groups. Executing the status all command should show each is in the RUNNING state.


On the Oracle GoldenGate target, start the rflume Replicat. Executing the command status rflume should show it in the RUNNING state.


 

The Flume Agent will show the Replicat connect to the Flume Source.


Now let’s generate some data in the Oracle Database and verify it flows through to the Flume File Roll Sink. In SQL Developer, I connect to the Oracle Pluggable Database, select the moviedemo schema, and execute the following:

INSERT INTO "MOVIEDEMO"."MOVIE" (MOVIE_ID, TITLE, YEAR, BUDGET, GROSS, PLOT_SUMMARY) VALUES ('1173971','Jack Reacher: Never Go Back', '2016','96000000','0','Jack Reacher must uncover the truth behind a major government conspiracy in order to clear his name. On the run as a fugitive from the law, Reacher uncovers a potential secret from his past that could change his life forever.');
INSERT INTO "MOVIEDEMO"."MOVIE" (MOVIE_ID, TITLE, YEAR, BUDGET, GROSS, PLOT_SUMMARY) VALUES ('1173972','Boo! A Madea Holloween', '2016','0','0','Madea winds up in the middle of mayhem when she spends a haunted Halloween fending off killers, paranormal poltergeists, ghosts, ghouls and zombies while keeping a watchful eye on a group of misbehaving teens.');
INSERT INTO "MOVIEDEMO"."MOVIE" (MOVIE_ID, TITLE, YEAR, BUDGET, GROSS, PLOT_SUMMARY) VALUES ('1173973','Like Stars on Earth', '2007','0','1204660','An eight-year-old boy is thought to be lazy and a trouble-maker, until the new art teacher has the patience and compassion to discover the real problem behind his struggles in school.');
commit;

Executing the GGSCI command view report rflume on the Oracle GoldenGate target, shows that data was captured and sent to the Oracle GoldenGate Replicat.


We will also see a file containing data in our Flume File Roll Sink output directory:

This is a delimited text file, so we can view the contents of the file using cat.


 

In the next part of this series we shall take the delimited text files that contain data, move them to Oracle Storage Cloud Service, and analyze the contents with some of the Oracle Big Data analysis tools.

Summary

In this article we demonstrated the functionality of Oracle GoldenGate to capture database transactions and apply this data to an Apache Flume Agent configured with a File Roll Sink.

Continue to the next part of this article: Uploading a file to Oracle storage cloud service using REST API

ODI on Compute Cloud Service: Step by Step Installation


Introduction

We have seen in Connect ODI to Oracle Database Cloud Service (DBCS) how to connect on-premises ODI to DBCS.
But it is also possible to deploy ODI in the Cloud – either on PaaS (on JCS) or on IaaS (on Compute Cloud Service).
In cases where a JEE ODI Agent is not needed, deploying ODI on Compute is a good alternative.

We are describing here step by step instructions to deploy ODI in this environment.

Prerequisites

    • We already have a Database Cloud Service Instance up and running (please refer to this Oracle By Example tutorial for detailed steps: Creating a Database Cloud Service Instance).
    • We also have our Private and Public SSH Keys stored safely.
    • Of course, we have access to the Compute Cloud Service Console.

Let’s connect to the Compute Cloud Services console

Storage Volumes

Based on Best Practices for Using Oracle Compute Cloud Service we are going to create 2 distinct Storage Volumes.

  • One Bootable Storage, which will contain the OS image. We will select an Oracle Linux Image, version 6.6. To see the detail of Oracle Linux Images please refer to About Oracle-Provided Linux Images
  • One Software Storage (to store ODI install and other software if needed).

In the Compute Cloud Services console, we go on the Storage tab and click on Create Storage Volume:

 

 

  • For the Bootable Storage:
      • We select the Oracle Linux Boot Image: OL-6.6-20GB-x11-RD

  • For the Software Storage:
      • We select Boot Image: none
      • Size: 100 GB

Compute Instance

We can now create the Compute Instance:

First, we need to select the image we want to deploy on our Cloud Instance.
As explained above while creating the storage volume, we want to install an Oracle Linux 6.6, so we are going to select OL-6.6-20GB-x11-RD

We will now click on each “tab” of the creation wizard.

On Shape, we select oc4

In the Instance tab, we enter the Name, Description, DNS Hostname prefix and select our SSH key.

We click on Storage and on Attach Existing Volume


We select the volumes we have created: Linux_Boot as “Boot Drive” and Linux_Soft. To keep the instance clean, we delete the default image (in our example CF_LINUX_boot)

We are almost there – we click on Review and check we have attached the SSH key and the Storage – then we can click on Create


We can go to Orchestrations to see that the instance is starting.


We go back to “Instances” … we wait for a while (a few minutes) and then we see our instance created and running.


We note the IP address; it will be used in the next step.

Connection through SSH

In order to access our new Compute Instance, we can log in using SSH as the opc user.

Refer to Accessing an Oracle Linux Instance Using SSH for more details.

From Windows, we create a new Putty Session to connect to the newly created Compute Service. We enter:

  • Host Name= the above IP
  • Port = 22


Then we enter the Private SSH Key in Connection/SSH/Auth

We go back to Sessions and Save.

The first time we open the connection, we get a security alert:


Click yes – then we are in!!


Storage Volume Handling

When we created the Compute Instance, the Boot Storage was automatically mounted.
We now need to mount the additional Storage Volume we have created (Linux_Soft) and attach a folder to it. We will install ODI in that folder.
Refer to Mounting a Storage Volume on a Linux Instance for more details.

First, let’s create a new folder that we will use to store all downloads and ODI installation.

sudo mkdir /u01
sudo chmod 755 /u01

As the Storage Volume we want to mount, Linux_Soft, is on disk 3, the device name will be /dev/xvdd.

We can check the devices on the instance:

ls /dev/xvd*

We create the file system on xvdd

sudo mkfs -t ext4 /dev/xvdd

And finally mount it as u01

sudo mount /dev/xvdd /u01

We create a folder to store downloaded software

sudo mkdir /u01/backup
sudo chmod 755 /u01/backup

VNC Server set-up

As user opc, we edit sshd_config:

sudo vi /etc/ssh/sshd_config

To forward the application display to our local Windows machine, we change all occurrences of X11Forwarding to yes:

X11Forwarding yes

We restart sshd by running the following command:

sudo /etc/init.d/sshd restart

We run the following command to prevent the Window Manager from displaying a lock screen:

gconftool-2 -s -t bool /apps/gnome-screensaver/lock_enabled false

To ensure that the $TMP and $TMPDIR directories are accessible and have write permissions before starting the VNC server, we edit the .bashrc file and set the following values:

vi ~opc/.bashrc

export TMPDIR=/tmp

export TEMPDIR=/tmp

export TMP=/tmp

export TEMP=/tmp

We rerun the profile (notice the leading period and space):

[OS]$ . ~opc/.bashrc

Observe the current temp environment variables to make sure they all point to the new /tmp. At a command prompt, re-enter this command:

env | grep tmp

We start VNC server with the following command:

vncserver :1 -depth 16 -alwaysshared -geometry 1200x750 -s off

We stop the SSH connection and update it to add a Tunnel to connect to our Compute Cloud Service through SSH tunneling. We click on Connection/SSH/Tunnels and add the following tunnel:
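The screenshot is not reproduced here; the tunnel simply forwards local port 5901 to port 5901 on the Compute Instance, roughly:

Source port: 5901
Destination: <compute_instance_public_ip>:5901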

Session/save and … we open our updated connection.

Once we are connected, we can now launch our favorite VNC client to connect to our VNC Server:


Now we are ready to download and install ODI!

JDK and ODI Downloads

The first thing to do is to download the required JDK certified with ODI.

As our goal is to install ODI 12.2.1.0.0 we can see in the Certification Matrix that the minimum JDK version is 1.8.0_77. At the date of writing (Sept 2016), the best selection is jdk-8u101-linux-x64.rpm

At this point ODI is available for download at http://www.oracle.com/technetwork/middleware/data-integrator/downloads/index.html

If you have difficulties connecting to the Oracle web site from the VNC Server, a workaround is to force the DNS to another value than the default one. (cf. https://community.oracle.com/message/13903890#13903890).

JDK Install

We move the downloaded files to /u01/backup

mv /home/opc/Downloads/* /u01/backup

We go to /u01/backup and install the package using:

sudo rpm -ivh jdk-8u101-linux-x64.rpm

We check the JDK is installed:

java -version

java version “1.8.0_101”

ODI Install

Now we are ready to install ODI – refer to Installation Guide for Oracle Data Integrator for details

java -jar fmw_12.2.1.1.0_odi.jar


The Oracle Home must be under the /u01 directory to have enough space and to take advantage of the Linux_Soft Storage Volume we have mounted.


Repository Creation

Refer to Creating the Master and Work Repository Schemas for details on how to launch RCU.
In this case we are using a DBCS Instance sharing the same Domain as the Compute Instance.


Conclusion

We have seen the steps to install ODI on a Compute Instance – which is as easy as installing ODI on any Linux server. We will detail the connections between the ODI Cloud instance and other DBCS instances in a future article.

For more ODI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for ODI.

Acknowledgements

Special thanks to my A-Team fellows Richard Williams and Roland Koenn, A-Team Cloud Architects, for their help and support.

References

Connect ODI to Oracle Database Cloud Service (DBCS)

Creating a Database Cloud Service Instance

Best Practices for Using Oracle Compute Cloud Service

 

Oracle GoldenGate: How to Configure On-Premise to GoldenGate Cloud Services (GGCS) Replication with Corente VPN


Introduction

This document will walk you through how to configure Oracle GoldenGate replication between On-Premise to GoldenGate Cloud Service (GGCS) on Oracle Public Cloud (OPC) via Virtual Private Network (VPN) using Corente Services Gateway (CSG).

The high level steps for this replication configuration are as follows:

  • Creation of SSH Public/Private Key Files
  • Provisioning of Database Cloud Service (DBCS) which is a pre-requisite of GGCS
  • Provisioning of GoldenGate Cloud Service (GGCS)
  • On-Premise Corente Services Gateway Configuration and Setup
  • Provisioning of Compute Instance for OPC Corente Services Gateway
  • On-Premise and OPC Corente VPN Tunnel configuration
  • GGCS VPN Tunnel Configuration via Generic Routing Encapsulation (GRE) protocol
  • On-Premise and GGCS GoldenGate Sample Replication Configuration

Note: Provisioning Resources in this article requires Oracle Cloud and Corente VPN credentials. If you don’t have one, please contact your Oracle Sales Representative.

The following assumptions have been made during the writing of this article:

  • The reader has a general understanding of Windows and Unix platforms.
  • The reader has basic knowledge of Oracle GoldenGate products and concepts.
  • The reader has a general understanding of Cloud Computing Principles
  • The reader has basic knowledge of Oracle Cloud Services
  • The reader has a general understanding of Network Computing Principles

Main Article

The GoldenGate Cloud Service (GGCS) is a cloud-based real-time data integration and replication service, which provides seamless and easy data movement from various on-premises relational databases to databases in the cloud with sub-second latency while maintaining data consistency and offering fault tolerance and resiliency.

GoldenGate Cloud Service (GGCS) Architecture Diagram:

[Diagram: GGCS architecture]

In a typical implementation of On-Premise to GGCS, the connectivity is accomplished through the use of SSH, since this is the only port opened by default on the cloud. The On-Premise server communicates directly to the GGCS server through the use of SOCKS proxy.

However, in cases where the security policy dictates otherwise or the client doesn’t want to use SSH, a VPN connection between On-Premise and the OPC can be used as an alternative. Currently, GGCS has been certified with the Corente Services Gateway for VPN connectivity.

Corente VPN Service Architecture Diagram:

[Diagram: Corente VPN Service architecture]

GGCS Corente VPN Deployment Architecture diagram depicted in this article:

[Diagram: GGCS Corente VPN deployment architecture used in this article]

GoldenGate Connectivity Flow:

  • On-Premise Network to OPC Network: GGCS Instance can be reached via GRE IP address 172.16.201.3
  • OPC Network to On-Premise Network: On-Premise OGG VM Server can be reached via IP address 192.168.201.51

The complete document can be found on the Oracle Support site under the document ID: 2198461.1

 

BICS Data Sync – Running Post Load Procedures Against DBCS and Oracle RDBMS


For other A-Team articles about BICS and Data Sync, click here

Introduction

The Data Sync tool provides the ability to extract from both on-premises and cloud data sources, and to load that data into BI Cloud Service (BICS) and other relational databases. In the recent 2.2 release of Data Sync, the functionality to run Post Load SQL and Stored Procedures was added.

Currently this functionality is only available for Oracle DBCS or Oracle DB target databases – it will NOT work for a Schema Service database target – although this article provides details of a workaround when the target is a schema service database.

This article will walk through an example to set up both a post load SQL command, and to execute a stored procedure on the target database.

 

Download The Latest Version of Data Sync Tool

Be sure to download and install the latest version of the Data Sync Tool from OTN through this link.

For further instructions on configuring Data Sync, see this article.  If a previous version of Data Sync is being upgraded, use the documentation on OTN.

 

Main Article

This article will present a simple use case that can be expanded for real world load scenarios.

A Post Load Processing session will be set up to run both a SQL statement, and a stored procedure on the target database once the load has completed.

 

Create Target Summary Table

In this example, a summary table will be loaded with a row of data once the underlying fact table has been refreshed in BICS.  Because the summary table only exists in the target database, we need to create it as a target in data sync.

1. Under ‘Project’ / ‘Target Tables/ Data Sets’, select ‘New’

In this example the summary table is called ‘AUDIT_EVENT_SUMMARY‘, and consists of just 2 fields.

 


An ‘AUDIT_RECORD_COUNT‘ numeric field, and a ‘CAPTURE_DATE‘ date field.


2. Create the fields as shown, then ‘Save’

 

Create Post Load Processing Process

Now that we have the target table defined, we can set up the post-load SQL, and the stored procedure.

1. From ‘Project’ / ‘Post Load Processing’ select ‘New’


2. Enter an appropriate name, hit ‘Save’, then select the ‘SQL Source Tables’ tab


Data Sync offers the ability to execute the SQL and Stored Procedure either at the end of the entire load process, or after the load completion of one or more individual tables.  This is controlled within the ‘SQL Source Tables’ section.

If the post load processing is to be run after all tables have been loaded, then no source tables need to be added.  If this section is left empty, then by default the data sync tool will run the post load processing only after all tables are loaded.

If the post load processing can be run after one or more tables have been loaded, then that dependency can be set up here.

3. Select ‘Add/Remove’ and then the ‘Go’ search button to generate a list of table sources being used.


In this example we will trigger the load after the fact table (‘AUDIT_EVENT_DBAAS’) has been loaded.

4. Select the table, then hit ‘Add’, and finally ‘Save’ to close out of the screen.


There is a ‘SQL Target Tables’ tab as well.  This is useful if the target table needs to be truncated as part of the update process.

Truncating and reloading tables with indexes and large record volumes can result in performance issues.  The data sync tool will handle this by having the target database perform the following steps:

 

  • Truncate the table
  • Drop all indexes
  • Insert the data
  • Re-create the indexes
  • Analyze the table

If the target table is always going to be loaded incrementally, then select the ‘Truncate for Full Load’ check box, else ‘Truncate Always’.

For demonstration purposes, we will select our target summary table.

5. Select ‘Add/Remove’


6. Select ‘Go’ to list the available target tables


7. Select the table(s) and ‘Add’.  Then choose the appropriate option as to whether to ‘Truncate Always’ or only ‘Truncate For Full Load’.


The next steps will be used to define the SQL and Stored Procedure.

8. Select ‘OK’ to return to the ‘Edit’ tab, hit ‘Save’, and then select the radio button within the ‘SQL(s)/Stored Procedure(s)’ box


9. In the next screen select ‘Add’, enter an appropriate name, and then select whether this step is to run a ‘SQL’ statement, or a ‘Stored Procedure’.  In this first example we will set up a post load SQL command.


10. There is also the option to run this post load process on just an ‘Initial Load’, an ‘Incremental Load’ or ‘Both’.  In this example we select ‘Both’.


11. In the section below, as shown, enter the valid SQL statement to be run on the target database.  In this case a single row is added to the summary table that we had created previously.

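The SQL in the screenshot is not reproduced; a statement of this kind, using the summary and fact tables from this example, might look like the following sketch:

INSERT INTO AUDIT_EVENT_SUMMARY (AUDIT_RECORD_COUNT, CAPTURE_DATE)
SELECT COUNT(*), SYSDATE
FROM AUDIT_EVENT_DBAAS;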

12. Click ‘OK’ to return to the previous screen.

To create a Stored Procedure follow similar steps.  In this example we will set up the post load processing entry to run both the SQL, and a Stored Procedure.

13. Select ‘Add’, enter a suitable name, and select the ‘Stored Procedure’ type.

14. Enter the name of the procedure in the entry box.  You do not need to type in ‘execute’ – the data sync tool will take care of that – just enter the name of the stored procedure, then click ‘OK’ and ‘OK’ again to exit out of the Post Load Processing set-up.


 

When the Job is next run, the SQL and Procedure will be run after the fact table has been loaded.

It is possible to set up multiple post load processes, with different dependencies.  Each will be run independently once the source tables defined have been loaded.

 

Summary

This article walked through the steps to create a Post Load SQL and Stored Procedure within the Data Sync tool.

For other A-Team articles about BICS and Data Sync, click here.

BICS Data Sync – Running Post Load Procedures against a Schema Service DB


For other A-Team articles about BICS and Data Sync, click here

Introduction

The Data Sync tool provides the ability to extract from both on-premises and cloud data sources, and to load that data into BI Cloud Service (BICS) and other relational databases. In the recent 2.2 release of Data Sync, the functionality to run Post Session SQL and Stored Procedures was added. This allows, for instance, the Data Sync tool to call a stored procedure to update summary tables and materialized views in the target database once the underlying data load has been completed.

As of the time of writing, this functionality is only available when the target database is an Oracle DBCS or standalone Oracle database.  It does NOT work with the standard BICS Schema Service target database.

This article provides steps for a viable workaround to run post session commands in a Schema Service target.

(for details on how to run this new functionality with a DBCS or standard Oracle DB target – see this article)

 

Main Article

Download The Latest Version of Data Sync Tool

Be sure to download and install the latest version of the Data Sync Tool from OTN through this link.

For further instructions on configuring Data Sync, see this article.  If a previous version of Data Sync is being upgraded, use the documentation on OTN.

Process Overview

Once the main data load has been completed, a single row will be inserted into a status table in the schema service database.  That will trigger the stored procedure to be run.

This solution will provide two triggering methods.  The choice of which to use will depend on the type of stored procedure that needs to be run once the data load has completed.

The current version of the Data Sync tool does not allow us to control the order that the load steps occur in. This means that we do not have the ability to make sure that the status table – that will trigger the stored procedure – is only loaded after all other table loads are complete.

As a workaround we will use 2 jobs. The first will load the data. Once that finishes, the second job will be triggered. This will load the single row into the status table, and that will trigger the post-load stored procedure to be run.

 

Create the Target Summary Table used to Trigger Post Session Stored Procedure

For this demonstration, a simple target table ‘DS_LOAD_STATUS’ will be created in the Schema Service database with two fields: ‘LOAD_STATUS’ and ‘STATUS_DATE’. The make-up of this table is not important. The main point is that a table needs to exist in the schema service database that can be loaded last.  The two different trigger methods will be discussed next, but both will use the existence of a new row in this DS_LOAD_STATUS table to trigger the post session stored procedure.

1. To create the DS_LOAD_STATUS table, run the following example SQL in the ‘SQL Workshop’ tool within Apex for the Schema Service database accompanying the BICS environment.

CREATE table "DS_LOAD_STATUS" (
"STATUS_DATE" DATE,
"LOAD_STATUS" VARCHAR2(50)
)

 

Create the Triggering Mechanism

Two different methods are shown below.  Method 2 will work for all cases.  Method 1, which is slightly simpler, will work for specific cases.

Method 1

If the post session stored procedure does not include any DDL statements (for example, truncate, drop, create indexes, tables, etc) – so it is using only ‘select’, ‘insert’, ‘update’ and ‘delete’ commands – then the simplest method is to create an On-Insert trigger on the status table.  When a row is added, the trigger fires, and the stored procedure is run.

In this case, it is assumed that a stored procedure, named ‘POST_SESSION_STEPS’, has already been created.

The following SQL will create the ‘on insert’ trigger against the DS_LOAD_STATUS table so that after a row is inserted, this stored procedure is called.

create or replace trigger "DS_LOAD_TRIGGER_SP"
AFTER
insert on "DS_LOAD_STATUS"
for each row
begin
POST_SESSION_STEPS;
end;
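Before wiring this into Data Sync, the trigger can be sanity-checked by inserting a row manually from SQL Workshop. This is purely an illustrative test and assumes POST_SESSION_STEPS can safely be run ad hoc:

-- Manual test: inserting a row fires the trigger, which runs POST_SESSION_STEPS
insert into DS_LOAD_STATUS (STATUS_DATE, LOAD_STATUS) values (sysdate, 'MANUAL_TEST');
commit;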

 

Method 2

If the stored procedure does use DDL statements, then the use of a table on-insert trigger may not run smoothly.  In that case a scheduled database job will be created, which will look for a new row in the status table.  Once the new row is recognized, this job will execute the post load stored procedure.

Once again it is assumed that a stored procedure named ‘POST_SESSION_STEPS’, has already been created.

This process contains two steps.  First, a short stored procedure is created which evaluates a condition – in this case whether a new row has recently been added to the status table – and, if the condition is true, executes the main stored procedure.

The SQL below creates this procedure, called ‘CHECK_POST_SESSION_CONDITION‘, which will check whether a new row has been added to the DS_LOAD_STATUS table within the last 5 minutes.

create or replace procedure CHECK_POST_SESSION_CONDITION as
V_ROW_COUNT INTEGER;
begin
select count(*) into V_ROW_COUNT from DS_LOAD_STATUS
where STATUS_DATE > sysdate - interval '5' minute;  -- checking for a row inserted in the last 5 minutes
IF V_ROW_COUNT >= 1 THEN
POST_SESSION_STEPS; -- post session procedure
END IF;
END;

The final step is to create a scheduled job that runs every 5 minutes checking the condition above.

begin CLOUD_SCHEDULER.CREATE_JOB (
JOB_NAME => 'POST_SESSION_DS_LOAD_JOB',
JOB_TYPE => 'STORED_PROCEDURE',
JOB_ACTION => 'CHECK_POST_SESSION_CONDITION', -- run the CHECK_POST_SESSION_CONDITION procedure
REPEAT_INTERVAL => 'freq=minutely; interval=5' ); -- run the job every 5 minutes
END;
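Once created, the job can be verified from SQL Workshop. A minimal check, assuming the standard USER_SCHEDULER_JOBS view is accessible in the Schema Service database, is below:

-- Confirm the scheduled job exists and see when it last ran and will next run
select job_name, state, last_start_date, next_run_date
from user_scheduler_jobs
where job_name = 'POST_SESSION_DS_LOAD_JOB';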

 

Set up Second Job in Data Sync

All remaining steps will be carried out in the environment where data sync is installed.

In this scenario, a Data Sync Job already exists which will load the desired data into the BICS schema service database and is named ‘Main_Load’.

If this job has never been run, run it now.  A successful load is important so that the ‘Signal’ file can be created.  This is the mechanism that will be used to trigger the second job, which will then load the status table, which will in turn trigger the post-load process.

We need to create a new Project for the second job.

3. Do this by selecting ‘Projects’ from the ‘File’ menu.

Cursor

4. Choose an appropriate name.

Cursor

In this example, the target table and its trigger were created in the earlier steps.  We need to set up this table as a target for Data Sync to load to.

5. Under ‘Project’, select ‘Target Tables/Data Sets’ and then ‘New’.  In the table name enter the exact name of the existing target table – in this case ‘DS_LOAD_STATUS‘.

Cursor

6. Select the ‘Table Columns’ sub-tab, and enter the column names and correct data types to match what was created in step 1.

Cursor

We also need to define a source to create the data for this DS_LOAD_STATUS table.  If a suitable table already exists in the source database, that may be used.  In this example we will base the data on a SQL statement.

7. Under ‘Project’ / ‘Relational Data’ select ‘Data from SQL’.  Provide a name for the source, and select to load into an existing target.  Use the search drop down to select the ‘DS_LOAD_STATUS’ table created in step 1.  Select the source connection and enter the SQL.

Cursor

In this case it is a simple select statement that will return one row, with a value of ‘LOAD_COMPLETE’ for the LOAD_STATUS field, and the current time and date, for the STATUS_DATE.

select
sysdate as STATUS_DATE,
'LOAD_COMPLETE' as LOAD_STATUS
from dual

 

8. Select the newly created source, and then edit the Load Strategy.  In this case, because it’s a status table, we have chosen to always append the new row, and never delete existing data.

Cursor

9. Give the Job a suitable name in the ‘Jobs’ / ‘Jobs’ menu area, and then ‘Run’ the job.

Cursor

Make sure the job runs successfully before continuing.

 

Create Data Sync Trigger Mechanism

The Data Sync tool creates ‘Signal’ files whenever a job starts and successfully finishes. These files are stored in the /log/jobSignal sub-directory. Take a look in this directory.

In our case we see 4 files, as this image shows. The important one for our purpose is the one that shows when the Main_Load job has completed. In this case that Signal File is named ‘Main_Load_CompletedSignal.txt’. This is the file we will have Data Sync check for, and when it finds it, it will trigger the second job.

 

Cursor

To set up Data Sync to automatically trigger a job, we need to edit the ‘on_demand_job.xml’ file in the /conf-shared directory.

10. Open this file with a text editor.

Cursor

11. An entry needs to be added to the <OnDemandMonitors> section.

The syntax is:

<TriggerFile job=$JOB_NAME file=$FILE_TO_TRIGGER_JOB></TriggerFile>

In this example the full syntax will be:

<TriggerFile job="POST_LOAD_JOB" file="C:\Users\oracle\Desktop\BICSDataSync_V2_2\log\jobSignal\Main_Load_CompletedSignal.txt"> </TriggerFile>

12. Change the pollingIntervalInMinutes to the desired check interval. In this case we set it to 1, so that Data Sync will check for the existence of the Signal file every minute.  The entry should look similar to this.

Screenshot_10_27_16__5_36_PM

13. Save the updated on_demand_job.xml

14. Test that the process is working.

Re-open the original Project and run the Main_Load job.  Monitor the jobSignal directory.  Shortly after the Main_Load job finishes, the Signal file – in this case ‘Main_Load_CompletedSignal.txt’ – is found.  The Data Sync tool deletes the file so that the process will not run again, and starts the POST_LOAD_JOB created in step 9.

Screenshot_10_27_16__5_41_PM

15. As an additional check, go to the schema service database in Apex, and confirm that the DS_LOAD_STATUS table has had a new entry added, and that the ‘post-load’ stored procedure has been successfully run.
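A quick, illustrative way to perform this check in SQL Workshop is to look at the most recent status rows:

-- Confirm the status row arrived; the post-load stored procedure should have run off the back of it
select LOAD_STATUS, STATUS_DATE
from DS_LOAD_STATUS
order by STATUS_DATE desc;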

 

Object_Browser

Summary

This article walked through an approach to run a post-load stored procedure with the Data Sync tool and a schema service database target.

For other A-Team articles about BICS and Data Sync, click here.


Loading Data into Oracle BI Cloud Service using BI Publisher Reports and SOAP Web Services

$
0
0

Introduction

This post details a method of loading data that has been extracted from Oracle Business Intelligence Publisher (BIP) into the Oracle Business Intelligence Cloud Service (BICS). The BIP instance may either be Cloud-Based or On-Premise.

It builds upon the A-Team post Using Oracle BI Publisher to Extract Data from Oracle Sales and ERP Clouds. This post uses SOAP web services to extract data from an XML-formatted BIP report.

The method uses the PL/SQL language to wrap the SOAP extract, XML parsing commands, and database table operations. It produces a BICS staging table which can then be transformed into star-schema object(s) for use in modeling.  The transformation processes and modeling are not discussed in this post.

Additional detailed information, including the complete text of the procedure described, is included in the References section at the end of the post.

Rationale for using PL/SQL

PL/SQL is the only procedural tool that runs on the BICS / Database Schema Service platform. Other wrapping methods e.g. Java, ETL tools, etc. require a platform outside of BICS to run on.

PL/SQL can utilize native SQL commands to operate on the BICS tables. Other methods require the use of the BICS REST API.

Note: PL/SQL is very good at showcasing functionality. However, it tends to become prohibitively resource intensive when deployed in an enterprise production environment.

For the best enterprise deployment, an ETL tool such as Oracle Data Integrator (ODI) should be used to meet these requirements and more:

* Security

* Logging and Error Handling

* Parallel Processing – Performance

* Scheduling

* Code re-usability and Maintenance

The steps below depict how to load a BICS table.

About the BIP Report

The report used in this post is named BIP_DEMO_REPORT and is stored in a folder named Shared Folders/custom as shown below:

BIP Report Location

The report is based on a simple analysis with three columns and output as shown below:

BIP Demo Analysis

Note: The method used here requires all column values in the BIP report to be NOT NULL for two reasons:

1. The XPATH parsing command signals either the end of a row or the end of the data when a null result is returned.

2. All columns being NOT NULL ensures that the result set is dense and not sparse. A dense result set ensures that each column is represented in each row. Additional information regarding dense and sparse result sets may be found in the Oracle document Database PL/SQL Language Reference.

One way to ensure a column is not null is to use the IFNull function in the analysis column definition as shown below:

BIP IFNULL Column Def

Call the BIP Report

The SOAP API request used here is similar to the one detailed in Using Oracle BI Publisher to Extract Data from Oracle Sales and ERP Clouds.

The SOAP API request should be constructed and tested using a SOAP API testing tool e.g. SoapUI.

This step uses the APEX_WEB_SERVICE package to issue the SOAP API request and store the XML result in an XMLTYPE variable. The key inputs to the package call are:

* The URL for the Report Request Service

* The SOAP envelope the Report Request Service expects.

* Optional Headers to be sent with the request

* An optional proxy override

Note: Two other BI Publisher reports services exist in addition to the one shown below. The PublicReportService_v11 should be used for BI Publisher 10g environments and the ExternalReportWSSService should be used when stringent security is required. An example URL is below:

https://hostname/xmlpserver/services/v2/ReportService

An example Report Request envelope is below:

<soapenv:Envelope xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:v2="http://xmlns.oracle.com/oxp/service/v2">
<soapenv:Header/>
<soapenv:Body>
<v2:runReport>
<v2:reportRequest>
<v2:byPassCache>true</v2:byPassCache>
<v2:flattenXML>false</v2:flattenXML>
<v2:reportAbsolutePath>/custom/BIP_DEMO_REPORT.xdo</v2:reportAbsolutePath>
<v2:sizeOfDataChunkDownload>-1</v2:sizeOfDataChunkDownload>
</v2:reportRequest>
<v2:userID>'||P_AU||'</v2:userID>
<v2:password>'||P_AP||'</v2:password>
</v2:runReport>
</soapenv:Body>
</soapenv:Envelope>

An example of setting a SOAP request header is below:

apex_web_service.g_request_headers(1).name := 'SOAPAction';
apex_web_service.g_request_headers(1).value := '';

An example proxy override is below:

www-proxy.us.oracle.com

 Putting this together, example APEX statements are below:

apex_web_service.g_request_headers(1).name := 'SOAPAction';
apex_web_service.g_request_headers(1).value := '';
f_xml := apex_web_service.make_request(
p_url => p_report_url,
p_envelope => l_envelope,
p_proxy_override => l_proxy_override );

Note: The SOAP header used in the example above was necessary for the call to the BI Publisher 11g implementation used in a demo Sales Cloud instance. If it were not present, the error LPX-00216: invalid character 31 (0x1F) would appear. This message indicates that the response received from the server was encoded in a gzip format which is not a valid xmltype data type.

Parse the BIP Report Result Envelope

This step parses the XML returned by the SOAP call for the data stored in the tag named reportBytes that is encoded in Base64 format.

The XPATH expression used below should be constructed and tested using an XPATH testing tool e.g. freeformatter.com

This step uses the APEX_WEB_SERVICE package to issue the parsing command and store the result in a CLOB variable. The key inputs to the package call are:

* The XML returned from BIP SOAP call above

* The XML Path Language (XPATH) expression to find the reportBytes data

An example of the Report Response envelope returned is below:

<soapenv:Envelope xmlns:soapenv=”http://schemas.xmlsoap.org/soap/envelope/” xmlns:xsd=”http://www.w3.org/2001/XMLSchema” xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance”><soapenv:Body><runReportResponse xmlns=”http://xmlns.oracle.com/oxp/service/v11/PublicReportService”><runReportReturn>        <reportBytes>PD94bWwgdmVyc2lvbj0iMS4wIiBlbmNvZGluZz0iVVRGLTgiPz4KPCEtLUdlbmVyYXRlZCBieSBPcmFjbGUgQkkgUHVibGlzaGVyIDEyLjIuMS4xLjAgLURhdGFlbmdpbmUsIGRhdGFtb2RlbDpfY3VzdG9tX0JJUF9ERU1PX01PREVMX3hkbSAtLT4KPERBVEFfRFM+PFNBVy5QQVJBTS5BTkFMWVNJUz48L1NBVy5QQVJBTS5BTkFMWVNJUz4KPEdfMT4KPENPTFVNTjA+QWNjZXNzb3JpZXM8L0NPTFVNTjA+PENPTFVNTjE+NTE2MTY5Ny44NzwvQ09MVU1OMT48Q09MVU1OMj40ODM3MTU8L0NPTFVNTjI+CjwvR18xPgo8R18xPgo8Q09MVU1OMD5BdWRpbzwvQ09MVU1OMD48Q09MVU1OMT43MjM3MzYyLjM8L0NPTFVNTjE+PENPTFVNTjI+NjI3OTEwPC9DT0xVTU4yPgo8L0dfMT4KPEdfMT4KPENPTFVNTjA+Q2FtZXJhPC9DT0xVTU4wPjxDT0xVTU4xPjY2MTQxMDQuNTU8L0NPTFVNTjE+PENPTFVNTjI+NDAzNzQ0PC9DT0xVTU4yPgo8L0dfMT4KPEdfMT4KPENPTFVNTjA+Q2VsbCBQaG9uZXM8L0NPTFVNTjA+PENPTFVNTjE+NjMyNzgxOS40NzwvQ09MVU1OMT48Q09MVU1OMj40Nzg5NzU8L0NPTFVNTjI+CjwvR18xPgo8R18xPgo8Q09MVU1OMD5GaXhlZDwvQ09MVU1OMD48Q09MVU1OMT44ODA3NzUzLjI8L0NPTFVNTjE+PENPTFVNTjI+NjU1MDY1PC9DT0xVTU4yPgo8L0dfMT4KPEdfMT4KPENPTFVNTjA+SW5zdGFsbDwvQ09MVU1OMD48Q09MVU1OMT40MjA4ODQxLjM5PC9DT0xVTU4xPjxDT0xVTU4yPjY2MTQ2OTwvQ09MVU1OMj4KPC9HXzE+CjxHXzE+CjxDT0xVTU4wPkxDRDwvQ09MVU1OMD48Q09MVU1OMT43MDAxMjUzLjI1PC9DT0xVTU4xPjxDT0xVTU4yPjI2OTMwNTwvQ09MVU1OMj4KPC9HXzE+CjxHXzE+CjxDT0xVTU4wPk1haW50ZW5hbmNlPC9DT0xVTU4wPjxDT0xVTU4xPjQxMjAwOTYuNDk8L0NPTFVNTjE+PENPTFVNTjI+NTI3Nzk1PC9DT0xVTU4yPgo8L0dfMT4KPEdfMT4KPENPTFVNTjA+UGxhc21hPC9DT0xVTU4wPjxDT0xVTU4xPjY2Njk4MDguODc8L0NPTFVNTjE+PENPTFVNTjI+Mjc4ODU4PC9DT0xVTU4yPgo8L0dfMT4KPEdfMT4KPENPTFVNTjA+UG9ydGFibGU8L0NPTFVNTjA+PENPTFVNTjE+NzA3ODE0Mi4yNTwvQ09MVU1OMT48Q09MVU1OMj42MzcxNzQ8L0NPTFVNTjI+CjwvR18xPgo8R18xPgo8Q09MVU1OMD5TbWFydCBQaG9uZXM8L0NPTFVNTjA+PENPTFVNTjE+Njc3MzEyMC4zNjwvQ09MVU1OMT48Q09MVU1OMj42MzMyMTE8L0NPTFVNTjI+CjwvR18xPgo8L0RBVEFfRFM+</reportBytes><reportContentType>text/xml</reportContentType><reportFileID xsi:nil=”true”/><reportLocale xsi:nil=”true”/></runReportReturn></runReportResponse></soapenv:Body></soapenv:Envelope>

An example of the XPATH expression to retrieve just the value of reportBytes is below:

//*:reportBytes/text()

Putting these together, an example APEX statement is below:

f_report_bytes := apex_web_service.parse_xml_clob( p_xml => f_xml, p_xpath => '//*:reportBytes/text()' );

Decode the Report Bytes Returned

This step uses the APEX_WEB_SERVICE package to decode the Base64 result from above into a BLOB variable and then uses the XMLTYPE function to convert the BLOB into a XMLTYPE variable.

Decoding of the Base64 result should first be tested with a Base64 decoding tool e.g. base64decode.org

An example of the APEX decode command is below:

f_blob := apex_web_service.clobbase642blob(f_base64_clob);

 An example of the XMLTYPE function is below:

f_xml := xmltype (f_blob, 1);

The decoded XML output looks like this:

<?xml version="1.0" encoding="UTF-8"?>
<!--Generated by Oracle BI Publisher 12.2.1.1.0 -Dataengine, datamodel:_custom_BIP_DEMO_MODEL_xdm -->
<DATA_DS><SAW.PARAM.ANALYSIS></SAW.PARAM.ANALYSIS>
<G_1>
<COLUMN0>Accessories</COLUMN0><COLUMN1>5161697.87</COLUMN1><COLUMN2>483715</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Audio</COLUMN0><COLUMN1>7237362.3</COLUMN1><COLUMN2>627910</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Camera</COLUMN0><COLUMN1>6614104.55</COLUMN1><COLUMN2>403744</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Cell Phones</COLUMN0><COLUMN1>6327819.47</COLUMN1><COLUMN2>478975</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Fixed</COLUMN0><COLUMN1>8807753.2</COLUMN1><COLUMN2>655065</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Install</COLUMN0><COLUMN1>4208841.39</COLUMN1><COLUMN2>661469</COLUMN2>
</G_1>
<G_1>
<COLUMN0>LCD</COLUMN0><COLUMN1>7001253.25</COLUMN1><COLUMN2>269305</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Maintenance</COLUMN0><COLUMN1>4120096.49</COLUMN1><COLUMN2>527795</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Plasma</COLUMN0><COLUMN1>6669808.87</COLUMN1><COLUMN2>278858</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Portable</COLUMN0><COLUMN1>7078142.25</COLUMN1><COLUMN2>637174</COLUMN2>
</G_1>
<G_1>
<COLUMN0>Smart Phones</COLUMN0><COLUMN1>6773120.36</COLUMN1><COLUMN2>633211</COLUMN2>
</G_1>
</DATA_DS>

Create a BICS Table

This step uses a SQL command to create a simple staging table that has 20 identical varchar2 columns. These columns may be transformed into number and date data types in a future transformation exercise that is not covered in this post.

A When Others exception block allows the procedure to proceed if an error occurs because the table already exists.

A shortened example of the create table statement is below:

execute immediate 'create table staging_table ( c01 varchar2(2048), … , c20 varchar2(2048) )';
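A minimal sketch of that pattern, with the column list shortened for brevity, might look like the following (the handler simply continues, as described above):

begin
  execute immediate 'create table staging_table ( c01 varchar2(2048), c02 varchar2(2048) )'; -- shortened column list
exception
  when others then
    null; -- most likely the table already exists, so continue with the load
end;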

Load the BICS Table

This step uses SQL commands to truncate the staging table and insert rows from the BIP report XML content.

The XML content is parsed using an XPATH command inside two LOOP commands.

The first loop processes the rows by incrementing a subscript.  It exits when the first column of a new row returns a null value.  The second loop processes the columns within a row by incrementing a subscript. It exits when a column within the row returns a null value.

The following XPATH examples are for a data set that contains 11 rows and 3 columns per row:

//G_1[2]/*[1]/text()          -- Returns the value of the first column of the second row

//G_1[2]/*[4]/text()          -- Returns a null value for the 4th column signaling the end of the row

//G_1[12]/*[1]/text()        -- Returns a null value for the first column of a new row signaling the end of the data set

After each row is parsed, it is inserted into the BICS staging table.
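A hedged sketch of this nested-loop parsing is shown below. The variable names, the three-column insert, and the anonymous block wrapper are illustrative assumptions; the complete procedure is linked in the References section, and f_xml is assumed to be populated by the decode step above.

declare
  f_xml    xmltype;  -- assumed to be populated by the decode step shown earlier
  f_row    pls_integer := 1;
  f_col    pls_integer;
  f_value  clob;
  type t_vals is table of varchar2(2048) index by pls_integer;
  f_values t_vals;
begin
  loop
    -- end of data: the first column of the next row returns null
    exit when apex_web_service.parse_xml_clob(
                p_xml   => f_xml,
                p_xpath => '//G_1[' || f_row || ']/*[1]/text()' ) is null;
    f_col := 1;
    loop
      f_value := apex_web_service.parse_xml_clob(
                   p_xml   => f_xml,
                   p_xpath => '//G_1[' || f_row || ']/*[' || f_col || ']/text()' );
      exit when f_value is null;  -- end of the current row
      f_values(f_col) := f_value;
      f_col := f_col + 1;
    end loop;
    -- insert the parsed row; only the three demo columns are populated here
    insert into staging_table (c01, c02, c03)
    values (f_values(1), f_values(2), f_values(3));
    f_row := f_row + 1;
  end loop;
  commit;
end;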

An image of the staging table result is shown below:

BIP Table Output

 

Summary

This post detailed a method of loading data that has been extracted from Oracle Business Intelligence Publisher (BIP) into the Oracle Business Intelligence Cloud Service (BICS).

Data was extracted and parsed from an XML-formatted BIP report using SOAP web services wrapped in the Oracle PL/SQL APEX_WEB_SERVICE package.

A BICS staging table was created and populated. This table can then be transformed into star-schema objects for use in modeling.

For more BICS and BI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

References

Complete Text of Procedure Described

Using Oracle BI Publisher to Extract Data from Oracle Sales and ERP Clouds

Database PL/SQL Language Reference

Reference Guide for the APEX_WEB_SERVICE

Soap API Testing Tool

XPATH Testing Tool

Base64 Decoding and Encoding Testing Tool

Oracle GoldenGate: Working With Tokens and Environment Variables

$
0
0

Introduction

Oracle GoldenGate contains advanced functionality that exposes a wealth of information users may leverage. In this article we shall discuss three of these: TOKENS, which are user defined data written to Oracle GoldenGate Trails; the Column Conversion Function @TOKEN, which is used to retrieve the token data from the Oracle GoldenGate Trail; and the Column Conversion Function @GETENV, which is used to get information about the Oracle GoldenGate environment. We will demonstrate the use of each as data is replicated between an Oracle 12c Multi-tenant Database and a MySQL Community Server 5.7 Database.

Main Article

What are Oracle GoldenGate Tokens?

Tokens are labels used to identify user defined data stored in the Oracle GoldenGate Trail Record Header. Tokens are defined via the Extract TABLE parameter and must consist of a name identifying the token and the token data. The token data character string may be up to 2000 bytes in length and may be either user specified text enclosed within single quotes or the results of an Oracle GoldenGate Column Conversion Function.

When using tokens in the replication stream, the Extract Data Pump cannot be in PASSTHRU mode.

Token data may be used in the COLMAP clause of a Replicat MAP statement, within a SQLEXEC, a UserExit, or a Macro. To retrieve the token data from the Oracle GoldenGate Trail, use the Column Conversion Function @TOKEN as input to any of the previously mentioned parameters.

Demonstration Environment

We will be replicating the sample Oracle HR database EMPLOYEES table to MySQL.

The Oracle table specifications are:

CREATE TABLE "HR"."EMPLOYEES"
( "EMPLOYEE_ID" NUMBER(6,0),
"FIRST_NAME" VARCHAR2(20 BYTE),
"LAST_NAME" VARCHAR2(25 BYTE) CONSTRAINT "EMP_LAST_NAME_NN" NOT NULL ENABLE,
"EMAIL" VARCHAR2(25 BYTE) CONSTRAINT "EMP_EMAIL_NN" NOT NULL ENABLE,
"PHONE_NUMBER" VARCHAR2(20 BYTE),
"HIRE_DATE" DATE CONSTRAINT "EMP_HIRE_DATE_NN" NOT NULL ENABLE,
"JOB_ID" VARCHAR2(10 BYTE) CONSTRAINT "EMP_JOB_NN" NOT NULL ENABLE,
"SALARY" NUMBER(8,2),
"COMMISSION_PCT" NUMBER(2,2),
"MANAGER_ID" NUMBER(6,0),
"DEPARTMENT_ID" NUMBER(4,0),
CONSTRAINT "EMP_SALARY_MIN" CHECK (salary > 0) ENABLE,
CONSTRAINT "EMP_EMAIL_UK" UNIQUE ("EMAIL"),
CONSTRAINT "EMP_EMP_ID_PK" PRIMARY KEY ("EMPLOYEE_ID"),
CONSTRAINT "EMP_DEPT_FK" FOREIGN KEY ("DEPARTMENT_ID")
REFERENCES "HR"."DEPARTMENTS" ("DEPARTMENT_ID") ENABLE,
CONSTRAINT "EMP_JOB_FK" FOREIGN KEY ("JOB_ID")
REFERENCES "HR"."JOBS" ("JOB_ID") ENABLE,
CONSTRAINT "EMP_MANAGER_FK" FOREIGN KEY ("MANAGER_ID")
REFERENCES "HR"."EMPLOYEES" ("EMPLOYEE_ID") ENABLE
);

The MySQL table specifications are:

CREATE TABLE `EMPLOYEES` (
`EMPLOYEE_ID` decimal(6,0) NOT NULL,
`FIRST_NAME` varchar(20) DEFAULT NULL,
`LAST_NAME` varchar(25) NOT NULL,
`EMAIL` varchar(25) NOT NULL,
`PHONE_NUMBER` varchar(20) DEFAULT NULL,
`HIRE_DATE` date NOT NULL,
`JOB_ID` varchar(10) NOT NULL,
`SALARY` decimal(8,2) DEFAULT NULL,
`COMMISSION_PCT` decimal(2,2) DEFAULT NULL,
`MANAGER_ID` decimal(6,0) DEFAULT NULL,
`DEPARTMENT_ID` varchar(45) DEFAULT NULL,
PRIMARY KEY (`EMPLOYEE_ID`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

CREATE TABLE `OGG_TOKENS` (
`EMPLOYEE_ID` decimal(6,0) NOT NULL,
`OGG_TOKEN_NAME` varchar(200) NOT NULL,
`OGG_TOKEN_DATA` varchar(2000) DEFAULT NULL,
PRIMARY KEY (`EMPLOYEE_ID`, `OGG_TOKEN_NAME`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

Token Example

In my source Integrated Extract, I will create the token tkn-example-test and replicate the data downstream to MySQL. In the Integrated Extract parameter file, I add the token to the TABLE statement:

extract epdborcl
userid c##ggadmin, password AACAAAAAAAAAAAHAAIFBOIYAMCGIMARE, encryptkey default
exttrail ./dirdat/ea
logallsupcols
updaterecordformat compact
reportcount every 60 seconds, rate
table pdborcl.tpc.*;
table pdborcl.hr.countries;
table pdborcl.hr.departments;
table pdborcl.hr.job_history;
table pdborcl.hr.jobs;
table pdborcl.hr.locations;
table pdborcl.hr.regions;
table pdborcl.hr.employees,
tokens (tkn-example-test = 'Example token data set on Integrated Extract')
;

When data for the employees table is captured, the token id and data will be written to the Oracle GoldenGate Trail. Using logdump and setting the option usertoken detail, we can see the token data:

Hdr-Ind    :     E  (x45)     Partition  :     .  (x0c)
UndoFlag   :     .  (x00)     BeforeAfter:     A  (x41)
RecLength  :    60  (x003c)   IO Time    : 2016/11/21 11:27:22.001.435
IOType     :   134  (x86)     OrigNode   :   255  (xff)
TransInd   :     .  (x00)     FormatType :     R  (x52)
SyskeyLen  :     0  (x00)     Incomplete :     .  (x00)
AuditRBA   :        272       AuditPos   : 11324432
Continued  :     N  (x00)     RecCount   :     1  (x01)
2016/11/21 11:27:22.001.435 GGSUnifiedUpdate     Len    60 RBA 2568
Name: PDBORCL.HR.EMPLOYEES  (TDR Index: 1)
After  Image:                                             Partition 12   GU b
0000 001c 0000 000a 0000 0000 0000 0000 00cf 000a | ………………..
000a 0000 0000 0000 0000 003c 0000 000a 0000 0000 | ………..<……..
0000 0000 00cf 000a 000a 0000 0000 0000 0000 00d2 | ………………..
User tokens:   62 bytes
tkn-example-test    : Example token data set on Integrated Extract

To send data downstream to the MySQL GoldenGate instance, I create an Extract Data Pump with the following settings:

extract pmysql
rmthost 192.168.120.46, mgrport 15000
rmttrail ./dirdat/om
userid c##ggadmin, password AACAAAAAAAAAAAHAAIFBOIYAMCGIMARE, encryptkey default
table pdborcl.hr.*;

We can verify the data delivery to the target GoldenGate instance by viewing the Remote GoldenGate Trail with logdump:

Hdr-Ind    :     E  (x45)     Partition  :     .  (x0c)
UndoFlag   :     .  (x00)     BeforeAfter:     A  (x41)
RecLength  :    60  (x003c)   IO Time    : 2016/11/21 11:27:22.001.435
IOType     :   134  (x86)     OrigNode   :   255  (xff)
TransInd   :     .  (x03)     FormatType :     R  (x52)
SyskeyLen  :     0  (x00)     Incomplete :     .  (x00)
AuditRBA   :        272       AuditPos   : 20475408
Continued  :     N  (x00)     RecCount   :     1  (x01)
2016/11/21 11:27:22.001.435 GGSUnifiedUpdate     Len    60 RBA 3903
Name: PDBORCL.HR.EMPLOYEES  (TDR Index: 1)
After  Image:                                             Partition 12   GU s
0000 001c 0000 000a 0000 0000 0000 0000 00cf 0007 | ………………..
000a 0000 0000 0000 0098 9680 0000 000a 0000 0000 | ………………..
0000 0000 00cf 0007 000a 0000 0000 0000 00a7 d8c0 | ………………..
User tokens:   62 bytes
tkn-example-test    : Example token data set on Integrated Extract

To apply the data to the MySQL OGG_TOKENS table, I use the COLMAP parameter and @TOKEN Column Conversion Function in the Replicat parameter file:

replicat ro12hr
targetdb hr@localhost, userid ggadmin, password AACAAAAAAAAAAAHAAIFBOIYAMCGIMARE, encryptkey default
reportcount every 60 seconds, rate
insertupdates
map pdborcl.hr.employees, target hr.OGG_TOKENS,
colmap (usedefaults,
OGG_TOKEN_NAME = 'tkn-example-test',
OGG_TOKEN_DATA = @token('tkn-example-test')
);
noinsertupdates
map pdborcl.hr.*, target hr.*;

To verify that the token information is inserted into the target table, we can use MySQL Workbench to query the target, which returns:

# EMPLOYEE_ID, OGG_TOKEN_NAME, OGG_TOKEN_DATA
‘208’, ‘tkn-example-test’, ‘Example token data set on Integrated Extract’

This simple demonstration is not very useful beyond showing how to set and retrieve a token. Tokens become valuable when you want to record information about the Oracle GoldenGate environment that is useful for creating history tables, monitoring details about the replication environment, or recording details about the database and operating system environment. To obtain this information we use the @GETENV Column Conversion Function.

 

@GETENV Column Conversion Function

The @GETENV Column Conversion Function is used to obtain information about the Oracle GoldenGate environment. The information returned by @GETENV may be used as input to SQLEXEC queries and Stored Procedures, the COLMAP option of TABLE and MAP, TOKENS, and the UserExit GET_ENV_VALUE function.

There are too many options available for us to cover in this short article; however, the Oracle GoldenGate Windows and Unix Reference Guide provides an in-depth list of all supported function options and their use.

For our demonstration, we shall use @GETENV to do the following: (1) record any lag in each replication group, (2) get the current Julian timestamp in each replication group and use that information to compute lag, (3) get each replication group name, type, and process id, and (4) get details about the source Oracle GoldenGate environment, database environment, server, and operating system.

In the target MySQL database, create two tables for this data:

CREATE TABLE OGG_LAG_DATA (
ROW_TS timestamp(6) NOT NULL,
EXT_NAME varchar(8) NULL,
EXT_TYPE varchar(50) NULL,
EXT_PID varchar (50) NULL,
EXT_LAG_SEC bigint NULL,
DP_NAME varchar(8) NULL,
DP_TYPE varchar(50) NULL,
DP_PID varchar (50) NULL,
DP_LAG_SEC bigint NULL,
REP_NAME varchar(8) NULL,
REP_TYPE varchar(50) NULL,
REP_PID varchar (50) NULL,
REP_LAG_SEC bigint NULL,
SRC_COMMIT_TS timestamp(6) NULL,
EXT_JTS bigint NULL,
EXT_LAG_JTS bigint NULL,
DP_JTS bigint NULL,
DP_LAG_JTS bigint NULL,
REP_JTS bigint NULL,
REP_LAG_JTS bigint NULL,
PRIMARY KEY (ROW_TS)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

CREATE TABLE OGG_ENV_DATA (
ROW_TS timestamp(6) NOT NULL,
SOURCE_SERVER varchar(100) NULL,
SOURCE_OS_TYPE varchar(100) NULL,
SOURCE_OS_VERSION varchar(100) NULL,
SOURCE_HARDWARE varchar(100) NULL,
SOURCE_GG_VERSION varchar(100) NULL,
SOURCE_DB_NAME varchar(100) NULL,
SOURCE_DB_INSTANCE varchar(100) NULL,
SOURCE_DB_TYPE varchar(100) NULL,
SOURCE_DB_VERSION varchar(100) NULL,
TARGET_DB_NAME varchar(100) NULL,
TARGET_DB_VERSION varchar(100) NULL,
PRIMARY KEY (ROW_TS)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

To record information about the source environment, modify the Integrated Extract and Extract Data Pump configuration:

extract epdborcl
userid c##ggadmin, password AACAAAAAAAAAAAHAAIFBOIYAMCGIMARE, encryptkey default
exttrail ./dirdat/ea
logallsupcols
updaterecordformat compact
reportcount every 60 seconds, rate
table pdborcl.tpc.*;
table pdborcl.hr.countries;
table pdborcl.hr.departments;
table pdborcl.hr.job_history;
table pdborcl.hr.jobs;
table pdborcl.hr.locations;
table pdborcl.hr.regions;
table pdborcl.hr.employees,
tokens (
tkn-ext-group = @GETENV ('GGENVIRONMENT', 'GROUPNAME'),
tkn-ext-type = @GETENV ('GGENVIRONMENT', 'GROUPTYPE'),
tkn-ext-pid = @GETENV ('GGENVIRONMENT', 'PROCESSID'),
tkn-ext-lag = @GETENV ('LAG', 'SEC'),
tkn-ext-jts = @GETENV ('JULIANTIMESTAMP')
);

In the EPDBORCL Integrated Extract parameter file, we will use @GETENV to set tokens for information about the GoldenGate operating environment, the processing lag in seconds, and the current system time (in Julian Timestamp format) when the Extract processed its latest record. We’ll use this timestamp downstream in the Replicat as an alternative method for computing lag.

extract pmysql
rmthost 192.168.120.46, mgrport 15000
rmttrail ./dirdat/om
userid c##ggadmin, password AACAAAAAAAAAAAHAAIFBOIYAMCGIMARE, encryptkey default
table pdborcl.hr.employees,
tokens (
tkn-dp-group = @GETENV ('GGENVIRONMENT', 'GROUPNAME'),
tkn-dp-type = @GETENV ('GGENVIRONMENT', 'GROUPTYPE'),
tkn-dp-pid = @GETENV ('GGENVIRONMENT', 'PROCESSID'),
tkn-dp-lag = @GETENV ('LAG', 'SEC'),
tkn-dp-jts = @GETENV ('JULIANTIMESTAMP')
);
table pdborcl.hr.*;

In the PMYSQL Extract Data Pump, we use the same settings as the Integrated Extract to gather information about its operating environment.

In the target MySQL GoldenGate instance, configure the Replicat to apply the token data, return information about its operating environment, and perform lag calculations from the Julian Timestamps recorded by the Integrated Extract and Extract Data Pump.

replicat ro12hr
targetdb hr@localhost, userid ggadmin, password AACAAAAAAAAAAAHAAIFBOIYAMCGIMARE, encryptkey default
reportcount every 60 seconds, rate
insertupdates
insertdeletes
map pdborcl.hr.employees, target hr.OGG_LAG_DATA,
colmap (usedefaults,
ROW_TS = @date ('yyyy-mm-dd hh:mi:ss.ffffff', 'JTS', @getenv ('JULIANTIMESTAMP') ),
EXT_NAME = @token ('tkn-ext-group'),
EXT_TYPE = @token ('tkn-ext-type'),
EXT_PID = @token ('tkn-ext-pid'),
EXT_LAG_SEC = @token ('tkn-ext-lag'),
DP_NAME = @token ('tkn-dp-group'),
DP_TYPE = @token ('tkn-dp-type'),
DP_PID = @token ('tkn-dp-pid'),
DP_LAG_SEC = @token ('tkn-dp-lag'),
REP_NAME = @GETENV ('GGENVIRONMENT', 'GROUPNAME'),
REP_TYPE = @GETENV ('GGENVIRONMENT', 'GROUPTYPE'),
REP_PID = @GETENV ('GGENVIRONMENT', 'PROCESSID'),
REP_LAG_SEC = @GETENV ('LAG', 'SEC'),
SRC_COMMIT_TS = @GETENV ('GGHEADER', 'COMMITTIMESTAMP'),
EXT_JTS = @token ('tkn-ext-jts'),
EXT_LAG_JTS = @datediff ('SS', @GETENV ('GGHEADER', 'COMMITTIMESTAMP'),
@date ('yyyy-mm-dd hh:mi:ss.ffffff', 'JTS', @token ('tkn-ext-jts') )
),
DP_JTS = @token ('tkn-dp-jts'),
DP_LAG_JTS = @datediff ('SS', @GETENV ('GGHEADER', 'COMMITTIMESTAMP'),
@date ('yyyy-mm-dd hh:mi:ss.ffffff', 'JTS', @token ('tkn-dp-jts') )
),
REP_JTS = @GETENV ('JULIANTIMESTAMP'),
REP_LAG_JTS = @datediff ('SS', @GETENV ('GGHEADER', 'COMMITTIMESTAMP'),
@date ('yyyy-mm-dd hh:mi:ss.ffffff', 'JTS', @getenv ('JULIANTIMESTAMP') )
)
);
map pdborcl.hr.employees, target hr.OGG_ENV_DATA,
colmap (usedefaults,
ROW_TS = @date ('yyyy-mm-dd hh:mi:ss.ffffff', 'JTS', @getenv ('JULIANTIMESTAMP') ),
SOURCE_SERVER = @GETENV ('GGFILEHEADER', 'HOSTNAME'),
SOURCE_OS_TYPE = @GETENV ('GGFILEHEADER', 'OSTYPE'),
SOURCE_OS_VERSION = @GETENV ('GGFILEHEADER', 'OSVERSION'),
SOURCE_HARDWARE = @GETENV ('GGFILEHEADER', 'HARDWARETYPE'),
SOURCE_GG_VERSION = @GETENV ('GGFILEHEADER', 'GGVERSIONSTRING'),
SOURCE_DB_NAME = @GETENV ('GGFILEHEADER', 'DBNAME'),
SOURCE_DB_INSTANCE = @GETENV ('GGFILEHEADER', 'DBINSTANCE'),
SOURCE_DB_TYPE = @GETENV ('GGFILEHEADER', 'DBTYPE'),
SOURCE_DB_VERSION = @GETENV ('GGFILEHEADER', 'DBVERSIONSTRING'),
TARGET_DB_NAME = @GETENV ('DBENVIRONMENT', 'DBNAME'),
TARGET_DB_VERSION = @GETENV ('DBENVIRONMENT', 'DBVERSION')
);
noinsertupdates
noinsertdeletes
map pdborcl.hr.*, target hr.*;

In the Replicat, @TOKEN is used to retrieve tokens in the Remote Extract Trail set by the Oracle Integrated Extract and Extract Data Pump. The @DATE Column Conversion Function converts the current server timestamp, in Julian Timestamp format, into a MySQL timestamp, which is then applied to the ROW_TS column of the target tables.

We use @GETENV to return information about the Replicat operating environment, retrieve the source record commit timestamp from the GoldenGate Trail record header, retrieve information about the source server and database from the GoldenGate Trail file header, and retrieve information about the target database environment.

@DATEDIFF computes the difference in seconds between the source record commit timestamp and the Julian Timestamp tokens recorded for each record by Integrated Extract, Extract Data Pump, and Replicat. @DATE is used to convert the Julian Timestamp to the designated timestamp format.

INSERTUPDATES and INSERTDELETES tells the Replicat to convert any update or delete operations against the source EMPLOYEES table into insert operations for the target OGG_LAG_DATA and OGG_ENV_DATA tables. These settings are toggled off via NOINSERTUPDATES and NOINSERTDELETES before the wildcard MAP statement; which ensures all source insert, update, and deletes are applied correctly to the remaining target HR tables.

We can use MySQL Workbench to review the test results:

select ROW_TS, EXT_NAME, EXT_PID, EXT_LAG_SEC, EXT_JTS, EXT_LAG_JTS from OGG_LAG_DATA;
# ROW_TS, EXT_NAME, EXT_PID, EXT_LAG_SEC, EXT_JTS, EXT_LAG_JTS
‘2016-11-21 14:56:38.970890’, ‘EPDBORCL’, ‘17253’, ‘0’, ‘212346515057710776’, ‘0’
‘2016-11-21 14:56:38.972303’, ‘EPDBORCL’, ‘17253’, ‘0’, ‘212346515057710839’, ‘0’
‘2016-11-21 14:56:38.974375’, ‘EPDBORCL’, ‘17253’, ‘0’, ‘212346515057710839’, ‘0’
‘2016-11-21 14:56:38.976066’, ‘EPDBORCL’, ‘17253’, ‘0’, ‘212346515057710839’, ‘0’

select ROW_TS, DP_NAME, DP_PID, DP_LAG_SEC, DP_JTS, DP_LAG_JTS from OGG_LAG_DATA;
# ROW_TS, DP_NAME, DP_PID, DP_LAG_SEC, DP_JTS, DP_LAG_JTS
‘2016-11-21 14:56:38.970890’, ‘PMYSQL’, ‘17763’, ’98’, ‘212346515155563697’, ’98’
‘2016-11-21 14:56:38.972303’, ‘PMYSQL’, ‘17763’, ’98’, ‘212346515155603197’, ’98’
‘2016-11-21 14:56:38.974375’, ‘PMYSQL’, ‘17763’, ’98’, ‘212346515155603197’, ’98’
‘2016-11-21 14:56:38.976066’, ‘PMYSQL’, ‘17763’, ’98’, ‘212346515155603197’, ’98’

select ROW_TS, REP_NAME, REP_PID, REP_LAG_SEC, REP_JTS, REP_LAG_JTS from OGG_LAG_DATA;
# ROW_TS, REP_NAME, REP_PID, REP_LAG_SEC, REP_JTS, REP_LAG_JTS
‘2016-11-21 14:56:38.970890’, ‘RO12HR’, ‘768’, ‘3141’, ‘212346518198970890’, ‘3141’
‘2016-11-21 14:56:38.972303’, ‘RO12HR’, ‘768’, ‘3141’, ‘212346518198972303’, ‘3141’
‘2016-11-21 14:56:38.974375’, ‘RO12HR’, ‘768’, ‘3141’, ‘212346518198974375’, ‘3141’
‘2016-11-21 14:56:38.976066’, ‘RO12HR’, ‘768’, ‘3141’, ‘212346518198976066’, ‘3141’
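As a hedged illustration of how this captured data might then be used, a simple aggregate query over the lag columns of OGG_LAG_DATA summarizes what was recorded:

-- Illustrative summary of the recorded lag figures
select min(ROW_TS) as first_row, max(ROW_TS) as last_row,
       max(EXT_LAG_SEC) as max_ext_lag_sec,
       max(DP_LAG_SEC)  as max_dp_lag_sec,
       max(REP_LAG_SEC) as max_rep_lag_sec
from OGG_LAG_DATA;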

select ROW_TS, SOURCE_SERVER, SOURCE_OS_TYPE, SOURCE_OS_VERSION, SOURCE_HARDWARE from OGG_ENV_DATA;
# ROW_TS, SOURCE_SERVER, SOURCE_OS_TYPE, SOURCE_OS_VERSION, SOURCE_HARDWARE
‘2016-11-21 14:44:46.198673’, ‘centos0ra12’, ‘Linux’, ‘#1 SMP Thu Mar 31 16:04:38 UTC 2016’, ‘x86_64’
‘2016-11-21 14:47:03.848363’, ‘centos0ra12’, ‘Linux’, ‘#1 SMP Thu Mar 31 16:04:38 UTC 2016’, ‘x86_64’
‘2016-11-21 14:47:03.849590’, ‘centos0ra12’, ‘Linux’, ‘#1 SMP Thu Mar 31 16:04:38 UTC 2016’, ‘x86_64’
‘2016-11-21 14:47:03.850831’, ‘centos0ra12’, ‘Linux’, ‘#1 SMP Thu Mar 31 16:04:38 UTC 2016’, ‘x86_64’

select ROW_TS, SOURCE_GG_VERSION from OGG_ENV_DATA;
# ROW_TS, SOURCE_GG_VERSION
‘2016-11-21 14:44:46.198673’, ‘12.2.Version 12.2.0.1.1 OGGCORE_12.2.0.1.0_PLATFORMS_151211.1401_FBO’
‘2016-11-21 14:47:03.848363’, ‘12.2.Version 12.2.0.1.1 OGGCORE_12.2.0.1.0_PLATFORMS_151211.1401_FBO’
‘2016-11-21 14:47:03.849590’, ‘12.2.Version 12.2.0.1.1 OGGCORE_12.2.0.1.0_PLATFORMS_151211.1401_FBO’
‘2016-11-21 14:47:03.850831’, ‘12.2.Version 12.2.0.1.1 OGGCORE_12.2.0.1.0_PLATFORMS_151211.1401_FBO’

select ROW_TS, SOURCE_DB_NAME, SOURCE_DB_INSTANCE, SOURCE_DB_TYPE, SOURCE_DB_VERSION from OGG_ENV_DATA;
# ROW_TS, SOURCE_DB_NAME, SOURCE_DB_INSTANCE, SOURCE_DB_TYPE, SOURCE_DB_VERSION
‘2016-11-21 14:44:46.198673’, ‘ORCL’, ‘orcl’, ‘ORACLE’, ‘12.1.Oracle Database 12c Enterprise Edition Release 12.1.0.2.0 – 64bit Production\nPL/SQL Release 12.’
‘2016-11-21 14:47:03.848363’, ‘ORCL’, ‘orcl’, ‘ORACLE’, ‘12.1.Oracle Database 12c Enterprise Edition Release 12.1.0.2.0 – 64bit Production\nPL/SQL Release 12.’
‘2016-11-21 14:47:03.849590’, ‘ORCL’, ‘orcl’, ‘ORACLE’, ‘12.1.Oracle Database 12c Enterprise Edition Release 12.1.0.2.0 – 64bit Production\nPL/SQL Release 12.’
‘2016-11-21 14:47:03.850831’, ‘ORCL’, ‘orcl’, ‘ORACLE’, ‘12.1.Oracle Database 12c Enterprise Edition Release 12.1.0.2.0 – 64bit Production\nPL/SQL Release 12.’

select ROW_TS, TARGET_DB_NAME, TARGET_DB_VERSION from OGG_ENV_DATA;
# ROW_TS, TARGET_DB_NAME, TARGET_DB_VERSION
‘2016-11-21 14:44:46.198673’, ‘hr’, ‘MySQL\nServer Version: 5.7.16\nClient Version: 5.6.14\nHost Connection: Localhost via UNIX socket\nProto’
‘2016-11-21 14:47:03.848363’, ‘hr’, ‘MySQL\nServer Version: 5.7.16\nClient Version: 5.6.14\nHost Connection: Localhost via UNIX socket\nProto’
‘2016-11-21 14:47:03.849590’, ‘hr’, ‘MySQL\nServer Version: 5.7.16\nClient Version: 5.6.14\nHost Connection: Localhost via UNIX socket\nProto’
‘2016-11-21 14:47:03.850831’, ‘hr’, ‘MySQL\nServer Version: 5.7.16\nClient Version: 5.6.14\nHost Connection: Localhost via UNIX socket\nProto’

 

Summary

In this article we presented Oracle GoldenGate TOKENS along with the @TOKEN and @GETENV Column Conversion Functions and demonstrated their use by replicating data between an Oracle Multi-tenant Database and a MySQL Community Server Database.

Loading Data into Oracle BI Cloud Service using BI Publisher Reports and REST Web Services


Introduction

This post details a method of loading data that has been extracted from Oracle Business Intelligence Publisher (BIP) into the Oracle Business Intelligence Cloud Service (BICS). The BIP instance may either be Cloud-Based or On-Premise.

It builds upon the A-Team post Extracting Data from Oracle Business Intelligence 12c Using the BI Publisher REST API. This post uses REST web services to extract data from an XML-formatted BIP report.

The method uses the PL/SQL language to wrap the REST extract, XML parsing commands, and database table operations. It produces a BICS staging table which can then be transformed into star-schema object(s) for use in modeling.  The transformation processes and modeling are not discussed in this post.

Additional detailed information, including the complete text of the procedure described, is included in the References section at the end of the post.

Rationale for using PL/SQL

PL/SQL is the only procedural tool that runs on the BICS / Database Schema Service platform. Other wrapping methods e.g. Java, ETL tools, etc. require a platform outside of BICS to run on.

PL/SQL can utilize native SQL commands to operate on the BICS tables. Other methods require the use of the BICS REST API.

Note: PL/SQL is very good at showcasing functionality. However, it tends to become prohibitively resource intensive when deployed in an enterprise production environment.

For the best enterprise deployment, an ETL tool such as Oracle Data Integrator (ODI) should be used to meet these requirements and more:

* Security

* Logging and Error Handling

* Parallel Processing – Performance

* Scheduling

* Code Re-usability and Maintenance

The steps below depict how to load a BICS table.

About the BIP Report

The report used in this post is named BIP_DEMO_REPORT and is stored in a folder named Shared Folders/custom as shown below:

BIP Report Location

The report is based on a simple analysis with three columns and output as shown below:

BIP Demo Analysis

Note: The method used here requires all column values in the BIP report to be NOT NULL for two reasons:

* The XPATH parsing command signals either the end of a row or the end of the data when a null result is returned.

* All columns being NOT NULL ensures that the result set is dense and not sparse. A dense result set ensures that each column is represented in each row.

Additional information regarding dense and sparse result sets may be found in the Oracle document Database PL/SQL Language Reference.

One way to ensure a column is not null is to use the IFNull function in the analysis column definition as shown below:

BIP IFNULL Column Def

Call the BIP Report

The REST API request used here is similar to the one detailed in Extracting Data from Oracle Business Intelligence 12c Using the BI Publisher REST API. The REST API request should be constructed and tested using a REST API testing tool e.g. Postman

This step uses the APEX_WEB_SERVICE package to issue the REST API request and return the result in a CLOB variable. The key inputs to the package call are:

* The URL for the report request service

* Two request headers to be sent for authorization and content.

* The REST body the report request service expects.

* An optional proxy override

An example URL is below:

http://hostname/xmlpserver/services/rest/v1/reports/custom%2FBIP_DEMO_REPORT/run

Note: Any ASCII special characters used in a value within a URL, as opposed to syntax, needs to be referenced using its ASCII code prefixed by a % sign. In the example above, the slash (/) character is legal in the syntax but not for the value of the report location. Thus the report location, “custom/BIP_DEMO_REPORT” must be shown as custom%2FBIP_DEMO_REPORT where 2F is the ASCII code for a slash character.
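If the escaping needs to be done programmatically rather than by hand, one option (an assumption for illustration; the procedure described here can just as easily hard-code the escaped path) is the standard UTL_URL package:

declare
  l_escaped varchar2(200);
begin
  -- Escape reserved characters in the report path, e.g. '/' becomes %2F
  l_escaped := utl_url.escape('custom/BIP_DEMO_REPORT', true);
  dbms_output.put_line(l_escaped);  -- custom%2FBIP_DEMO_REPORT
end;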

An example request Authorization header is below.

apex_web_service.g_request_headers(1).name := 'Authorization';
apex_web_service.g_request_headers(1).value := 'Basic cHJvZG5leTpBZG1pbjEyMw==';

Note: The authorization header value is the string ‘Basic ‘ concatenated with a Base64 encoded representation of a username and password separated by a colon e.g.  username:password

Encoding of the Base64 result should first be tested with a Base64 encoding tool e.g. base64encode.org
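The encoded value can also be produced inside PL/SQL. A minimal sketch using the standard UTL_ENCODE and UTL_RAW packages is below; the username and password are placeholders:

declare
  l_auth_header varchar2(400);
begin
  -- Build the Basic authorization header value from username:password
  l_auth_header := 'Basic ' ||
    utl_raw.cast_to_varchar2(
      utl_encode.base64_encode(
        utl_raw.cast_to_raw('username:password')));
  apex_web_service.g_request_headers(1).name  := 'Authorization';
  apex_web_service.g_request_headers(1).value := l_auth_header;
end;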

An example of the Content-Type header is below:

apex_web_service.g_request_headers(2).name := 'Content-Type';
apex_web_service.g_request_headers(2).value := 'multipart/form-data; boundary="Boundary_1_1153447573_1465550731355"';

Note: The boundary value entered here in the header is for usage in the body below. The boundary text may be any random text not used elsewhere in the request.

An example of a report request body is below:

--Boundary_1_1153447573_1465550731355
Content-Type: application/json
Content-Disposition: form-data; name="ReportRequest"

{"byPassCache":true,"flattenXML":false}
--Boundary_1_1153447573_1465550731355--

An example proxy override is below:

www-proxy.us.oracle.com

 An example REST API call:

f_report_clob := apex_web_service.make_rest_request(
p_url => p_report_url,
p_body => l_body,
p_http_method => 'POST',
p_proxy_override => l_proxy_override );

Parse the BIP REST Result

The BIP REST result is the report XML data embedded in text with form-data boundaries.

This step uses the:

* INSTR function to determine the beginning and end of the embedded XML

* SUBSTR function to extract just the embedded XML and store it in a CLOB variable

* XMLTYPE.createXML function to convert and return the XML.

The key inputs to this step are:

* The CLOB returned from BIP REST call above

* The XML root name returned from the BIP report, e.g. DATA_DS

An example of the REST result returned is below:

--Boundary_2_1430729833_1479236681852

Content-Type: application/json

Content-Disposition: form-data; name=”ReportResponse”

{“reportContentType”:”text/xml”}

--Boundary_2_1430729833_1479236681852

Content-Type: application/octet-stream

Content-Disposition: form-data; filename="xmlp2414756005405263619tmp"; modification-date="Tue, 15 Nov 2016 19:04:41 GMT"; size=1242; name="ReportOutput"

<?xml version="1.0" encoding="UTF-8"?>

<!--Generated by Oracle BI Publisher 12.2.1.1.0 -Dataengine, datamodel:_custom_BIP_DEMO_MODEL_xdm -->

<DATA_DS><SAW.PARAM.ANALYSIS></SAW.PARAM.ANALYSIS>

<G_1>

<COLUMN0>Accessories</COLUMN0><COLUMN1>5161697.87</COLUMN1><COLUMN2>483715</COLUMN2>

</G_1>

<G_1>

         <COLUMN0>Smart Phones</COLUMN0><COLUMN1>6773120.36</COLUMN1><COLUMN2>633211</COLUMN2>

</G_1>

</DATA_DS>

--Boundary_2_1430729833_1479236681852--

Examples of the string functions to retrieve and convert just the XML are below. The f_report_clob variable contains the result of the REST call. The p_root_name variable contains the BIP report specific XML rootName.

To find the starting position of the XML, the INSTR function searches for the opening tag consisting of the root name prefixed with a ‘<’ character, e.g. <DATA_DS:

f_start_position := instr( f_report_clob, '<' || p_root_name );

To find the length of the XML, the INSTR function searches for the position of the closing tag consisting of the root name prefixed with the ‘</’ characters, e.g. </DATA_DS, determines and adds the length of the closing tag using the LENGTH function, and subtracts the starting position:

f_xml_length := instr( f_report_clob, '</' || p_root_name ) + length( '</' || p_root_name || '>' ) - f_start_position;

To extract the XML and store it in a CLOB variable, the SUBSTR function uses the starting position and the length of the XML:

f_xml_clob := substr(f_report_clob, f_start_position, f_xml_length );

To convert the CLOB into an XMLTYPE variable:

f_xml := XMLTYPE.createXML( f_xml_clob );

Create a BICS Table

This step uses a SQL command to create a simple staging table that has 20 identical varchar2 columns. These columns may be transformed into number and date data types in a future transformation exercise that is not covered in this post.

A When Others exception block allows the procedure to proceed if an error occurs because the table already exists.

A shortened example of the create table statement is below:

execute immediate 'create table staging_table ( c01 varchar2(2048), … , c20 varchar2(2048) )';

Load the BICS Table

This step uses SQL commands to truncate the staging table and insert rows from the BIP report XML content.

The XML content is parsed using an XPATH command inside two LOOP commands.

The first loop processes the rows by incrementing a subscript.  It exits when the first column of a new row returns a null value.  The second loop processes the columns within a row by incrementing a subscript. It exits when a column within the row returns a null value.

The following XPATH examples are for a data set that contains 11 rows and 3 columns per row:

//G_1[2]/*[1]/text()          -- Returns the value of the first column of the second row

//G_1[2]/*[4]/text()          -- Returns a null value for the 4th column signaling the end of the row

//G_1[12]/*[1]/text()        -- Returns a null value for the first column of a new row signaling the end of the data set

After each row is parsed, it is inserted into the BICS staging table.

An image of the staging table result is shown below:

BIP Table Output

Summary

This post detailed a method of loading data that has been extracted from Oracle Business Intelligence Publisher (BIP) into the Oracle Business Intelligence Cloud Service (BICS).

Data was extracted and parsed from an XML-formatted BIP report using REST web services wrapped in the Oracle PL/SQL APEX_WEB_SERVICE package.

A BICS staging table was created and populated. This table can then be transformed into star-schema objects for use in modeling.

For more BICS and BI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

References

Complete Text of Procedure Described

Extracting Data from Oracle Business Intelligence 12c Using the BI Publisher REST API

Database PL/SQL Language Reference

Reference Guide for the APEX_WEB_SERVICE

REST API Testing Tool

XPATH Testing Tool

Base64 decoding and encoding Testing Tool

 

 

Best Practices – Data movement between Oracle Storage Cloud Service and HDFS


Introduction

Oracle Storage Cloud Service should be the central place for persisting raw data produced from other PaaS services and also the entry point for data that is uploaded from the customer’s data center. Big Data Cloud Service (BDCS) supports data transfers between Oracle Storage Cloud Service and HDFS. Both Hadoop and Oracle provide various tools and Oracle engineered solutions for this data movement. This document outlines the various tools and describes best practices to improve data transfer usability between Oracle Storage Cloud Service and HDFS.

Main Article

Architectural Overview

 

new_oss_architecture

Interfaces to Oracle Storage Cloud Service

 

Interface: Resource

odcp: Accessing Oracle Storage Cloud Service Using Oracle Distributed Copy

Distcp: Accessing Oracle Storage Cloud Service Using Hadoop Distcp

Upload CLI: Accessing Oracle Storage Cloud Service Using the Upload CLI Tool

Hadoop fs -cp: Accessing Oracle Storage Cloud Service Using the Hadoop File System shell copy

Oracle Storage Cloud Software Appliance: Accessing Oracle Storage Cloud Service Using Oracle Storage Cloud Software Appliance

Application Programming platform:
Java Library – Accessing Oracle Storage Cloud Service Using Java Library
File Transfer Manager API – Accessing Oracle Storage Cloud Service Using File Transfer Manager API
REST API – Accessing Oracle Storage Cloud Service Using REST API

 

Oracle Distributed Copy (odcp)

Oracle Distributed Copy (odcp) is a tool for copying very large data files in a distributed environment between HDFS and an Oracle Storage Cloud Service.

  • How does it work?

odcp tool has two main components.

(a) odcp launcher script

(b) conductor application

odcp launcher script is a bash script serving as a launcher for the spark application which provides a fully parallel transfer of files.

Conductor application is an Apache Spark application to copy large files between HDFS and an Oracle Storage Cloud Service.

For end users it is recommended to use the odcp launcher script. The odcp launcher script simplifies the usage of the Conductor application by encapsulating the environment variable setup for Hadoop/Java, the spark-submit parameter setup, the invocation of the Spark application, and so on. Calling the Conductor application directly is the ideal approach when performing the data movement from a Spark application.

blog3

odcp takes the given input file (source) and splits it into smaller file chunks. Each input chunk is then transferred by one executor over the network to the destination store.

basic-flow

When all chunks are successfully transferred, executors take output chunks and merge them into final output files.

flow

  • Examples

Oracle Storage Cloud Service is based on Swift, the open-source OpenStack Object Store. The data stored in Swift can be used as the direct input to a MapReduce job by simply using the “swift://<URL>” to declare the source of the data. In a Swift File system URL, the hostname part of the URL identifies the container and the service to work with; the path identifies the name of the object.

Swift syntax:

swift://<MyContainer.MyProvider>/<filename>

odcp launcher script

Copy file from HDFS to Oracle Storage Cloud Service

odcp hdfs:///user/oracle/data.raw swift://myContainer.myProvider/data.raw

Copy file from Oracle Storage Cloud Service to HDFS:

odcp swift://myContainer.myProvider/data.raw hdfs:///user/oracle/odcp-data.raw

Copy directory from HDFS to Oracle Storage Cloud Service:

odcp hdfs:///user/data/ swift://myContainer.myProvider/backup

In case the system has more than 3 nodes, transfer speed can be increased by specifying a higher number of executors. For 6 nodes, use the following command:

odcp --num-executors=6 hdfs:///user/oracle/data.raw swift://myContainer.myProvider/data.raw

 

Highlights of the odcp launcher script options
--executor-cores: The number of executor cores. This specifies the thread count, which depends on the available vCPUs, and allows the copy to run in parallel based on that thread count. The default value is 30.
--num-executors: The number of executors. This will be the same as the number of physical nodes/VMs. The default value is 3.

 

Conductor application

Usage: Conductor [options] <source URI...> <destination URI>
<source URI...> <destination URI>
source/destination file(s) URI, examples:
hdfs://[HOST[:PORT]]/<path>
swift://<container>.<provider>/<path>
file:///<path>
-i <value> | --fsSwiftImpl <value>
swift file system configuration. Default taken from etc/hadoop/core-site.xml (fs.swift.impl)
-u <value> | --swiftUsername <value>
swift username. Default taken from etc/hadoop/core-site.xml fs.swift.service.<PROVIDER>.username)
-p <value> | --swiftPassword <value>
swift password. Default taken from etc/hadoop/core-site.xml (fs.swift.service.<PROVIDER>.password)
-i <value> | --swiftIdentityDomain <value>
swift identity domain. Default taken from etc/hadoop/core-site.xml (fs.swift.service.<PROVIDER>.tenant)
-a <value> | --swiftAuthUrl <value>
swift auth URL. Default taken from etc/hadoop/core-site.xml (fs.swift.service.<PROVIDER>.auth.url)
-P <value> | --swiftPublic <value>
indicates if all URLs are public - yes/no (default yes). Default taken from etc/hadoop/core-site.xml (fs.swift.service.<PROVIDER>.public)
-r <value> | --swiftRegion <value>
swift Keystone region
-b <value> | --blockSize <value>
destination file block size (default 268435456 B), NOTE: remainder after division of partSize by blockSize must be equal to zero
-s <value> | --partSize <value>
destination file part size (default 1073741824 B), NOTE: remainder after division of partSize by blockSize must be equal to zero
-e <value> | --srcPattern <value>
copies file when their names match given regular expression pattern, NOTE: ignored when used with --groupBy
-g <value> | --groupBy <value>
concatenate files when their names match given regular expression pattern
-n <value> | --groupName <value>
group name (use only with --groupBy), NOTE: slashes are not allowed
--help
display this help and exit

 

One can submit the Conductor application directly to a Spark deployment environment for execution using spark-submit. Below is an example of how to submit the Conductor application.

spark-submit
--conf spark.yarn.executor.memoryOverhead=600
--jars hadoop-openstack-spoc-2.7.2.jar,scopt_2.10-3.4.0.jar
--class oracle.paas.bdcs.conductor.Conductor
--master yarn
--deploy-mode client
--executor-cores <number of executor cores e.g. 5>
--executor-memory <memory size e.g. 40G>
--driver-memory <driver memory size e.g. 10G>
original-conductor-1.0-SNAPSHOT.jar
--swiftUsername <oracle username@oracle.com>
--swiftPassword <password>
--swiftIdentityDomain <storage ID assigned to this user>
--swiftAuthUrl https://<Storage cloud domain name e.g. storage.us2.oraclecloud.com:443>/auth/v2.0/tokens
--swiftPublic true
--fsSwiftImpl org.apache.hadoop.fs.swift.snative.SwiftNativeFileSystem
--blockSize <block size e.g. 536870912>
swift://<container.provider e.g. rstrejc.a424392>/someDirectory
swift://<container.provider e.g. rstrejc.a424392>/someFile
hdfs:///user/oracle/

  • Limitations

odcp consumes a significant amount of cluster resources. When running other Spark/MapReduce jobs in parallel with odcp, adjust the number of executors, the amount of memory available to the executors, or the number of executor cores using the --num-executors, --executor-memory, and --executor-cores options for better performance.

 

Distcp

Distcp (distributed copy) is a Hadoop utility used for inter/intra-cluster copying of large amounts of data in parallel. The Distcp command submits a regular MapReduce job that performs a file-by-file copy.

  • How does it work?

Distcp involves two steps:

(a) Building the list of files to copy (known as the copy list)

(b) Running a MapReduce job to copy files, with the copy list as input

distcp

The MapReduce job that does the copying has only mappers; each mapper copies a subset of the files in the copy list. By default, the copy list is a complete list of all files in the source directories given to Distcp.

 

  • Examples

 

Copying data from HDFS to Oracle Storage Cloud Service syntax:

hadoop distcp hdfs://<hadoop namenode>/<source filename> swift://<MyContainer.MyProvider>/<destination filename>

Allocation of JVM heap-size:   

export HADOOP_CLIENT_OPTS="-Xms<start heap memory size> -Xmx<max heap memory size>"

Setting timeout syntax:

hadoop distcp -Dmapred.task.timeout=<time in milliseconds> hdfs://<hadoop namenode>/<source filename> swift://<MyContainer.MyProvider>/<destination filename>

Hadoop getmerge syntax:

bin/hadoop fs -getmerge [-nl] <source directory> <destination directory>/<output filename>

The Hadoop getmerge command takes a source directory and a destination file as input and concatenates the source files into the destination local file. The -nl option can be set to add a newline character at the end of each file.

 

  • Limitations

For a large file copy, make sure the task has a termination strategy in case it does not read input, write output, or update its status string. The -Dmapred.task.timeout=<time in milliseconds> option can be used to set the maximum timeout value. For a 1 TB file, use -Dmapred.task.timeout=60000000 (approximately 16 hours) with the Distcp command.
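A concrete form of that command, combining the timeout syntax shown above with the suggested value and the example paths used earlier, might look like this:

hadoop distcp -Dmapred.task.timeout=60000000 hdfs:///user/oracle/data.raw swift://myContainer.myProvider/data.raw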

Distcp might run out of memory while copying very large files. To get around this, consider increasing the -Xmx JVM heap-size parameter before executing the hadoop distcp command. This value must be a multiple of 1024.

To improve the transfer speed for a very large file, split the file at the source and copy the split files to the destination. Once the files are successfully transferred, merge them at the destination, for example with the Hadoop getmerge command described above. A sketch of this workflow follows.
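A minimal sketch of that split/copy/merge workflow, assuming a local copy of the source file, hypothetical paths, and a Swift file system already configured in core-site.xml:

# Split the local source file into 10 GB chunks (hypothetical paths)
split -b 10G /data/source/data.raw /data/source/chunks/data.raw.part-
# Load the chunks into HDFS
hdfs dfs -put /data/source/chunks /user/oracle/chunks
# Copy the chunk directory in parallel with Distcp
hadoop distcp hdfs:///user/oracle/chunks swift://myContainer.myProvider/chunks
# Merge the chunks into a single local file at the destination side
hadoop fs -getmerge swift://myContainer.myProvider/chunks /tmp/data.raw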

Upload CLI

 

  • How does it work?

The Upload CLI tool is a cross-platform, Java-based command line tool that you can use to efficiently upload files to Oracle Storage Cloud Service. The tool optimizes uploads through segmentation and parallelization to maximize network efficiency and reduce overall upload time. If a large file transfer is interrupted, the Upload CLI tool maintains state and resumes from the point where the transfer was interrupted. The tool also retries automatically on failures.

  • Example:

Syntax of upload CLI:

java -jar uploadcli.jar -url REST_Endpoint_URL -user userName -container containerName file-or-files-or-directory

To upload a file named file.txt to a standard container myContainer in the domain myIdentityDomain as the user abc.xyz@oracle.com, run the following command:

java -jar uploadcli.jar -url https://foo.storage.oraclecloud.com/myIdentityDomain-myServiceName -user abc.xyz@oracle.com -container myContainer file.txt

When running the Upload CLI tool on a host that’s behind a proxy server, specify the host name and port of the proxy server by using the https.proxyHost and https.proxyPort Java parameters.

 

Syntax of upload CLI behind proxy server:

java -Dhttps.proxyHost=host -Dhttps.proxyPort=port -jar uploadcli.jar -url REST_Endpoint_URL -user userName -container containerName file-or-files-or-directory

  • Limitations

Upload CLI is a Java tool and will only run on hosts that satisfy the prerequisites of the uploadcli tool.

 

Hadoop fs -cp

 

  • How does it work?

Hadoop fs -cp is one of the Hadoop file system shell commands and is run from the command line interface of the source operating system. Hadoop fs -cp is not distributed across the cluster; the command transfers data byte by byte through the machine where it is issued.

  • Example

hadoop fs -cp /user/hadoop/file1 /user/hadoop/file2
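Because the command accepts full file system URIs, a copy from HDFS to Oracle Storage Cloud Service can be issued directly (a sketch reusing the container and provider names from the earlier examples):

hadoop fs -cp hdfs:///user/oracle/data.raw swift://myContainer.myProvider/data.raw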

 

  • Limitations

The byte-by-byte transfer takes a very long time to copy a large file from HDFS to Oracle Storage Cloud Service.

 

Oracle Storage Cloud Software Appliance

 

  • How does it work?

Oracle Storage Cloud Software Appliance is a product that facilitates easy, secure, reliable data storage and retrieval from Oracle Storage Cloud Service. Businesses can use Oracle Cloud Storage without changing their data center applications and workflows. Applications that use a standard file-based network protocol such as NFS to store and retrieve data can use Oracle Storage Cloud Software Appliance as a bridge between the object storage used by Oracle Storage Cloud Service and standard file storage. Oracle Storage Cloud Software Appliance caches frequently retrieved data on the local host, minimizing the number of REST API calls to Oracle Storage Cloud Service and enabling low-latency, high-throughput file I/O.

The application host instance can mount a directory exported by the Oracle Storage Cloud Software Appliance, which acts as a cloud storage gateway. This enables the application host instance to access an Oracle Cloud Storage container as a standard NFS file system, as sketched below.
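A minimal mount sketch from an application host; the appliance hostname, export name, and mount options are placeholders that depend on how the appliance was installed and configured:

sudo mkdir -p /mnt/oscsa
sudo mount -t nfs oscsa-host:/myContainerExport /mnt/oscsa
# Files written under /mnt/oscsa are synchronized by the appliance to the mapped container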

 

Architecture

blog2

 

  • Limitations

The appliance is ideal for backup and archive use cases that require the replication of infrequently accessed data to cloud containers. Read-only and read-dominated content repositories are ideal targets. Once an Oracle Storage Cloud Service container is mapped to a filesystem in Oracle Storage Cloud Software Appliance, other data movement tools such as the REST API, odcp, Distcp, or the Java library cannot be used for that container. Doing so would cause the data in the appliance to become inconsistent with the data in Oracle Storage Cloud Service.

 

Application Programming Platform

Oracle provides various Java library APIs to access Oracle Storage Cloud Service. The following summarizes the interfaces one can use programmatically to access Oracle Storage Cloud Service, with the corresponding documentation topic for each:

Java library: Accessing Oracle Storage Cloud Service Using Java Library

File Transfer Manager API: Accessing Oracle Storage Cloud Service Using File Transfer Manager API

REST API: Accessing Oracle Storage Cloud Service Using REST API


Java Library  

 

  • How does it work?

The Java library is useful for Java applications that prefer to use the Oracle Cloud Java API for Oracle Storage Cloud Service instead of the tools provided by Oracle and Hadoop. The Java library wraps the RESTful web service API, and most of the major RESTful API features of Oracle Storage Cloud Service are available through it. The Java library is available via the separate Oracle Cloud Service Java SDK.

 

java library

  • Example

Sample Code snippet

package storageupload;

import oracle.cloud.storage.*;
import oracle.cloud.storage.model.*;
import oracle.cloud.storage.exception.*;
import java.io.*;
import java.util.*;
import java.net.*;

public class UploadingSegmentedObjects {
    public static void main(String[] args) {
        try {
            // Configure the connection to the Oracle Storage Cloud Service instance
            CloudStorageConfig myConfig = new CloudStorageConfig();
            myConfig.setServiceName("Storage-usoracleXXXXX")
                    .setUsername("xxxxxxxxx@yyyyyyyyy.com")
                    .setPassword("xxxxxxxxxxxxxxxxx".toCharArray())
                    .setServiceUrl("https://xxxxxx.yyyy.oraclecloud.com");
            CloudStorage myConnection = CloudStorageFactory.getStorage(myConfig);
            System.out.println("\nConnected!!\n");

            // Create the container if none exists yet
            if (myConnection.listContainers().isEmpty()) {
                myConnection.createContainer("myContainer");
            }

            // Store the same local file under three different object names
            FileInputStream fis = new FileInputStream("C:\\temp\\hello.txt");
            myConnection.storeObject("myContainer", "C:\\temp\\hello.txt", "text/plain", fis);
            fis = new FileInputStream("C:\\temp\\hello.txt");
            myConnection.storeObject("myContainer", "C:\\temp\\hello1.txt", "text/plain", fis);
            fis = new FileInputStream("C:\\temp\\hello.txt");
            myConnection.storeObject("myContainer", "C:\\temp\\hello2.txt", "text/plain", fis);

            // List the objects now present in the container
            List<Key> myList = myConnection.listObjects("myContainer", null);
            Iterator<Key> it = myList.iterator();
            while (it.hasNext()) {
                System.out.println(it.next().getKey().toString());
            }
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}

 

  • Limitations

The Java API cannot create an Oracle Storage Cloud Service archive container. An appropriate JRE version is required for the Java library.

 

File Transfer Manager API

 

  • How does it Work?

The File Transfer Manager (FTM) API is a Java library that simplifies uploading to and downloading from Oracle Storage Cloud Service. The File Transfer Manager provides both synchronous and asynchronous APIs to transfer files and provides a way to track operations when the asynchronous version is used. The library is available via the separate Oracle Cloud Service Java SDK.

 

  • Example

Uploading a Single File Sample Code snippet

FileTransferAuth auth = new FileTransferAuth(
    "email@oracle.com",                    // user name
    "xxxxxx",                              // password
    "yyyyyy",                              // service name
    "https://xxxxx.yyyyy.oraclecloud.com", // service URL
    "xxxxxx"                               // identity domain
);
FileTransferManager manager = null;
try {
    manager = FileTransferManager.getDefaultFileTransferManager(auth);
    String containerName = "mycontainer";
    String objectName = "foo.txt";
    File file = new File("/tmp/foo.txt");
    UploadConfig uploadConfig = new UploadConfig();
    uploadConfig.setOverwrite(true);
    uploadConfig.setStorageClass(CloudStorageClass.Standard);
    System.out.println("Uploading file " + file.getName() + " to container " + containerName);
    TransferResult uploadResult = manager.upload(uploadConfig, containerName, objectName, file);
    System.out.println("Upload completed successfully.");
    System.out.println("Upload result:" + uploadResult.toString());
} catch (ClientException ce) {
    System.out.println("Upload failed. " + ce.getMessage());
} finally {
    if (manager != null) {
        manager.shutdown();
    }
}

 

REST API

 

  • How does it work?

The REST API can be accessed from any application or programming platform that correctly and completely understands the Hypertext Transfer Protocol (HTTP). The REST API uses advanced facets of HTTP such as secure communication over HTTPS, HTTP headers, and specialized HTTP verbs (PUT, DELETE). cURL is one of the many applications that meet these requirements.

 

  • Example

cURL syntax:

curl -v -s -X PUT -H "X-Auth-Token: <Authorization Token ID>" "https://<Oracle Cloud Storage domain name>/v1/<storage ID associated with the user account>/<container name>"

 

Some Data Transfer Test results

The configuration used to measure performance and data transfer rates is as follows:

Test environment configuration:

- BDCS 16.2.5
- Hadoop Swift driver 2.7.2
- US2 production data center
- 3-node cluster running on BDA
- Each node has 256 GB memory / 30 vCPUs
- File size: 1TB (Terabyte)
- File contains all zeros

Results (# Interface: Source to Destination, Time, Comment):

1. odcp: HDFS to Oracle Storage Cloud Service, 54 minutes. Transfer rate: 2.47 Gb/sec (1.11 TB/hour).
2. hadoop Distcp: Oracle Storage Cloud Service to HDFS, failed. Not enough memory (after 1 hour).
3. hadoop Distcp: HDFS to Oracle Storage Cloud Service, failed.
4. hadoop Distcp: HDFS to Oracle Storage Cloud Service, 3 hours. Based on splitting the 1 TB file into 50 files of 10 GB each; each 10 GB file took 18 minutes (with a partition size of 256 MB).
5. Upload CLI: HDFS to Oracle Storage Cloud Service, 5 hours 55 minutes. Data was read from Big Data Cloud Service HDFS mounted using fuse_dfs.
6. hadoop fs -cp: HDFS to Oracle Storage Cloud Service, 11 hours 50 minutes 50 seconds. Parallelism 1; transfer rate: 250 Mb/sec.

 

Summary

One can draw the following conclusions from the above analysis.

Data file size and data transfer time are the two main factors in deciding the appropriate interface for data movement between HDFS and Oracle Storage Cloud Service.

There is no additional overhead for data manipulation and processing when using the odcp interface.

Uploading a file to Oracle Storage Cloud Service using REST API


Introduction

This is the second part of a two-part article which demonstrates how to upload data in near-real time from an on-premise Oracle database to Oracle Storage Cloud Service.

In the previous article of this series, we demonstrated Oracle GoldenGate functionality to write to a flat file using Apache Flume File Roll Sink. If you would like to read the first part in this article series please visit Oracle GoldenGate : Apply to Apache Flume File Roll Sink

In this article we demonstrate using the cURL command to upload the flat file to Oracle Storage Cloud Service.

We used the Oracle Big Data Lite Virtual Machine as the test bed for this article. The VM image is available for download on the Oracle Technology Network website.

Main Article

There are various tools available to access Oracle Storage Cloud Service. According to Best Practices – Data movement between Oracle Storage Cloud Service and HDFS , cURL REST interface is appropriate for this requirement.

cURL REST Interface

REST API

REST API is used to manage containers and objects in the Oracle Storage Cloud Service instance. Anyone can access the REST API from any application or programming platform that understands the Hypertext Transfer Protocol (HTTP) and has Internet connectivity.

cURL is one of the tools used to access the REST interface. cURL is an open source tool used for transferring data which supports various protocols including HTTP and HTTPS. cURL is typically available by default on most UNIX-like hosts. For information about downloading and installing cURL, see Quick Start.

Oracle Storage Cloud Service ( OSCS )

Oracle Storage Cloud Service enables applications to store and manage contents in the cloud. Stored objects can be retrieved directly by external clients or by applications running within Oracle Cloud (For example: Big Data Preparation Cloud Service).

A container is a storage compartment that provides a way to organize the data stored in Oracle Storage Cloud Service. Containers are similar to directories, but with a key distinction: unlike directories, containers cannot be nested.

Prerequisites

First, we need access to the Oracle Storage Cloud Service and information about the Oracle Cloud user name, password, and identity domain.

credentials

Requesting an Authentication Token

Oracle Storage Cloud Service requires authentication for any operation against the service instance. Authentication is performed by using an authentication token. Authentication tokens are requested from the service by authenticating user credentials with the service. All provisioned authentication tokens are temporary and will expire in 30 minutes. We will include a current authentication token with every request to Oracle Storage Cloud Service.

Request an authentication token by running the following cURL command:

curl -v -s -X GET -H 'X-Storage-User: <my identity domain>:<Oracle Cloud user name>' -H 'X-Storage-Pass: <Oracle Cloud user password>' https://<myIdentityDomain>.storage.oraclecloud.com/auth/v1.0

We ran the above cURL command. The following is the output of this command, with certain key lines highlighted. Note that if the request includes the correct credentials, it returns the HTTP/1.1 200 OK response.

 

OSCS_Auth_token

 

From the output of the command we just ran, note the following:

– The value of the X-Storage-Url header.

This value is the REST endpoint URL of the service. This URL value will be used in the next step to create the container.

– The value of the X-Auth-Token header.

This value is the authentication token, which will be used in the next step to create the container. Note that the authentication token expires after 30 minutes; after it expires, request a fresh token.

Creating A Container

Run the following cURL command to create a new container:

curl -v -s -X PUT -H "X-Auth-Token: <Authentication Token ID>" https://storage.oraclecloud.com/v1/Storage-myIdentityDomain/myFirstContainer

– Replace the value of the X-Auth-Token header with the authentication token that you obtained earlier.
– Change https://storage.oraclecloud.com/v1/Storage-myIdentityDomain to the X-Storage-Url header value that you noted while getting an authentication token.
– And change myFirstContainer to the name of the container that you want to create.

Verifying that A Container is created

 Run the following cURL command:

curl -v -s -X GET -H "X-Auth-Token: <Authentication Token ID>" https://storage.oraclecloud.com/v1/Storage-myIdentityDomain/myFirstContainer

If the request is completed successfully, it returns the HTTP/1.1 204 No Content response. This response indicates that there are no objects yet in the new container.

In this exercise, we are not creating a new container; we will use an existing container to upload the file, so we do not need to verify container creation.

Uploading an Object

Once Oracle GoldenGate completes writing the records to a file in the /u01/ogg-bd/flumeOut directory, the cURL program reads the file present in that directory. It then uploads the file to create an object in the container myFirstContainer. Any user with the Service Administrator role or a role that is specified in the X-Container-Write ACL of the container can create an object.

We ran the following cURL command:

curl -v -X PUT -H "X-Auth-Token: <Authentication Token ID>" -T myfile https://<MyIdentityDomain>.storage.oraclecloud.com/v1/Storage-myIdentityDomain/myFirstContainer/myObject

When running this command we…
– Replaced the value of the X-Auth-Token header with the authentication token that we obtained earlier.
– Changed https://<MyIdentityDomain>.storage.oraclecloud.com/v1/Storage-myIdentityDomain to the X-Storage-Url header value that we noted while getting an authentication token.
– Changed myFirstContainer to the name of the container that we want to use.
– Changed myfile to the full path and name of the file that we want to upload.
– Changed myObject to the name of the object that we want to create in the container.

If the request is completed successfully, it returns the HTTP/1.1 201 Created response, as shown in the following output. We verified the full transfer by comparing the Content-Length values.

 

Upload_to_OSCS
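One way to re-check the stored object afterwards is a HEAD request against the object URL, which returns the object metadata including Content-Length (a sketch; the token and URL placeholders are the same as in the upload command):

curl -s -I -H "X-Auth-Token: <Authentication Token ID>" https://<MyIdentityDomain>.storage.oraclecloud.com/v1/Storage-myIdentityDomain/myFirstContainer/myObject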

 

We also verified the proper transfer of the file to Oracle Storage Cloud Service using Big Data Preparation Cloud Service.

BDPCS_Source

Summary

In this article we demonstrated the functionality of the REST API, which uploads data from the on-premise Big Data Lite VM to Oracle Storage Cloud Service. Combining both articles, we demonstrated moving data in near real-time from an on-premise Oracle database to Oracle Storage Cloud Service using Oracle GoldenGate and the REST API.

Loading Data into Oracle BI Cloud Service using OTBI Analyses and SOAP


Introduction

This post details a method of loading data that has been extracted from Oracle Transactional Business Intelligence (OTBI) using SOAP into the Oracle Business Intelligence Cloud Service (BICS). The OTBI instance may either be Cloud-Based or On-Premise. This method may also be used to load data from Oracle Business Intelligence Enterprise Edition (OBIEE).

It builds upon the A-Team post Using Oracle BI Answers to Extract Data from HCM via Web Services which details the extraction process.

This post uses the PL/SQL language to wrap the SOAP extract, XML parsing commands, and database table operations in a stored procedure in the BICS Schema Service database. It produces a BICS staging table which can then be transformed into star-schema object(s) for use in modeling.  The transformation processes and modeling are not discussed in this post.

The most complex portion of this post details how to convert the analysis XML report results, embedded in a CDATA (Character Data) text attribute, back into standard XML markup notation so the rows and columns of data can be parsed.

Additional detailed information, including the complete text of the procedure described, is included in the References section at the end of the post.

Rationale for using PL/SQL

PL/SQL is the only procedural tool that runs on the BICS / Database Schema Service platform. Other wrapping methods e.g. Java, ETL tools, etc. require a platform outside of BICS to run on.

PL/SQL can utilize native SQL commands to operate on the BICS tables. Other methods require the use of the BICS REST API.

Note: PL/SQL is very good at showcasing functionality. However, it tends to become prohibitively resource intensive when deployed in an enterprise production environment.

For the best enterprise deployment, an ETL tool such as Oracle Data Integrator (ODI) should be used to meet these requirements and more:

* Security

* Logging and Error Handling

* Parallel Processing – Performance

* Scheduling

* Code re-usability and Maintenance

The steps below depict how to load a BICS table.

About the OTBI Analysis

The analysis used in this post is named Suppliers and is stored in a folder named Shared Folders/custom as shown below:

A

The analysis has three columns and output as shown below:

B

Note: The method used here requires all column values in the analysis to be NOT NULL for two reasons. The XPATH parsing command signals either the end of a row or the end of the data when a null result is returned. All columns being NOT NULL ensures that the result set is dense and not sparse. A dense result set ensures that each column is represented in each row. Additional information regarding dense and sparse result sets may be found in the Oracle document Database PL/SQL Language Reference.

One way to ensure a column is not null is to use the IFNull function in the analysis column definition as shown below:

C

An optional parameter may be sent at run time to filter each column.

Ensuring the Web Services are Available

To ensure that the web services are available in the required environment, type a form of the following URL into a browser:

https://hostname/analytics-ws/saw.dll/wsdl/v9

Note: The version number e.g. v9 may vary from server to server.

If you are not able to reach the website, the services may not be offered.  Discuss this with the server administrator.
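The same availability check can be scripted with cURL, which prints only the HTTP status code returned for the WSDL URL (a sketch; the hostname and version segment are placeholders, and -k skips certificate validation for a quick check):

curl -k -s -o /dev/null -w "%{http_code}\n" https://hostname/analytics-ws/saw.dll/wsdl/v9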

Calling the Analysis

Calling the analysis is a two-step process. The first step initiates a session in OTBI and returns a session ID.  The second step uses that session ID to call the analysis and extract the data.

The SOAP API requests should be constructed and tested using a SOAP API testing tool e.g. SoapUI.

Note: API testing tools such as SoapUI, cURL, Postman, and so on are third-party tools for using SOAP and REST services. Oracle does not provide support for these tools or recommend a particular tool for its APIs. You can select the tool based on your requirements.

The procedure uses the APEX_WEB_SERVICE package to issue the SOAP API requests and store the XML result in an XMLTYPE variable. The key inputs to the package call are:

* The URL for the OTBI Session Web Service

* The URL for the OTBI XML View Web Service

* The Base64 encoded credentials to access the analysis

* The SOAP envelopes expected by the OTBI Web Service.

* Optional Parameters to filter the results

* An optional proxy override

Decoding the Credentials

To avoid hard-coding credentials in the procedure, the credentials are expected to be encoded in a base64 format prior to invoking the procedure. A useful base64 encoding tool may be found at Base64 Decoding and Encoding Testing Tool. The text to encode should be in the format username:password

The APEX_WEB_SERVICE and the DBMS_LOB packages and the INSTR function are used to decode the credentials into username and password variables. The APEX_WEB_SERVICE package decodes the credentials into a BLOB variable. The DBMS_LOB package converts the BLOB to a CLOB variable. The INSTR function then separates the decoded result into the two variables.

Examples are below:

-- Decode the Base 64 Credentials
f_blob := apex_web_service.clobbase642blob(f_base64_creds);
-- Create a temporary CLOB instance
dbms_lob.createtemporary(f_clob, true);
-- Convert the decoded BLOB credentials to a CLOB
dbms_lob.converttoclob(
f_clob,
f_blob,
v_file_size,
v_dest_offset,
v_src_offset,
v_blob_csid,
v_lang_context,
v_warning);
-- Parse the credentials into username and password
f_au := substr ( f_clob, 1, instr(f_clob, ':') -1 ); -- username
f_ap := substr ( f_clob, instr(f_clob, ':') +1 ); -- password

Calling the Session Service

An example Session URL is below:

https://hostname/analytics-ws/saw.dll?SoapImpl=nQSessionService

An example Logon Request envelope is below. The result will be an envelope containing a session ID.

<soapenv:Envelope xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:v9="urn://oracle.bi.webservices/v9">
<soapenv:Header/>
<soapenv:Body>
<v9:logon>
<v9:name>username</v9:name>
<v9:password>password</v9:password>
</v9:logon>
</soapenv:Body>
</soapenv:Envelope>
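Before wrapping the call in PL/SQL, the logon envelope can also be exercised from the command line with cURL, one of the testing options mentioned above (a sketch; the hostname is a placeholder, the envelope is assumed to be saved in a local file named logon.xml, and some deployments may additionally require a SOAPAction header):

curl -k -s -X POST -H "Content-Type: text/xml;charset=UTF-8" --data @logon.xml "https://hostname/analytics-ws/saw.dll?SoapImpl=nQSessionService"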

 An example APEX_WEB_SERVICE call for the login is below:

f_xml := apex_web_service.make_request(p_url => f_session_url
,p_envelope => f_envelope
-- ,p_proxy_override => -- An optional Proxy URL
-- ,p_wallet_path => -- An optional path to an Oracle database wallet file
-- ,p_wallet_pwd => -- The password for the optional Oracle database wallet file
);

The APEX_WEB_SERVICE package is used to parse the XML result from above to obtain the session ID. An example is below:

f_session_id := apex_web_service.parse_xml_clob(p_xml => f_xml
,p_xpath => '//*:sessionid/text()'
);

Troubleshooting the Session Service Call

Three common issues are the need for a proxy, the need for a trusted certificate (if using HTTPS), and the need to use the TLS security protocol.

The need for a proxy may be detected when the following error occurs: ORA-12535: TNS:operation timed out. Adding the optional p_proxy_override  parameter to the call may correct the issue. An example proxy override is:

www-proxy.us.oracle.com

The need for a trusted certificate is detected when the following error occurs: ORA-29024: Certificate validation failure.

A workaround may be to run this procedure from a full Oracle Database Cloud Service or an on-premise Oracle database. Adding the trusted certificate(s) to an Oracle database wallet file and adding the optional p_wallet_path and p_wallet_pwd parameters to the call should correct the issue. For more information on Oracle wallets, refer to Using Oracle Wallet Manager in the References section of this post.

The need to use the TLS protocol may be detected when the following error occurs: ORA-29259: end-of-input reached.

A workaround is to run this procedure from a different Oracle Database Cloud Service or an on-premise Oracle database. Ensure the database version is 11.2.0.4.10 or above.

Additionally: When using an on-premise Oracle database, the SQL Operations described later in this post (Create Table, Truncate Table, Insert) may be modified to use the BICS REST API. For more information refer to the REST APIs for Oracle BI Cloud Service in the References section of this post.

Calling the XML View Service

An example XML View service URL is:

https://hostname/analytics-ws/saw.dll?SoapImpl=xmlViewService

An example Analysis Request envelope is below. This envelope contains the session ID from the logon call, the location of the analysis, a placeholder variable for the VNUM analysis variable, and a filter value for the VTYPE variable.

<soapenv:Envelope xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:v9="urn://oracle.bi.webservices/v9">
<soapenv:Header/>
<soapenv:Body>
<v9:executeXMLQuery>
<v9:report>
<v9:reportPath>/shared/custom/Suppliers</v9:reportPath>
<v9:reportXml></v9:reportXml>
</v9:report>
<v9:outputFormat>xml</v9:outputFormat>
<v9:executionOptions>
<v9:async></v9:async>
<v9:maxRowsPerPage></v9:maxRowsPerPage>
<v9:refresh></v9:refresh>
<v9:presentationInfo></v9:presentationInfo>
<v9:type></v9:type>
</v9:executionOptions>
<v9:reportParams>
<!–Zero or more repetitions:–>
<v9:variables>
<v9:name>VNUM</v9:name>
<v9:value></v9:value>
</v9:variables>
<v9:variables>
<v9:name>VTYPE</v9:name>
<v9:value>Supplier</v9:value>
</v9:variables>
</v9:reportParams>
<v9:sessionID>'||F_SESSION_ID||'</v9:sessionID>
</v9:executeXMLQuery>
</soapenv:Body>
</soapenv:Envelope>

An example APEX_WEB_SERVICE call for the analysis result is below:

f_xml := apex_web_service.make_request(p_url => f_report_url
,p_envelope => f_envelope
-- ,p_proxy_override => -- An optional Proxy URL
-- ,p_wallet_path => -- An optional path to an Oracle database wallet file
-- ,p_wallet_pwd => -- The password for the optional Oracle database wallet file
);

Preparing the XML Result

The XML result from the Analysis call contains the report results in a CDATA text section. In order to parse the results, the XML within the text section is converted into standard XML using the XMLTYPE package and the REPLACE function.

An example of the CDATA section result, as seen in SoapUI, is below:

<sawsoap:rowset xsi:type="xsd:string"><![CDATA[<rowset xmlns="urn:schemas-microsoft-com:xml-analysis:rowset">
<Row>
<Column0>UJ Catering Service AG</Column0>
<Column1>5991</Column1>
<Column2>Supplier</Column2>
</Row>
</rowset>]]>
</sawsoap:rowset>

The same result, as seen in APEX_WEB_SERVICE, is below:

<sawsoap:rowset xsi:type="xsd:string">&lt;rowset xmlns=&quot;urn:schemas-microsoft-com:xml-analysis:rowset&quot;&gt;
&lt;Row&gt;
&lt;Column0&gt;UJ Catering Service AG&lt;/Column0&gt;
&lt;Column1&gt;5991&lt;/Column1&gt;
&lt;Column2&gt;Supplier&lt;/Column2&gt;
&lt;/Row&gt;
&lt;/rowset&gt;
</sawsoap:rowset>

The converted result needed for parsing  is below:

<sawsoap:rowset xsi:type="xsd:string"><bi:rowset xmlns:bi="urn:schemas-microsoft-com:xml-analysis:rowset">
<Row>
<Column0>UJ Catering Service AG</Column0>
<Column1>5991</Column1>
<Column2>Supplier</Column2>
</Row>
</bi:rowset>
</sawsoap:rowset>

The XMLTYPE package and the REPLACE function usage is below. Note: the CHR(38) function returns the '&' character.

F_CLOB := F_XML.GETCLOBVAL(); -- Convert to CLOB
F_CLOB := REPLACE (F_CLOB, CHR(38)||'lt;', '<');
F_CLOB := REPLACE (F_CLOB, CHR(38)||'gt;', '>' );
F_CLOB := REPLACE (F_CLOB, CHR(38)||'quot;', '"');
F_CLOB := REPLACE (F_CLOB, '/rowset', '/bi:rowset'); -- Insert bi namespace
F_CLOB := REPLACE (F_CLOB, '<rowset', '<bi:rowset'); -- Insert bi namespace
F_CLOB := REPLACE (F_CLOB, 'xmlns=', 'xmlns:bi='); -- Insert bi namespace
F_XML := XMLTYPE.createXML( F_CLOB ); -- Convert back to XMLTYPE

Creating a BICS Table

This step uses a SQL command to create a simple staging table that has 20 identical varchar2 columns. These columns may be transformed into number and date data types in a future transformation exercise that is not covered in this post.

A When Others exception block allows the procedure to proceed if an error occurs because the table already exists. An example is below:

EXCEPTION
WHEN OTHERS THEN NULL; — Ignore error if table exists

Note: The table needs to be created once before compiling the procedure the first time. The complete DDL is below:

CREATE TABLE STAGING_TABLE
(
C01 VARCHAR2(2048 BYTE),C02 VARCHAR2(2048 BYTE), C03 VARCHAR2(2048 BYTE), C04 VARCHAR2(2048 BYTE), C05 VARCHAR2(2048 BYTE),
C06 VARCHAR2(2048 BYTE),C07 VARCHAR2(2048 BYTE), C08 VARCHAR2(2048 BYTE), C09 VARCHAR2(2048 BYTE), C10 VARCHAR2(2048 BYTE),
C11 VARCHAR2(2048 BYTE),C12 VARCHAR2(2048 BYTE), C13 VARCHAR2(2048 BYTE), C14 VARCHAR2(2048 BYTE), C15 VARCHAR2(2048 BYTE),
C16 VARCHAR2(2048 BYTE),C17 VARCHAR2(2048 BYTE), C18 VARCHAR2(2048 BYTE), C19 VARCHAR2(2048 BYTE), C20 VARCHAR2(2048 BYTE)
)

A shortened example of the create table statement is below:

execute immediate 'create table staging_table ( c01 varchar2(2048), … , c20 varchar2(2048) )';

Loading the BICS Table

This step uses SQL commands to truncate the staging table and insert rows from the analysis XML content.

The XML content is parsed using an XPATH command inside two LOOP commands.

The first loop processes the rows by incrementing a subscript.  It exits when the first column of a new row returns a null value.  The second loop processes the columns within a row by incrementing a subscript. It exits when a column within the row returns a null value.

The following XPATH examples are for a data set that contains 5 rows and 3 columns per row:

//Row[2]/*[1]/text() -- Returns the value of the first column of the second row
//Row[2]/*[4]/text() -- Returns a null value for the 4th column, signaling the end of the row
//Row[6]/*[1]/text() -- Returns a null value for the first column of a new row, signaling the end of the data set

After each row is parsed, it is inserted into the BICS staging table.

An image of the staging table result is shown below:

D

Summary

This post detailed a method of loading data that has been extracted from Oracle Transactional Business Intelligence (OTBI) using SOAP into the Oracle Business Intelligence Cloud Service.

A BICS staging table was created and populated. This table can then be transformed into star-schema objects for use in modeling.

For more BICS and BI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

References

Complete Text of Procedure Described

Using Oracle BI Answers to Extract Data from HCM via Web Services

Database PL/SQL Language Reference

Reference Guide for the APEX_WEB_SERVICE

Soap API Testing Tool

XPATH Testing Tool

Base64 Decoding and Encoding Testing Tool

Using Oracle Wallet Manager

REST APIs for Oracle BI Cloud Service

 

 

 

Loading Data from Oracle Field Service Cloud into Oracle BI Cloud Service using SOAP


Introduction

This post details a method of extracting and loading data from Oracle Field Service Cloud (OFSC) into the Oracle Business Intelligence Cloud Service (BICS).

A compelling reason to use such a method is when data is required that is not in the standard daily extract. Such data might be planning (future) data or data recently provided in new releases of the application.

This post uses SOAP web services to extract XML-formatted data responses. It also uses the PL/SQL language to wrap the SOAP extract, XML parsing commands, and database table operations in a Stored Procedure. It produces a BICS staging table and a staging view which can then be transformed into star-schema object(s) for use in modeling. The transformation processes and modeling are not discussed in this post.

Finally, an example of a database job is provided that executes the Stored Procedure on a scheduled basis.

The PL/SQL components are for demonstration purposes only and are not intended for enterprise production use. Additional detailed information, including the complete text of the PL/SQL procedure described, is included in the References section at the end of this post.

Rationale for Using PL/SQL

PL/SQL is the only procedural tool that runs on the BICS / Database Schema Service platform. Other wrapping methods e.g. Java, ETL tools, etc. require a platform outside of BICS to run on.

PL/SQL may also be used in a DBCS that is connected to BICS.

PL/SQL can utilize native SQL commands to operate on the BICS tables. Other methods require the use of the BICS REST API.

Note: PL/SQL is very good at showcasing functionality. However, it tends to become prohibitively resource intensive when deploying in an enterprise production environment. For the best enterprise deployment, an ETL tool such as Oracle Data Integrator (ODI) should be used to meet these requirements and more:

* Security

* Logging and Error Handling

* Parallel Processing and Performance

* Scheduling

* Code Re-usability and Maintenance

Using Oracle Database Cloud Service

Determining Security Protocol Requirements

If the web service requires a security protocol, key exchange or cypher not supported by the default BICS Schema Database Service, another Oracle Database Cloud Service (DBCS) may be used.

An example security protocol is TLS version 1.2 which is used by the OFSC web service accessed in this post.

Note: For TLSv1.2, specify a database version of 11.2.0.4.10 or greater, or any version of 12c. If the database is not at the required version, PL/SQL may throw the following error: ORA-29259: end-of-input reached

To detect what protocol a web service uses, open the SOAP WSDL page in a browser, click the lock icon, and navigate to the relevant security section. A Chrome example from an OFSC WSDL page is below:

1
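The supported protocol can also be checked from a command line with OpenSSL (a sketch; the hostname is a placeholder). If the handshake succeeds when TLS 1.2 is forced, the endpoint supports it:

openssl s_client -connect hostname:443 -tls1_2 < /dev/null 2>/dev/null | grep -E "Protocol|Cipher"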

Preparing the DBCS

If a DBCS other than the default Schema Service is used, the following steps need to be performed.

Create a BICS user in the database. The use of Jobs and the DBMS_CRYPTO package shown in the example below are discussed later in the post. Example SQL statements are below:

-- USER SQL
CREATE USER "BICS_USER" IDENTIFIED BY password
DEFAULT TABLESPACE "USERS"
TEMPORARY TABLESPACE "TEMP"
ACCOUNT UNLOCK;
-- QUOTAS
ALTER USER "BICS_USER" QUOTA UNLIMITED ON USERS;
-- ROLES
ALTER USER "BICS_USER" DEFAULT ROLE "CONNECT","RESOURCE";
-- SYSTEM PRIVILEGES
GRANT CREATE VIEW TO "BICS_USER";
GRANT CREATE ANY JOB TO "BICS_USER";
-- OBJECT PERMISSIONS
GRANT EXECUTE ON SYS.DBMS_CRYPTO TO BICS_USER;

Create an entry in a new or existing Oracle database wallet for the trusted public certificate used to secure connections to the web service via the Internet. A link to the Oracle Wallet Manager documentation is included in the References section. Note the location and password of the wallet as they are used to issue the SOAP request.

The need for a trusted certificate is detected when the following error occurs: ORA-29024: Certificate validation failure.

An example certificate path found using Chrome browser is shown below. Both of these trusted certificates need to be in the Oracle wallet.

2

Preparing the Database Schema

Two objects need to be created prior to compiling the PL/SQL stored procedure.

The first is a staging table comprising a set of identical columns. This post uses a staging table named QUOTA_STAGING_TABLE. The columns are named consecutively as C01 through Cnn. This post uses 50 staging columns. The SQL used to create this table may be viewed here.

The second is a staging view named QUOTA_STAGING_VIEW built over the staging table. The view column names are the attribute names used in the API WSDL. The SQL used to create this view may be viewed here. The purpose of the view is to relate an attribute name found in the SOAP response to a staging table column based on the view column’s COLUMN_ID in the database. For example, if a response attribute name of bucket_id is detected and the COLUMN_ID of the corresponding view column is 3, then the staging table column populated with the attribute value would be C03.

Ensuring the Web Services are Available

To ensure that the web services are available in the required environment, type a form of the following URL into a browser:

https://hostname/soap/capacity/?wsdl

Note: If you are unable to reach the website, the services may not be offered or the URL may have changed. Discuss this with the service administrator.

Using API Testing Tools

The SOAP Request Envelope should be developed in an API testing tool such as SoapUI or Postman. The XPATH expressions for parsing should be developed and tested in an XPATH expression testing tool such as FreeFormatter. Links to these tools are provided in the References section.

Note: API testing tools such as SoapUI, FreeFormatter, Postman, and so on are third-party tools for using SOAP and REST services. Oracle does not provide support for these tools or recommend a particular tool for its APIs. You can select the tool based on your requirements.

Preparing the SOAP Request

This post uses the get_quota_data method of the Oracle Field Service Cloud Capacity Management API. Additional information about the API is included as a link in the References section.

Use a browser to open the WSDL page for this API. An example URL for the page is: https://hostname/soap/capacity/?wsdl. This page provides important information regarding the request and response envelopes used by the API.

The request envelope is comprised of the following sections. Note: To complete the envelope creation, the sections are concatenated together to provide a single request envelope. An example of a complete request envelope may be viewed here.

Opening

The Opening section is static text as shown below:

<soapenv:Envelope xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:urn="urn:toa:capacity">
<soapenv:Header/>
<soapenv:Body>
<urn:get_quota_data>

User

The User section is dynamic and comprises the following components:

Now

The now component is the current time in the UTC time zone. An example is: <now>2016-12-19T09:13:10+00:00</now>. It is populated by the following command:

SELECT TO_CHAR (SYSTIMESTAMP AT TIME ZONE 'UTC', 'YYYY-MM-DD"T"HH24:MI:SS"+00:00"' ) INTO V_NOW FROM DUAL;

Login

The login component is the user name.

Company

The company component is the company for which data is being retrieved.

Authorization String

The auth_string component is the MD5 hash of the concatenation of the now component with the MD5 hash of the user password. In pseudo-code it would be md5 (now + md5 (password)). It is populated by the following command:

SELECT
LOWER (
DBMS_CRYPTO.HASH (
V_NOW||
LOWER( DBMS_CRYPTO.HASH (V_PASSWORD,2) )
,2
)
)
INTO V_AUTH_STRING FROM DUAL;

Note: ‘2’ is the code for MD5.

An example is:

<auth_string>b477d40346ab40f1a1a038843d88e661fa293bec5cc63359895ab4923051002a</auth_string>

Required Parameters

There are two required parameters: date and resource_id. Each may have multiple entries. However the sample procedure in this post allows only one resource id. It also uses just one date to start with and then issues the request multiple times for the number of consecutive dates requested.

In this post, the starting date is the current date in Sydney, Australia. An example is below:

<date>2016-12-21</date> <resource_id>Test_Resource_ID</resource_id>

The starting date and subsequent dates are populated by this command:

CASE WHEN P_DATE IS NULL
THEN SELECT TO_CHAR (SYSTIMESTAMP AT TIME ZONE 'Australia/Sydney', 'YYYY-MM-DD') INTO P_DATE FROM DUAL;
ELSE P_DATE := TO_CHAR (TO_DATE (P_DATE, 'YYYY-MM-DD') + 1, 'YYYY-MM-DD'); -- Increments the day by 1
END CASE;

Aggregation

The aggregation component specifies whether to aggregate the results. Since BI will do this automatically, aggregation and totals are set to 0 (no). An example is:

<aggregate_results>0</aggregate_results> <calculate_totals>0</calculate_totals>

Field Requests

This section may be passed as a parameter and it lists the various data fields to be included in the extract. An example is below:

<day_quota_field>max_available</day_quota_field>
<time_slot_quota_field>max_available</time_slot_quota_field>
<time_slot_quota_field>quota</time_slot_quota_field>
<category_quota_field>used</category_quota_field>
<category_quota_field>used_quota_percent</category_quota_field>
<work_zone_quota_field>status</work_zone_quota_field>

Closing

The Closing section is static text as shown below:

</urn:get_quota_data>
</soapenv:Body>
</soapenv:Envelope>

Calling the SOAP Request

The APEX_WEB_SERVICE package is used to populate a request header and issue the SOAP request. The header requests that the web service return the contents in a non-compressed text format as shown below:

 

APEX_WEB_SERVICE.G_REQUEST_HEADERS(1).NAME := 'Accept-Encoding';
APEX_WEB_SERVICE.G_REQUEST_HEADERS(1).VALUE := 'identity';

For each date to be processed the SOAP request envelope is created and issued as shown below:

F_XML      := APEX_WEB_SERVICE.MAKE_REQUEST(
P_URL         => F_SOAP_URL
,P_ENVELOPE    => F_REQUEST_ENVELOPE
,P_WALLET_PATH => 'file:wallet location'
,P_WALLET_PWD  => 'wallet password' );

Troubleshooting the SOAP Request Call

Common issues are the need for a proxy, the need for a trusted certificate (if using HTTPS), and the need to use the TLS security protocol. Note: This post uses DBCS so the second and third issues have been addressed.

The need for a proxy may be detected when the following error occurs: ORA-12535: TNS:operation timed out. Adding the optional p_proxy_override parameter to the call may correct the issue. An example proxy override is:

www-proxy.us.oracle.com

 

Parsing the SOAP Response

For each date to be processed the SOAP response envelope is parsed to obtain the individual rows and columns.

The hierarchy levels of the capacity API are listed below:

Bucket > Day > Time Slot > Category > Work Zone

Each occurrence of every hierarchical level is parsed to determine attribute names and values. Both the name and the value are then used to populate a column in the staging table.

When a hierarchical level is completed and no occurrences of a lower level exist, a row is inserted into the BICS staging table.

Below is an example XML response element for one bucket.

<bucket>
<bucket_id>TEST Bucket ID</bucket_id>
<name>TEST Bucket Name</name>
<day>
<date>2016-12-21</date>
<time_slot>
<label>7-10</label>
<quota_percent>100</quota_percent>
<quota>2520</quota>
<max_available>2520</max_available>
<used_quota_percent>0</used_quota_percent>
<category>
<label>TEST Category</label>
<quota_percent>100</quota_percent>
<quota>2520</quota>
<max_available>2340</max_available>
<used_quota_percent>0</used_quota_percent>
</category>
</time_slot>
<time_slot>
<label>10-14</label>
<quota_percent>100</quota_percent>
<quota>3600</quota>
<max_available>3600</max_available>
<used_quota_percent>0</used_quota_percent>
<category>
<label>TEST Category</label>
<quota_percent>100</quota_percent>
<quota>3600</quota>
<max_available>3360</max_available>
<used_quota_percent>0</used_quota_percent>
</category>
</time_slot>
<time_slot>
<label>14-17</label>
<quota_percent>100</quota_percent>
<quota>2220</quota>
<max_available>2220</max_available>
<used_quota_percent>0</used_quota_percent>
<category>
<label>TEST Category</label>
<quota_percent>100</quota_percent>
<quota>2220</quota>
<max_available>2040</max_available>
<used_quota_percent>0</used_quota_percent>
</category>
</time_slot>
</day>
</bucket>

The processing of the bucket element is as follows:

Occurrences 1 and 2 of the bucket level are parsed to return attribute names of bucket_id and name. The bucket_id attribute is used as-is and the name attribute is prefixed with “bucket_” to find the corresponding column_ids in the staging view. The corresponding columns in the staging table, C03 and C04, are then populated.

Occurrence 3 of the bucket level returns the day level element tag. Processing then continues at the day level.

Occurrence 1 of the day level returns the attribute name of date. The attribute name is prefixed with “day_” to find the corresponding column_id in the staging view. The corresponding column in the staging table, C05, is then populated with the value ‘2016-12-21’.

Occurrence 2 of the day level returns the first of three time_slot level element tags. Processing for each continues at the time-slot level. Each time_slot element contains 5 attribute occurrences followed by a category level element tag.

Each category level contains 5 attribute occurrences. Note: there is no occurrence of a work_zone level element tag in the category level. Thus after each category level element is processed, a row is written to the staging table.

The end result is that 3 rows are written to the staging table for this bucket. The table below describes the XML to row mapping for the first row.

Attribute Name | Attribute Value | View Column Name | Table Column Name
bucket_id | TEST Bucket ID | BUCKET_ID | C03
name | TEST Bucket Name | BUCKET_NAME | C04
day | 2016-12-21 | DAY_DATE | C05
label | 7-10 | TIME_SLOT_LABEL | C18
quota_percent | 100 | TIME_SLOT_QUOTA_PERCENT | C19
quota | 2520 | TIME_SLOT_QUOTA | C21
max_available | 2520 | TIME_SLOT_MAX_AVAILABLE | C26
used_quota_percent | 0 | TIME_SLOT_USED_QUOTA_PERCENT | C29
label | TEST Category | CAT_LABEL | C32
quota_percent | 100 | CAT_QUOTA_PERCENT | C33
quota | 2520 | CAT_QUOTA | C35
max_available | 2340 | CAT_MAX_AVAILABLE | C42
used_quota_percent | 0 | CAT_USED_QUOTA_PERCENT | C44

 

In PL/SQL, the processing is accomplished using the LOOP command. There is a loop for each hierarchical level. Loops end when no results are returned for a parse statement.

XPATH statements are used for parsing. Additional information regarding XPATH statements may be found in the References section. Examples are below:

Statement Returns
/bucket[5] The entire fifth bucket element in the response. If no results then all buckets have been processed.
/bucket/*[1] The first bucket attribute or element name.
/bucket/*[2]/text() The second bucket attribute value.
/bucket/day/*[6] The sixth day attribute or element name.
/bucket/day[1]/*[6]/text() The sixth day attribute value.
/bucket/day/time_slot[2]/*[4] The fourth attribute or element name of the second time_slot.

 

Scheduling the Procedure

The procedure may be scheduled to run periodically through the use of an Oracle Scheduler job. A link to the Scheduler documentation may be found in the References section.

A job is created using the CREATE_JOB procedure by specifying a job name, type, action and a schedule. Setting the enabled argument to TRUE enables the job to automatically run according to its schedule as soon as you create it.

An example of a SQL statement to create a job is below:

BEGIN
DBMS_SCHEDULER.CREATE_JOB (
JOB_NAME        => 'OFSC_SOAP_QUOTA_EXTRACT',
JOB_TYPE        => 'STORED_PROCEDURE',
ENABLED         => TRUE,
JOB_ACTION      => 'BICS_OFSC_SOAP_INTEGRATION',
START_DATE      => '21-DEC-16 10.00.00 PM Australia/Sydney',
REPEAT_INTERVAL => 'FREQ=HOURLY; INTERVAL=24' -- this will run the job every 24 hours
);
END;
/

Note: If using the BICS Schema Service database, the package name is CLOUD_SCHEDULER rather than DBMS_SCHEDULER.

The job log and status may be queried using the *_SCHEDULER_JOBS views. Examples are below:

SELECT JOB_NAME, STATE, NEXT_RUN_DATE from USER_SCHEDULER_JOBS;
SELECT LOG_DATE, JOB_NAME, STATUS from USER_SCHEDULER_JOB_LOG;

 

Summary

This post detailed a method of extracting and loading data from Oracle Field Service Cloud (OFSC) into the Oracle Business Intelligence Cloud Service (BICS).

The post used SOAP web services to extract the XML-formatted data responses. It used a PL/SQL Stored Procedure to wrap the SOAP extract, XML parsing commands, and database table operations. It loaded a BICS staging table and a staging view which can be transformed into star-schema object(s) for use in modeling.

Finally, an example of a database job was provided that executes the Stored Procedure on a scheduled basis.

For more BICS and BI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

References

Text of Complete Procedure

OFSC Capacity API Document

OFSC Capacity API WSDL

Scheduling Jobs with Oracle Scheduler

Database PL/SQL Language Reference

Reference Guide for the APEX_WEB_SERVICE

Soap API Testing Tool

XPATH Testing Tool

Base64 Decoding and Encoding Testing Tool

Using Oracle Wallet Manager

Oracle Business Intelligence Cloud Service Tasks

 


Loading Data from Oracle Identity Cloud Service into Oracle BI Cloud Service using REST


Introduction

This post details a method of extracting and loading data from Oracle Identity Cloud Service (IDCS) into the Oracle Business Intelligence Cloud Service (BICS). It builds upon the A-team post IDCS Audit Event REST API which details the REST API calls used.

One use case for this method is for analyzing trends regarding audit events.

This post uses REST web services to extract JSON-formatted data responses. It also uses the PL/SQL language to wrap the REST extract, JSON parsing commands, and database table operations in a Stored Procedure. It produces a BICS staging table which can then be transformed into star-schema object(s) for use in modeling. The transformation processes and modeling are not discussed in this post.

Finally, an example of a database job is provided that executes the Stored Procedure on a scheduled basis.

The PL/SQL components are for demonstration purposes only and are not intended for enterprise production use. Additional detailed information, including the complete text of the PL/SQL procedure described, is included in the References section at the end of this post.

Rationale for Using PL/SQL

PL/SQL is the only procedural tool that runs on the BICS / Database Schema Service platform. Other wrapping methods e.g. Java, ETL tools, etc. require a platform outside of BICS to run on.

PL/SQL may also be used in a DBaaS (Database as a Service) that is connected to BICS.

PL/SQL can utilize native SQL commands to operate on the BICS tables. Other methods require the use of the BICS REST API.

Note: PL/SQL is very good at showcasing functionality. However, it tends to become prohibitively resource intensive when deployed in an enterprise production environment. For the best enterprise deployment, an ETL tool such as Oracle Data Integrator (ODI) should be used to meet these requirements and more:

* Security

* Logging and Error Handling

* Parallel Processing – Performance

* Scheduling

* Code Re-usability and Maintenance

Using Oracle Database as a Service

Determining Security Protocol Requirements

If the web service requires a security protocol, key exchange or cypher not supported by the default BICS Schema Database Service, another Oracle Database Cloud Service (DBaaS) may be used.

Note: For the most consistent response, specify a database version of 11.2.0.4.10 or greater, or any version of 12c. If the database is not at the required version, PL/SQL may throw the following error: ORA-29259: end-of-input reached

To detect what protocol a web service uses, open the IDCS Login page in a browser, click the lock icon, and navigate to the relevant security section. A Chrome example from an IDCS Login page is below:

1

Preparing the DBaaS

If DBaaS is used, the following steps need to be performed.

Creating the BICS User

Create a BICS user in the database. The use of the Job privilege is discussed later in the post. Example SQL statements are below:

-- USER SQL
CREATE USER "BICS_USER" IDENTIFIED BY password
DEFAULT TABLESPACE "USERS"
TEMPORARY TABLESPACE "TEMP"
ACCOUNT UNLOCK;
-- QUOTAS
ALTER USER "BICS_USER" QUOTA UNLIMITED ON USERS;
-- ROLES
ALTER USER "BICS_USER" DEFAULT ROLE "CONNECT","RESOURCE";
-- SYSTEM PRIVILEGES
GRANT CREATE VIEW TO "BICS_USER";
GRANT CREATE ANY JOB TO "BICS_USER";

Managing Trusted Certificates

Create an entry in a new or existing Oracle database wallet for the trusted public certificate used to secure connections to the web service via the Internet. A link to the Oracle Wallet Manager documentation is included in the References section. Note the location and password of the wallet as they are used to issue the REST request.

The need for a trusted certificate is detected when the following error occurs: ORA-29024: Certificate validation failure.

An example certificate path found using Chrome browser is shown below. Both of these trusted certificates need to be in the Oracle wallet.

2

Granting Network Access

This post uses the UTL_HTTP package which requires the user to have permission to access web services via an Access Control List (ACL).

The need for an ACL privilege is detected when the following error occurs: ORA-24247: network access denied by access control list (ACL).

Grant the BICS_USER authority to connect to the network access control list (ACL). To determine your unique network ACL name run the following:

SELECT * FROM DBA_NETWORK_ACLS;

Using the network name from above run the following:

BEGIN
DBMS_NETWORK_ACL_ADMIN.ADD_PRIVILEGE(acl => 'NETWORK_ACL_YourUniqueSuffix',
principal => 'BICS_USER',
is_grant  => true,
privilege => 'connect');
END;
/
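
If the DBA_NETWORK_ACLS query above returns no rows, an ACL may first need to be created and assigned to the web service host. A minimal sketch is below; the ACL file name and host value are illustrative assumptions and should be adjusted for your environment:

BEGIN
DBMS_NETWORK_ACL_ADMIN.CREATE_ACL(
acl         => 'bics_user_acl.xml',   -- illustrative ACL name
description => 'Web service access for BICS_USER',
principal   => 'BICS_USER',
is_grant    => TRUE,
privilege   => 'connect');
DBMS_NETWORK_ACL_ADMIN.ASSIGN_ACL(
acl  => 'bics_user_acl.xml',
host => 'idcs-hostname');             -- illustrative host
COMMIT;
END;
/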

 

Preparing the Database Schema

A staging table needs to be created prior to compiling the PL/SQL stored procedure.

This post uses a staging table named AUDIT_EVENT. The columns are those chosen from the REST API for Oracle Identity Cloud Service. A link to the document may be found in the References section. This post uses the following columns:

ACTOR_DISPLAY_NAME
ACTOR_ID
ACTOR_NAME
ACTOR_TYPE
ADMIN_REF_RESOURCE_NAME
ADMIN_RESOURCE_NAME
EC_ID
EVENT_ID
ID
MESSAGE
SSO_COMMENTS
SSO_PROTECTED_RESOURCE
SSO_USER_AGENT
TIMESTAMP

The SQL used to create this table may be viewed here.
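
For reference, a minimal sketch of such a staging table is shown below. The data types and lengths are assumptions for illustration only; the SQL linked above is the authoritative definition.

CREATE TABLE AUDIT_EVENT (
ACTOR_DISPLAY_NAME      VARCHAR2(256),
ACTOR_ID                VARCHAR2(256),
ACTOR_NAME              VARCHAR2(256),
ACTOR_TYPE              VARCHAR2(64),
ADMIN_REF_RESOURCE_NAME VARCHAR2(256),
ADMIN_RESOURCE_NAME     VARCHAR2(256),
EC_ID                   VARCHAR2(256),
EVENT_ID                VARCHAR2(256),
ID                      VARCHAR2(256),
MESSAGE                 VARCHAR2(4000),
SSO_COMMENTS            VARCHAR2(4000),
SSO_PROTECTED_RESOURCE  VARCHAR2(1024),
SSO_USER_AGENT          VARCHAR2(1024),
TIMESTAMP               VARCHAR2(64)
);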

Using API Testing Tools

The REST requests should be developed in API testing tools such as SoapUI and Postman. The JSON expressions for parsing should be developed and tested in a JSON expression testing tool such as CuriousConcept. Links to these tools are provided in the References section.

Note: API testing tools such as SoapUI, CuriousConcept, Postman, and so on are third-party tools for using SOAP and REST services. Oracle does not provide support for these tools or recommend a particular tool for its APIs. You can select the tool based on your requirements. As a starting point and for some examples refer to the A-Team post IDCS OAuth 2.0 and REST API.

Preparing and Calling the IDCS REST Service

This post uses the AuditEvents and Token methods of the IDCS REST API.

Preparing the Token Request

IDCS uses the OAuth 2.0 framework for authorization. This requires an access token to be requested and provided via the Token method of the API.

Before preparing the REST request, a Web Application needs to be created in IDCS. This administrative function is not covered in this post. You will need the Client ID and the Client Secret generated with the web application.

You must encode the Client ID and Client Secret when you include them in a request for an access token. A Base64 encoding tool, such as the one listed in the References section, may be used to perform this step. Place the Client ID and Client Secret on the same line, insert a colon between them (clientid:clientsecret), and then encode the string. An example encoded result is

Y2xpZW50aWQ6Y2xpZW50c2VjcmV0
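
The encoding may also be performed directly in PL/SQL. A minimal sketch is below; the clientid:clientsecret value is a placeholder, and note that UTL_ENCODE inserts line breaks into long results, which should be removed before use:

SELECT utl_raw.cast_to_varchar2(
         utl_encode.base64_encode(
           utl_raw.cast_to_raw('clientid:clientsecret'))) AS encoded_credentials
FROM dual;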

You will need the wallet path and password discussed in the Preparing the DBaaS section above. An example path from a linux server is:

/u01/app/oracle

You will need the URL for the Token method of the API, such as:

https://idcs-hostname/oauth2/v1/token

The APEX_WEB_SERVICE package is used to set the headers and parameters described below.

Two HTTP request headers are needed. The first is a Content-Type header and the second is an Authorization header. The authorization header value is the concatenation of the string ‘Basic ‘ with the Base64 encoded result of the Client ID and the Client Secret as shown below:

v_authorization_token := 'Y2xpZW50aWQ6Y2xpZW50c2VjcmV0';
apex_web_service.g_request_headers(1).name := 'Content-Type';
apex_web_service.g_request_headers(1).value := 'application/x-www-form-urlencoded; charset=UTF-8';
apex_web_service.g_request_headers(2).name := 'Authorization';
apex_web_service.g_request_headers(2).value := 'Basic '||v_authorization_token;

The parameter method is set to POST and two HTTP request parameters are needed. The first is a grant_type and the second is a scope as shown below:

p_http_method => 'POST',
p_parm_name => apex_util.string_to_table('grant_type:scope'),
p_parm_value => apex_util.string_to_table('client_credentials~urn:opc:idm:__myscopes__','~')

Note: The urn:opc:idm:__myscopes__ in the scope parameter value is used as a tag by Oracle Identity Cloud Service clients requesting access tokens from the OAuth authorization server. Access tokens are returned that contain all applicable Oracle Identity Cloud Service scopes based on the privileges represented by the Oracle Identity Cloud Service administrator roles granted to the requesting client.

Calling the Token Request

The APEX_WEB_SERVICE package is used to call the request and store the result in a CLOB variable as shown below:

l_ws_response_clob := apex_web_service.make_rest_request (
p_url => l_ws_url,
p_http_method => 'POST',
p_parm_name => apex_util.string_to_table('grant_type:scope'),
p_parm_value => apex_util.string_to_table('client_credentials~urn:opc:idm:__myscopes__','~')
,p_wallet_path => 'file:/u01/app/oracle'
,p_wallet_pwd => 'password'
);

The result of the call is shown below with a partial token. The token is actually over 2,000 characters long.

{"access_token":"eyJ4NXQjUzI1NiI6Ijg1a3E1M… ", "token_type":"Bearer","expires_in":3600}

Note: The response includes the expires_in:3600 parameter. This means that your token is no longer valid after one hour from the time that you generate it.

Parsing the Token Response

The APEX_JSON package is used to parse the token response and store the result in a VARCHAR variable as shown below. Additional information about this package is included as a link in the References section.

apex_json.parse(l_ws_response_clob);
f_idcs_token := apex_json.get_varchar2(p_path => 'access_token');

The result of the parse is just the token itself which is used to prepare the Audit Events request.

Preparing the Audit Events Request

The Audit Events request is prepared two or more times: once to get a first response containing one event that has a field holding the total number of events, and then one or more times to retrieve all of the events.

IDCS has a limit of how many events are returned for each request. This post uses 500 as a chunk size value which may be modified. Check with the web services administrator for the maximum number of events per request. Also ensure that the number of events inserted into the BICS table equals the total number found in the initial response.

The number of subsequent requests needed is calculated as the total number of events divided by the chunk size, rounded up to the nearest integer. For example 614 events divided by 500 would result in two subsequent requests needed.
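
A minimal sketch of this calculation is below; the variable names are assumptions based on the snippets later in this post:

-- 614 total events with a chunk size of 500 yields CEIL(614 / 500) = 2 requests
v_num_requests := CEIL(TO_NUMBER(v_resultSet) / v_chunkSize);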

The UTL_HTTP package is used instead of the APEX_WEB_SERVICE package to avoid a limitation of 1,024 characters on the length of a header value. The access token is used in a header value and is over 2,000 characters. The error received with the APEX_WEB_SERVICE call is: ORA-06502: PL/SQL: numeric or value error: character string buffer too small.

Preparing All Requests

All requests need to have the following:

The wallet path and password specified. These are specified globally as shown below:

utl_http.set_wallet('file:/u01/app/oracle', 'password'); -- For Trusted Certificates

Persistent connection support enabled as shown below:

utl_http.set_persistent_conn_support(FALSE, 1); -- Set default persistent connections (1)

Begin the request as shown below:

req := utl_http.begin_request(l_ws_url, 'GET', 'HTTP/1.1');

Note: The result is stored in a variable named req which is of the req type defined in the UTL_HTTP package as shown below:

-- A PL/SQL record type that represents a HTTP request
TYPE req IS RECORD (
url VARCHAR2(32767 byte), -- Requested URL
method VARCHAR2(64), -- Requested method
http_version VARCHAR2(64), -- Requested HTTP version
private_hndl PLS_INTEGER -- For internal use only
);

The three HTTP headers that are set are shown below:

utl_http.set_header(REQ, 'Content-Type', 'application/scim+json');
utl_http.set_header(REQ, 'Cache-Control', 'no-cache');
utl_http.set_header(REQ, 'Authorization', 'Bearer ' || l_idcs_token); -- The received access token

All but the last need persistent connection support as shown below:

utl_http.set_persistent_conn_support(req, TRUE); -- Keep Connection Open

Note: The last request does not have the above setting so will default to FALSE and the connection to the service will be closed.

Preparing Individual Requests

Individual requests need to have the following:

The URL set as shown below:

l_ws_url := 'https://idcs-hostname/admin/v1/AuditEvents?count=1'; -- Get first event for total event count

Subsequent URLs are as shown below:

l_ws_url := 'https://idcs-hostname/admin/v1/AuditEvents?count=500&startIndex=1&sortBy=timestamp';

Note: subsequent requests need the startIndex parameter incremented by the chunk size (500).
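
A minimal sketch of building the URL for each subsequent request is below; the hostname and variable names are illustrative assumptions:

v_start_index := ((v_request_number - 1) * v_chunkSize) + 1;   -- 1, 501, 1001, ...
l_ws_url := 'https://idcs-hostname/admin/v1/AuditEvents?count=' || v_chunkSize
         || '&startIndex=' || v_start_index
         || '&sortBy=timestamp';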

Calling the Audit Events Request

The Audit Events requests are called using the UTL_HTTP package as shown below:

resp := utl_http.get_response(req);

Note: The result is stored in a variable named resp which is of the resp type defined in the UTL_HTTP package as shown below:

— A PL/SQL record type that represents a HTTP response
TYPE resp IS RECORD (
status_code PLS_INTEGER, — Response status code
reason_phrase VARCHAR2(256), — Response reason phrase
http_version VARCHAR2(64), — Response HTTP version
private_hndl PLS_INTEGER — For internal use only
);

Troubleshooting the REST Request Calls

Common issues are the need for a proxy, the need for an ACL, the need for a trusted certificate (if using HTTPS), and the need to use the correct TLS security protocol. Note: This post uses DBaaS, so all but the first issue have been addressed.

The need for a proxy may be detected when the following error occurs: ORA-12535: TNS:operation timed out. Adding the optional p_proxy_override parameter to the call may correct the issue. An example proxy override is:

www-proxy.us.oracle.com
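
A minimal sketch of adding the proxy to the token request, and of setting it for the UTL_HTTP calls, is below; the proxy value shown is only an example:

-- APEX_WEB_SERVICE call (token request) with the optional proxy parameter
l_ws_response_clob := apex_web_service.make_rest_request (
p_url            => l_ws_url,
p_http_method    => 'POST',
p_parm_name      => apex_util.string_to_table('grant_type:scope'),
p_parm_value     => apex_util.string_to_table('client_credentials~urn:opc:idm:__myscopes__','~'),
p_wallet_path    => 'file:/u01/app/oracle',
p_wallet_pwd     => 'password',
p_proxy_override => 'www-proxy.us.oracle.com');

-- UTL_HTTP calls (Audit Events requests) use a globally set proxy instead
utl_http.set_proxy('www-proxy.us.oracle.com');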

Parsing the Audit Event Responses

The APEX_JSON package is used to parse the responses.

Before parsing begins the staging table is truncated as shown below:

execute immediate 'truncate table audit_event';

An example of a response containing just one event is below:

{"schemas":["urn:scim:api:messages:2.0:ListResponse"]
,"totalResults":614
,"Resources":[
{"eventId":"sso.authentication.failure"
,"ssoProtectedResource":"https://idcs-hostname:443/ui/v1/myconsole"
,"actorName":"user.name@oracle.com"
,"ssoIdentityProvider":"localIDP"
,"ssoCSR":"false"
,"ssoUserPostalCode":"null"
,"ssoUserCity":"null"
,"reasonValue":"SSO-1018"
,"ssoUserCountry":"null"
,"rId":"0:1:3:2:4"
,"message":"Authentication failure User not found."
,"timestamp":"2016-10-04T09:38:46.336Z"
,"ssoComments":"Authentication failure User not found."
,"ssoApplicationHostId":"idcs-hostname"
,"ssoUserState":"null"
,"ecId":"q^Unq0s8000000000"
,"ssoRp":"IDCS"
,"ssoLocalIp":"10.196.29.102"
,"serviceName":"SSO"
,"ssoAuthnLevel":0
,"actorType":"User"
,"ssoSessionId":"null"
,"ssoUserAgent":"Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/53.0.2785.143 Safari/537.36"
,"actorId":"IDCS"
,"id":"0a37c7374c494ed080d15c554ae75be8"
,"meta": {"created":"2016-10-04T09:38:46.353Z"
,"lastModified":"2016-10-04T09:38:46.353Z"
,"resourceType":"AuditEvent"
,"location":"https://idcs-hostname/admin/v1/AuditEvents/0a37c7374c494ed080d15c554ae75be8"}
,"schemas":["urn:ietf:params:scim:schemas:oracle:idcs:AuditEvent"]
,"idcsCreatedBy": {"value":"UnAuthenticated"
,"$ref":"https://idcs-hostname/admin/v1/AuditEvents/UnAuthenticated"}
,"idcsLastModifiedBy": {"value":"UnAuthenticated"
,"$ref":"https://idcs-hostname/admin/v1/AuditEvents/UnAuthenticated"}
}],"startIndex":1,"itemsPerPage":1}

 

Parsing the First Response

The first JSON response of one event is read into a varchar variable as shown below:

utl_http.read_text(resp, l_ws_response_varchar, 32766);

The variable is then parsed as shown below:

apex_json.parse(l_ws_response_varchar);

Note: the above result is implicitly stored in a global package array named g_values. This array contains the JSON members and values.

The value of the JSON member named totalResults is retrieved and stored in a variable as shown below:

v_resultSet := apex_json.get_varchar2(p_path => 'totalResults');

This is the total number of events to be retrieved and is all that is wanted from the first response.

Parsing the Subsequent Responses

Subsequent Responses may contain a number of events up to the setting of the chunk size (500 in this post). These responses will need to be stored in a temporary CLOB variable.

The DBMS_LOB package is used to manage the temporary CLOB variable. Additional information about the package may be found in the References section.

This variable is created at the beginning of the parsing and freed at the end of the procedure as shown below:

dbms_lob.createtemporary(l_ws_response_clob, true);
dbms_lob.freetemporary(l_ws_response_clob);

This variable is also trimmed to zero characters at the beginning of each chunk of events using the following:

DBMS_LOB.TRIM (l_ws_response_clob, 0);

The response is read by a LOOP command. Each iteration of the loop reads 32,766 characters of text and appends these to the temporary CLOB variable as shown below:

while not(EOB)
LOOP
BEGIN
utl_http.read_text(resp, l_ws_response_varchar, 32766);
if l_ws_response_varchar is not null and length(l_ws_response_varchar)>0 then
dbms_lob.writeappend(l_ws_response_clob, length(l_ws_response_varchar), l_ws_response_varchar);
end if;
EXCEPTION
WHEN utl_http.end_of_body THEN
EOB := TRUE;
utl_http.end_response(resp);
END;
END LOOP;

The CLOB result is then parsed into the implicit package array of JSON elements and values as shown below. This array contains a number of events equal to or less than the chunk size setting (500).

apex_json.parse(l_ws_response_clob);

Each event in the array is retrieved, has its columns parsed, and is inserted into the BICS staging table as shown below:

for i in 1..v_chunkSize LOOP
v_loadCount := v_loadCount + 1;
IF v_loadCount > v_resultSet THEN NULL;
ELSE
INSERT
INTO AUDIT_EVENT
(
EVENT_ID,
ID,
ACTOR_ID,
ADMIN_REF_RESOURCE_NAME,
ACTOR_NAME,
ACTOR_DISPLAY_NAME,
MESSAGE,
SSO_COMMENTS,
SSO_PROTECTED_RESOURCE,
SSO_USER_AGENT,
TIMESTAMP,
ACTOR_TYPE,
ADMIN_RESOURCE_NAME,
EC_ID
)
VALUES
(
apex_json.get_varchar2(p_path => 'Resources[' || i || '].eventId')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].id')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].actorId')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].adminRefResourceName')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].actorName')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].actorDisplayName')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].message')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].ssoComments')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].ssoProtectedResource')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].ssoUserAgent')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].timestamp')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].actorType')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].adminResourceName')
,apex_json.get_varchar2(p_path => 'Resources[' || i || '].ecId')
);
v_row_count := v_row_count + 1;
END IF;
END LOOP;

After the last chunk of events is processed the procedure terminates.
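
As noted earlier, the number of rows loaded should match the totalResults value from the first response. A simple check such as the one below may be used; in the example above the count should be 614:

SELECT COUNT(*) AS loaded_events FROM audit_event;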

Scheduling the Procedure

The procedure may be scheduled to run periodically through the use of an Oracle Scheduler job. A link to the Scheduler documentation may be found in the References section.

A job is created using the DBMS_SCHEDULER.CREATE_JOB procedure by specifying a job name, type, action and a schedule. Setting the enabled argument to TRUE enables the job to automatically run according to its schedule as soon as you create it.

An example of a SQL statement to create a job is below:

BEGIN
dbms_scheduler.create_job (
job_name => 'IDCS_REST_AUDIT_EXTRACT',
job_type => 'STORED_PROCEDURE',
enabled => TRUE,
job_action => 'BICS_IDCS_REST_INTEGRATION',
start_date => '21-DEC-16 10.00.00 PM Australia/Sydney',
repeat_interval => 'freq=hourly;interval=24' -- this will run once every 24 hours
);
END;
/

Note: If using the BICS Schema Service database, the package name is CLOUD_SCHEDULER rather than DBMS_SCHEDULER.

The job log and status may be queried using the *_SCHEDULER_JOBS views. Examples are below:

SELECT JOB_NAME, STATE, NEXT_RUN_DATE from USER_SCHEDULER_JOBS;
SELECT LOG_DATE, JOB_NAME, STATUS from USER_SCHEDULER_JOB_LOG;

Summary

This post detailed a method of extracting and loading data from Oracle Identity Cloud Service (IDCS) into the Oracle Business Intelligence Cloud Service (BICS).

The post used REST web services to extract the JSON-formatted data responses. It used a PL/SQL Stored Procedure to wrap the REST extract, JSON parsing commands, and database table operations. It loaded a BICS staging table which can be transformed into star-schema object(s) for use in modeling.

Finally, an example of a database job was provided that executes the Stored Procedure on a scheduled basis.

For more BICS and BI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

References

Complete Procedure

REST API for Oracle Identity Cloud Service

Scheduling Jobs with Oracle Scheduler

Database PL/SQL Language Reference

APEX_WEB_SERVICE Reference Guide

APEX_JSON Reference Guide

UTL_HTTP Package Reference Guide

Soap API Testing Tool

Curious Concept JSON Testing Tool

Base64 Decoding and Encoding Testing Tool

Using Oracle Wallet Manager

Oracle Business Intelligence Cloud Service Tasks

DBMS_LOB Reference Guide

 

Oracle Data Integrator Best Practices: Using Reverse-Engineering on the Cloud and on Premises


Introduction

 

This article discusses best practices on using the Reverse Engineering features of Oracle Data Integrator (ODI) on the cloud and on premises.  The first part of this article presents the various options available in ODI to reverse-engineer metadata from a data server.  Then, the article discusses performance considerations when running and executing reverse-engineering tasks.  The last section of this article discusses the ODI reverse-engineering best practices.

 

Oracle Data Integrator Best Practices:  Using Reverse-Engineering on the Cloud and on Premises

 

In ODI, reverse-engineering is the process of selecting metadata from a data server and populating the selected metadata into an ODI model.  An ODI model contains objects or datastores such as tables, views, queues, and synonyms.  ODI models also contain attributes, keys, and constraints for each datastore.   An ODI model is connected to an ODI logical schema of a given ODI technology.

Figure 1, below, shows an example of an ODI model called Staging.  This ODI model is connected to an ODI logical schema called Oracle Warehouse – Staging.  The ODI technology for this logical schema is Oracle.  The ODI reverse-engineering options are located on the menu options of the ODI Model screen, as illustrated on Figure 1, below.

 

Figure 1 – ODI Reverse-Engineering – ODI Model

There are two reverse-engineering options in ODI:  Reverse Engineer, and Selective Reverse-Engineering.  The following section of this article discusses these two options.

 

Using the ODI Reverse Engineer Option

 

Figure 2, below, shows the ODI Reverse Engineer option for the ODI model called Staging.  This option offers two ways of performing a reverse-engineering task: Standard and Customized.  Figure 2, below, shows the Standard option.  The Standard option is the default option; it provides basic reverse-engineering capabilities – users can retrieve a minimum set of attributes with this option.

 

Figure 2 – ODI Reverse-Engineering – Standard Option

Tip

When using the Standard option, the reverse-engineering task can only be executed with a local agent – the default agent of the ODI Studio.

The Standard option can filter the selection of metadata by object type.  In this example, on Figure 2, above, the selected object type is Table; thus, only Oracle tables will be reverse-engineered.  The Mask option provides additional filtering capabilities.  In this example, the reverse-engineer task only brings the Oracle tables starting with a name of STG, followed by any additional characters – the percent wildcard (%) specifies any characters.

In this example, on Figure 2, above, the Standard option retrieves the name of the Oracle tables, the table attributes, and the table constraints.  The table attributes include the column names, the column types, and the column lengths.    The table constraints include primary keys, unique keys, foreign keys, and check constraints.

Figure 3, below, shows a list of datastores for this ODI model: STG_CUSTOMER, STG_ORDERS, STG_PRODUCT, and STG_STATUS.  These datastores have been reverse-engineered with the Standard option.  The attributes of the datastore called STG_CUSTOMER are also illustrated on Figure 3, below:

 

Figure 3 – ODI Reverse-Engineering – ODI Data Stores

The Standard option uses the Java Database Connectivity (JDBC) API to retrieve metadata from a data server.  The JDBC API is the industry standard for database-independent connectivity between Java applications and a wide range of databases – the ODI Studio is a Java application.  The Standard option has an extensive number of features, but it can only retrieve a limited set of metadata due to the limitations of the JDBC API driver.  For instance, if an Oracle table is partitioned, the Standard option cannot reverse-engineer the partitions of a table because the JDBC API driver does not support the selection of table partitions.  On the other hand, the Customized option provides additional features, and it can retrieve additional metadata such as table partitions from a data server.  The Customized option requires a Reverse-Engineering Knowledge Module (RKM), which can be customized to perform additional tasks.  When using the Customized option, the reverse-engineering task can be executed with the local agent of the ODI Studio (default), or with any agent configured in the ODI Topology.  Figure 4, below, shows the Customized option:

 

Figure 4 – ODI Reverse-Engineering – Customized Option

In this example, on Figure 4, above, the Customized option uses a logical agent called OracleDIAgent-JCS.  This ODI agent is a J2EE agent, located on an instance of the Oracle Java Cloud Service (JCS).  In this example, the type of object to reverse-engineer is Table, and the Mask option has been set to STG% – all tables starting with a prefix of STG will be reverse-engineered.  The RKM for this reverse-engineering task is the RKM Oracle.  Also, the options for this RKM are illustrated on Figure 4, above.

Figure 5, below, shows a list of tasks for this RKM.  Some of the RKM tasks include retrieving partitions, foreign keys (FK), index keys, table conditions, and other metadata from the Oracle database.

 

Figure 5 – ODI Reverse-Engineering – RKM Oracle Tasks

When the Customized option is used for a reverse-engineer task, the ODI agent executes the code generated by the RKM, and the ODI model gets populated with metadata from the data server.  Figure 6, below, shows the Partitions screen of an ODI datastore called W_ORDERS_F.  In this example, the partitions for this datastore – an Oracle partitioned table – have been populated using the RKM Oracle.

 

Figure 6 – ODI Reverse-Engineering – Datastore Partitions

 

Tip

The RKM tasks and options depend on the available features of a given technology.  For additional information on RKMs, go to “Introduction to Oracle Data Integrator Knowledge Modules.”

 

Using the ODI Selective Reverse-Engineering Option

 

The Selective Reverse-Engineering option, illustrated on Figure 7, below, offers additional capabilities such as selectively reverse-engineering new datastores and existing datastores.  This option works in conjunction with the Standard option, and it is only available if the Standard option is selected in the Reverse Engineer tab.  This option allows users to select from a list of objects before executing the reverse-engineering task.

 

Figure 7 – ODI Reverse-Engineering – Selective Reverse-Engineering

Figure 7, above, shows a list of objects to be reverse-engineered: STG_CUSTOMER, STG_ORDERS, STG_PRODUCT, and STG_STATUS.  These objects are Oracle tables that the Selective Reverse-Engineering option found when the Objects to Reverse Engineer check-box was selected.  The objects listed on Figure 7, above, are the result of the filters put in place in the Reverse Engineer tab.

If these objects already exist in the ODI model, and the Reverse Engineer Execution button is clicked, the metadata for the existing objects will be updated.  If the objects are new, they will be added to the ODI model.

 

ODI Reverse-Engineering Considerations

 

The execution time of a reverse-engineering task depends on several factors.  For instance, a large number of tables and columns may take longer to reverse-engineer than a small set of tables or columns.  Also, the location of the ODI Studio and the type of agent used for the reverse-engineering task may also have an impact on the overall execution time.  For instance, let’s assume that an ODI user wants to reverse-engineer a set of Oracle tables located on an instance of DBCS.  Also, let’s assume that the ODI repository is located on another instance of DBCS.  Let’s assume that the ODI user selects the Standard option to execute a reverse-engineer task, and the task is executed from the premises of the ODI user.  Under this scenario, executing the reverse-engineer task from the premises of the ODI user is not a recommended strategy.  Figure 8, below, shows an example of this unfavorable practice:

 

Figure 8 – ODI Reverse-Engineering – On-Premise ODI Studio

In this example, on Figure 8, above, the selected metadata must be exported from Instance A – where the source data server is located – to the premises of the ODI user – where the ODI Studio is located.  Then, the local agent must upload the selected metadata from the ODI Studio into Instance B – where the ODI repository is located.  This strategy does not offer the best performance, since the content of the selected metadata must travel from the cloud to the premises of the user, and then back to the cloud.

The best strategy is to execute the reverse-engineering task from an instance of the ODI Studio that is running on the Oracle Cloud such as the Oracle Java Cloud Service (JCS) or the Oracle Compute Cloud Service (CCS).  Figure 9, below, shows an example:

 

Figure 9 – ODI Reverse-Engineering – ODI Studio on JCS

This strategy, shown on Figure 9, above, offers the best performance when performing reverse-engineering tasks between database cloud services.  The same strategy can be applied to other SQL databases that are on other cloud services.

 

ODI Reverse-Engineering Best Practices

 

When using the reverse-engineering features of ODI, follow these rules of thumb:

 

  • Use the reverse-engineering Standard option if a minimum set of attributes – such as table name, table columns, and table constraints – are needed for an ODI model.
  • Use the reverse-engineering Customized option if the Standard option does not populate the required metadata due to the limitations of the JDBC API driver.
  • When using the Customized option, select and import the RKM that supports the technology of the ODI model.  Get familiar with the options and steps of the selected RKM before using it.  If a RKM needs to be modified for additional tasks, rename the RKM and document the changes.
  • When performing the reverse-engineering task, use the Mask option to filter the number of objects to be reverse-engineered.  The Mask option is available on both the Standard option and the Customized option.
  • When the Standard option is used for a reverse-engineering task, run the ODI Studio in a location that is near both data servers: the source data server and the ODI repository data server.  This will reduce the amount of time it may take to populate the ODI model with the desired metadata.
  • Use the best available ODI agent when performing a reverse-engineering task.  The best available ODI agent is the one that is located the closest to both the source data server and the ODI repository data server.
  • When performing a reverse-engineering task, ensure the connection-user executing the task has the necessary privileges to access the metadata from the source data server.  The connection-user is the user configured in the physical data server of the ODI Topology.  If the reverse-engineering task completes successfully but does not populate any metadata into the ODI model, the connection-user may not have the necessary privileges to read and access the metadata from the data server.

Conclusion

 

The ODI reverse-engineering features offer a mechanism to retrieve metadata from a data server and to populate the metadata into an ODI model.  This metadata can then be used in ODI mappings to build data integration tasks.  The ODI reverse-engineering features offer various options – Standard and Customized – to reverse engineer objects from a data server.   The Standard option leverages the JDBC driver to retrieve metadata from a data server.   The Customized option leverages RKMs to retrieve and populate additional metadata that cannot be retrieved with the Standard option.  These RKMs offer additional features and options to reverse-engineer additional metadata from a data server.

For more Oracle Data Integrator best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for Oracle Data Integrator (ODI).

 

ODI Related Articles

Integrating Oracle Data Integrator (ODI) On-Premise with Cloud Services

Connect ODI to Oracle Database Cloud Service (DBCS)

ODI 12c and DBaaS in the Oracle Public Cloud

Oracle Platform as a Service (PaaS)

Infrastructure as a Service (IaaS)

Oracle Storage Cloud Service (SCS)

Applications as a Service (SaaS)

Oracle Database Cloud Service (DBCS)

Using Oracle Database Schema Cloud Service

Oracle Exadata Cloud Service (ExaCS)

Loading Data into the Oracle Database in an Exadata Cloud Service Instance

 

Loading Data from Oracle Field Service Cloud into Oracle BI Cloud Service using REST


Introduction

This post details a method of extracting and loading data from Oracle Field Service Cloud (OFSC) into the Oracle Business Intelligence Cloud Service (BICS) using RESTful services. It is a companion to the A-Team post Loading Data from Oracle Field Service Cloud into Oracle BI Cloud Service using SOAP. Both this post and the SOAP post offer methods to complement the standard OFSC Daily Extract described in Oracle Field Service Cloud Daily Extract Description.

One case for using this method is analyzing trends regarding OFSC events.

This post uses RESTful web services to extract JSON-formatted data responses. It also uses the PL/SQL language to call the web services, parse the JSON responses, and perform database table operations in a Stored Procedure. It produces a BICS staging table which can then be transformed into star-schema object(s) for use in modeling. The transformation processes and modeling are not discussed in this post.

Finally, an example of a database job is provided that executes the Stored Procedure on a scheduled basis.

The PL/SQL components are for demonstration purposes only and are not intended for enterprise production use. Additional detailed information, including the complete text of the PL/SQL procedure described, is included in the References section at the end of this post.

Rationale for Using PL/SQL

PL/SQL is the only procedural tool that runs on the BICS / Database Schema Service platform. Other wrapping methods, such as Java or ETL tools, require a platform outside of BICS to run on.

PL/SQL may also be used in a DBaaS (Database as a Service) that is connected to BICS.

PL/SQL can utilize native SQL commands to operate on the BICS tables. Other methods require the use of the BICS REST API.

Note: PL/SQL is very good at showcasing functionality. However, it tends to become prohibitively resource intensive when deployed in an enterprise production environment. For the best enterprise deployment, an ETL tool such as Oracle Data Integrator (ODI) should be used to meet these requirements and more:

* Security

* Logging and Error Handling

* Parallel Processing – Performance

* Scheduling

* Code Re-usability and Maintenance

About the OFSC REST API

The document REST API for Oracle Field Service Cloud Service should be used extensively, especially the Authentication, Paginating, and Working with Events sections. Terms described there such as subscription, page, and authorization are used in the remainder of this post.

In order to receive events, a subscription is needed listing the specific events desired. The creation of a subscription returns both a subscription ID and a page number to be used in the REST calls to receive events.

At this time, a page contains 0 to 100 items (events) along with the next page number to use in a subsequent call.

The following is a list of supported events types available from the REST API:

Activity Events
Activity Link Events
Inventory Events
Required Inventory Events
User Events
Resource Events
Resource Preference Events

This post uses the following subset of events from the Activity event type:

activityCreated
activityUpdated
activityStarted
activitySuspended
activityCompleted
activityNotDone
activityCanceled
activityDeleted
activityDelayed
activityReopened
activityPreworkCreated
activityMoved

The process described in this post can be modified slightly for each different event type. Note: the columns returned for each event type differ slightly and require modifications to the staging table and parsing section of the procedure.

Using Oracle Database as a Service

This post uses the new native support for JSON offered by the Oracle 12c database. Additional information about these new features may be found in the document JSON in Oracle Database.

These features provide a solution that overcomes a current limitation in the APEX_JSON package. The maximum length of JSON values in that package is limited to 32K characters. Some of the field values in OFSC events exceed this length.

Preparing the DBaaS Wallet

Create an entry in a new or existing Oracle database wallet for the trusted public certificates used to secure connections to the web service via the Internet. A link to the Oracle Wallet Manager documentation is included in the References section. Note the location and password of the wallet as they are used to issue the REST request.

The need for a trusted certificate is detected when the following error occurs: ORA-29024: Certificate validation failure.

An example certificate path found using Chrome browser is shown below. Both of these trusted certificates need to be in the Oracle wallet.

  • 2

Creating a BICS User in the Database

The complete SQL used to prepare the DBaaS may be viewed here.

Example SQL statements are below:

CREATE USER "BICS_USER" IDENTIFIED BY password
DEFAULT TABLESPACE "USERS"
TEMPORARY TABLESPACE "TEMP"
ACCOUNT UNLOCK;
-- QUOTAS
ALTER USER "BICS_USER" QUOTA UNLIMITED ON USERS;
-- ROLES
ALTER USER "BICS_USER" DEFAULT ROLE "CONNECT","RESOURCE";
-- SYSTEM PRIVILEGES
GRANT CREATE VIEW TO "BICS_USER";
GRANT CREATE ANY JOB TO "BICS_USER";

Creating Database Schema Objects

Three tables need to be created prior to compiling the PL/SQL stored procedure. These tables are:

*     A staging table to hold OFSC Event data

*     A subscription table to hold subscription information.

*     A JSON table to hold the JSON responses from the REST calls

The staging table, named OFSC_EVENT_ACTIVITY, has columns described in the OFSC REST API for the Activity event type. These columns are:

PAGE_NUMBER — for the page number the event was extracted from
ITEM_NUMBER — for the item number within the page of the event
EVENT_TYPE
EVENT_TIME
EVENT_USER
ACTIVITY_ID
RESOURCE_ID
SCHEDULE_DATE
APPT_NUMBER
CUSTOMER_NUMBER
ACTIVITY_CHANGES — To store all of the individual changes made to the activity

The subscription table, named OFSC_SUBSCRIPTION_PAGE, has the following columns:

SUBSCRIPTION_ID     — for the supported event types
NEXT_PAGE                — for the next page to be extracted in an incremental load
LAST_UPDATE            — for the date of the last extract
SUPPORTED_EVENT — for the logical name for the subscription event types
FIRST_PAGE               — for the first page to be extracted in a full load

The JSON table, named OFSC_JSON_TMP, has the following columns:

PAGE_NUMBER — for the page number extracted
JSON_CLOB       — for the JSON response received for each page
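
For reference, a minimal sketch of the subscription and JSON tables is shown below. The data types and lengths are assumptions for illustration only and should be adjusted to match your environment.

CREATE TABLE OFSC_SUBSCRIPTION_PAGE (
SUBSCRIPTION_ID  VARCHAR2(256),
NEXT_PAGE        VARCHAR2(64),
LAST_UPDATE      DATE,
SUPPORTED_EVENT  VARCHAR2(64),
FIRST_PAGE       VARCHAR2(64)
);

CREATE TABLE OFSC_JSON_TMP (
PAGE_NUMBER  VARCHAR2(64),
JSON_CLOB    CLOB
);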

Using API Testing Tools

The REST requests should be developed in API testing tools such as cURL and Postman. The JSON expressions for parsing should be developed and tested in a JSON expression testing tool such as CuriousConcept. Links to these tools are provided in the References section.

Note: API testing tools such as SoapUI, CuriousConcept, Postman, and so on are third-party tools for using SOAP and REST services. Oracle does not provide support for these tools or recommend a particular tool for its APIs. You can select the tool based on your requirements.

Subscribing to Receive Events

Create subscriptions prior to receiving events. A subscription specifies the types of events that you want to receive. Multiple subscriptions are recommended. For use with the method in this post, a subscription should only contain events that have the same response fields.

The OFSC REST API document describes how to subscribe using a cURL command. Postman can also easily be used. Either tool will provide a response as shown below:

{
"subscriptionId": "a0fd97e62abca26a79173c974d1e9c19f46a254a",
"nextPage": "160425-457,0",
"links": [ … omitted for brevity ]
}

Note: The default next page is for events after the subscription is created. Ask the system administrator for a starting page number if a past date is required.

Use SQL*Plus or SQL Developer and insert a row for each subscription into the OFSC_SUBSCRIPTION_PAGE table.

Below is an example insert statement for the subscription above:

INSERT INTO OFSC_SUBSCRIPTION_PAGE
(
SUBSCRIPTION_ID,
NEXT_PAGE,
LAST_UPDATE,
SUPPORTED_EVENT,
FIRST_PAGE
)
VALUES
(
'a0fd97e62abca26a79173c974d1e9c19f46a254a',
'160425-457,0',
sysdate,
'Required Inventory',
'160425-457,0'
);

Preparing and Calling the OFSC RESTful Service

This post uses the events method of the OFSC REST API.

This method requires the Basic framework for authorization and mandates a base64 encoded value for the following information: user-login “@” instance-id “:” user-password

An example encoded result is:

dXNlci1sb2dpbkBpbnN0YW5jZS1pZDp1c2VyLXBhc3N3b3Jk

The authorization header value is the concatenation of the string ‘Basic’ with the base64 encoded result discussed above. The APEX_WEB_SERVICE package is used to set the header as shown below:

v_authorization_token := 'dXNlci1sb2dpbkBpbnN0YW5jZS1pZDp1c2VyLXBhc3N3b3Jk';
apex_web_service.g_request_headers(1).name  := 'Authorization';
apex_web_service.g_request_headers(1).value := 'Basic '||v_authorization_token;

The wallet path and password discussed in the Preparing the DBaaS Wallet section are also required. An example path from a Linux server is:

/u01/app/oracle

Calling the Events Request

The events request is called for each page available for each subscription stored in the OFSC_SUBSCRIPTION_PAGE table using a cursor loop as shown below:

For C1_Ofsc_Subscription_Page_Rec In C1_Ofsc_Subscription_Page
Loop
V_Subscription_Id := C1_Ofsc_Subscription_Page_Rec.Subscription_Id;
Case When P_Run_Type = 'Full' Then
V_Next_Page := C1_Ofsc_Subscription_Page_Rec.First_Page;
Else
V_Next_Page := C1_Ofsc_Subscription_Page_Rec.Next_Page;
End Case; … End Loop;

The URL is modified for each call. The subscription_id and the starting page are from the table.

For the first call only, if the parameter / variable p_run_type is equal to ‘Full’, the staging table is truncated and the page value is populated from the FIRST_PAGE column in the OFSC_SUBSCRIPTION_PAGE table. Otherwise, the staging table is not truncated and the page value is populated from the NEXT_PAGE column.

Subsequent page values come from parsing the nextPage value in the responses.

An example command to create the URL from the example subscription above is:

f_ws_url := v_base_url||'/events?subscriptionId=' ||v_subscription_id|| chr(38)||'page=' ||v_next_page;

The example URL result is:

https://ofsc-hostname/rest/ofscCore/v1/events?subscriptionId=a0fd97e62abca26a79173c974d1e9c19f46a254a&page=160425-457,0

An example call using the URL is below:

f_ws_response_clob := apex_web_service.make_rest_request (
p_url => f_ws_url
,p_http_method => 'GET'
,p_wallet_path => 'file:/u01/app/oracle'
,p_wallet_pwd => 'wallet-password' );

Storing the Event Responses

Each response (page) is processed using a while loop as shown below:

While V_More_Pages
Loop
Extract_Page;
End Loop;

Each page is parsed to obtain the event type of the first item. A null (empty) event type signals an empty page and the end of the data available. An example parse to obtain the event type of the first item is below. Note: for usage of the JSON_Value function below see JSON in Oracle Database.

select json_value (f_ws_response_clob, '$.items[0].eventType' ) into f_event_type from dual;

If there is data in the page, the requested page number and the response clob are inserted into the OFSC_JSON_TMP table and the response is parsed to obtain the next page number for the next call as shown below:

f_json_tmp_rec.page_number := v_next_page; -- this is the requested page number
f_json_tmp_rec.json_clob := f_ws_response_clob;
insert into ofsc_json_tmp values f_json_tmp_rec;
select json_value (f_ws_response_clob, '$.nextPage' ) into v_next_page from dual;

Parsing and Loading the Events Responses

Each response row stored in the OFSC_JSON_TMP table is retrieved and processed via a cursor loop statement as shown below:

for c1_ofsc_json_tmp_rec in c1_ofsc_json_tmp
loop
process_ofsc_json_page (c1_ofsc_json_tmp_rec.page_number);
end loop;

An example response is below with only the first item shown:

{
"found": true,
"nextPage": "170110-13,0",
"items": [
{
"eventType": "activityUpdated",
"time": "2017-01-04 12:49:51",
"user": "soap",
"activityDetails": {
"activityId": 1297,
"resourceId": "test-resource-id",
"resourceInternalId": 2505,
"date": "2017-01-25",
"apptNumber": "82994469003",
"customerNumber": "12797495"
},
"activityChanges": {
"A_LastMessageStatus": "SuccessFlag – Fail – General Exception: Failed to update FS WorkOrder details. Reason: no rows updated for: order_id = 82994469003 service_order_id = NULL"
}
}
],
"links": [

]
}

Each item (event) is retrieved and processed via a while loop statement as shown below:

while f_more_items loop
process_item (i);
i := i + 1;
end loop;

For each item, a dynamic SQL statement is prepared and submitted to return the columns needed to insert a row into the OFSC_EVENT_ACTIVITY staging table as shown below (the details of creating the dynamic SQL statement have been omitted for brevity):

An example of a dynamically prepared SQL statement is below. Note: for usage of the JSON_Table function below see JSON in Oracle Database.

DYN_SQL
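
As a hedged illustration only, a statement of this kind could resemble the JSON_TABLE query sketched below. The column list is abbreviated, the bind names are assumptions, and the statement actually generated by the procedure may differ.

SELECT jt.event_type, jt.event_time, jt.event_user, jt.activity_id
FROM ofsc_json_tmp t,
     JSON_TABLE(t.json_clob, '$.items[*]'
       COLUMNS (
         item_no     FOR ORDINALITY,
         event_type  VARCHAR2(64)  PATH '$.eventType',
         event_time  VARCHAR2(32)  PATH '$.time',
         event_user  VARCHAR2(64)  PATH '$.user',
         activity_id VARCHAR2(64)  PATH '$.activityDetails.activityId'
       )) jt
WHERE t.page_number = :page_number
AND   jt.item_no = :item_number;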

The execution of the SQL statement and the insert are shown below:

execute immediate f_sql_stmt into ofsc_event_activity_rec;
insert into ofsc_event_activity values ofsc_event_activity_rec;

Verifying the Loaded Data

Use SQL*Plus, SQL Developer, or a similar tool to display the rows loaded into the staging table.

A sample set of rows is shown below:

tabResults
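
A simple summary query such as the one below may also be used to review the loaded events by type:

SELECT event_type, COUNT(*) AS events
FROM ofsc_event_activity
GROUP BY event_type
ORDER BY event_type;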

Troubleshooting the REST Calls

Common issues are the need for a proxy, the need for an ACL, the need for a trusted certificate (if using HTTPS), and the need to use the correct TLS security protocol. Note: This post uses DBaaS, so all but the first issue have been addressed.

The need for a proxy may be detected when the following error occurs: ORA-12535: TNS:operation timed out. Adding the optional p_proxy_override parameter to the call may correct the issue. An example proxy override is:

www-proxy.us.oracle.com

Scheduling the Procedure

The procedure may be scheduled to run periodically through the use of an Oracle Scheduler job as described in Scheduling Jobs with Oracle Scheduler.

A job is created using the DBMS_SCHEDULER.CREATE_JOB procedure by specifying a job name, type, action and a schedule. Setting the enabled argument to TRUE enables the job to automatically run according to its schedule as soon as you create it.

An example of a SQL statement to create a job is below:

BEGIN
dbms_scheduler.create_job (
job_name => 'OFSC_REST_EVENT_EXTRACT',
job_type => 'STORED_PROCEDURE',
enabled => TRUE,
job_action => 'BICS_OFSC_REST_INTEGRATION',
start_date => '12-JAN-17 11.00.00 PM Australia/Sydney',
repeat_interval => 'freq=hourly;interval=24' -- this will run once every 24 hours
);
END;
/

Note: If using the BICS Schema Service database, the package name is CLOUD_SCHEDULER rather than DBMS_SCHEDULER.

The job log and status may be queried using the *_SCHEDULER_JOBS views. Examples are below:

SELECT JOB_NAME, STATE, NEXT_RUN_DATE from USER_SCHEDULER_JOBS;
SELECT LOG_DATE, JOB_NAME, STATUS from USER_SCHEDULER_JOB_LOG;

Summary

This post detailed a method of extracting and loading data from Oracle Field Service Cloud (OFSC) into the Oracle Business Intelligence Cloud Service (BICS) using RESTful services.

The method extracted JSON-formatted data responses and used the PL/SQL language to call the web services, parse the JSON responses, and perform database table operations in a Stored Procedure. It also produced a BICS staging table which can then be transformed into star-schema object(s) for use in modeling.

Finally, an example of a database job was provided that executes the Stored Procedure on a scheduled basis.

For more BICS and BI best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for BICS.

References

Complete Procedure

JSON in Oracle Database

REST API for Oracle Field Service Cloud Service

Scheduling Jobs with Oracle Scheduler

Database PL/SQL Language Reference

APEX_WEB_SERVICE Reference Guide

APEX_JSON Reference Guide

Curious Concept JSON Testing Tool

Postman Testing Tool

Base64 Decoding and Encoding Testing Tool

Using Oracle Wallet Manager

Oracle Business Intelligence Cloud Service Tasks

 

Oracle Data Integrator Best Practices: Using Loading Knowledge Modules on both On-Premises and Cloud Computing


Introduction

 

This article discusses the best practices for selecting and using the Oracle Data Integrator (ODI) Loading Knowledge Modules (LKMs) on both on-premises and on cloud computing.  LKMs are code templates in ODI that can perform data upload operations from on-premises data servers to cloud services, between cloud services, or between on-premises data servers.  ODI supports a variety of technologies such as SQL databases, Big Data, Files, Java Messaging Systems (JMSs), and many other technologies.  Most of these technologies are now available on both on-premises and on data cloud services.  For each of these technologies, a variety of LKMs are available.  For instance, ODI offers LKMs for SQL databases such as Oracle, Teradata, MySQL, MS SQL Server, among others.  For Big Data, ODI offers LKMs for Spark, Hive, Sqoop, Kafka, and Pig, among others.

LKMs are one of seven categories of Knowledge Modules (KMs) found in ODI.  Other categories of KMs include: Reverse-Engineering, Check, Integration, Extract, Journalizing, and Service.  This article focuses on the selection and use of LKMs.    If you would like to learn more about other categories of KMs, go to “Oracle Data Integrator (ODI) Knowledge Modules (KMs).”

 

When are Loading Knowledge Modules Required?

 

In ODI, LKMs are required in the following use-cases:

 

  • Different Technologies – The source datastore and the target datastore are from different technologies.  For instance, an LKM is required when loading data from a file into an Oracle table or when loading data from a Teradata table into an Oracle table.  This use-case is illustrated on Figure 1, below:

 

Figure 1 – Using a LKM with Different Technologies

 

  • Different Data Servers – The source datastore and the target datastore are from the same technology, but they are not located on the same data server.  For instance, an LKM is required when loading data between two Oracle tables that are located on different Oracle data servers.  This use-case is illustrated on Figure 2, below:

 

Figure 2 – Using a LKM with Different Data Servers

 

  • On-Premises to Cloud – On cloud computing, LKMs are required when uploading data from an on-premises data server into a cloud data service.  This use-case is illustrated on Figure 3, below:

 

 

Figure 3 – Using a LKM from On-Premises to Oracle DBCS

 

 

  • Different Instances of a Database Cloud Service – LKMs are also required when both the source and the target datastores are from the same database cloud service, but each datastore is located in a different instance of the service or the datastores are hosted on separate services.  This use-case is illustrated on Figure 4, below:

 

 

Figure 4 – Using a LKM on Different Database Cloud Service Instances

 

Styles of Knowledge Modules

 

In ODI, there are two KM styles: template-style and component-style.  Template-style KMs are available in both ODI 11g and ODI 12c.  Component-style KMs are available in ODI 12c only.  A LKM is either a template-style KM or a component-style KM.  Template-style KMs must be imported from folder /<Oracle Home>/odi/sdk into an ODI repository.  Component-style KMs are automatically installed in ODI when an ODI repository is created.  By default, ODI 12c uses component-style LKMs when a LKM is required, unless ODI users choose to import and use template-style LKMs.

In ODI 12c, when a mapping is created and a LKM is required, ODI automatically assigns a component-style LKM to the mapping.   If a template-style LKM has been already imported into an ODI project, then the template-style LKM is used instead.  Figure 5 below shows how to identify the LKM that ODI automatically assigns to an ODI mapping.  In this example, the LKM Oracle to Oracle (DB Link).GLOBAL – a component-style LKM – has been assigned to this mapping.

 

Figure 5 – ODI 12c Component-Style LKM

 

A template-style KM can be imported either as a global object or as a local object.  When a template-style KM is imported as a global object, it can be used by any ODI project of an ODI repository.  When a template-style KM is imported as a local object, the KM can only be used in the ODI project where it has been imported.  A component-style KM is a global object.  Other ODI objects such as variables, sequences, user functions, and reusable mappings can also be imported as either global objects or local objects.  Thus, LKMs are either global or local objects in an ODI repository.  Figure 6, below, summarizes the KM styles found in ODI, and the KM object types:

 

Figure 6 – Styles & Types of Knowledge Modules in ODI

 

Figure 7, below, shows an example of how LKMs can be configured either as global KMs or local KMs in an ODI repository:

 

 

Figure 7 – Global vs. Local Loading Knowledge Modules in ODI

 

Figure 7, above, shows a global LKM called LKM Oracle to Oracle (datapump) v1.0.  This LKM has been imported as a global object into this ODI repository; thus, it can be used by any of the following three ODI projects:  ODI Project 1, ODI Project 2, and ODI Project 3.  For ODI Project 1, there are three additional LKMs:  LKM Oracle to Oracle (datapump) v2.0, LKM File to Teradata (TTU), and LKM SQL to SQL.  These three LKMs have been imported as local objects into this ODI project; thus, they can only be used in this ODI project.  Note there are two versions of the LKM Oracle to Oracle (datapump), v1.0 and v2.0.  Both versions will be visible in ODI Project 1, and they can both be used in this project.  For ODI Project 2 and ODI Project 3, no local LKMs have been configured or imported into either project.  For additional information on how to import objects and KMs in ODI, go to “Importing Objects in Oracle Data Integrator.”

 

Loading Knowledge Modules Best Practices

 

For a given technology, ODI supports various ways of performing data upload operations.  For instance, for Big Data, ODI has various LKMs to perform data upload operations between HDFS, Hive, Spark, HBase, and Pig.  Each of these tools offers performance benefits, but some of them can upload data faster than others.  Thus, the selection of a LKM has a significant impact on the overall performance of your data upload operations.  For any of the technologies supported by ODI, follow these rules of thumb when selecting and using LKMs:

 

  • Select LKMs that support the fastest method for uploading data between data servers.  For instance, Oracle offers Data Pump and DBLINK – among other tools – to upload data between Oracle databases.  These two Oracle tools are supported by ODI with two LKMs:  LKM Oracle to Oracle (datapump), and LKM Oracle to Oracle (DBLINK).  Both Data Pump and DBLINK offer great performance benefits, but Data Pump can upload data faster than DBLINK because it uses multiple threads to read, export, and import data in parallel.  Thus, when using ODI to load data between Oracle databases, use the LKM Oracle to Oracle (datapump) – the fastest way of loading data between Oracle databases.  Use this approach with other technologies as well, and select LKMs that support the fastest way of uploading data between data servers.  To learn more about using Oracle Data Pump with ODI, go to “Using Oracle Data Pump with Oracle Data Integrator (ODI).”
  • Select LKMs that perform best in your environment.  Some LKMs offer tuning options such as number of parallel threads, direct path load options, and concurrent upload operations.  Test these options and find the optimum configuration based on the available resources in your environment and the amount of concurrent data upload operations that your environment can support at a given time.
  • Explore additional LKMs as well.  Your technology may offer additional data upload tools and ODI may have additional LKMs to support these tools.  If necessary, discuss with your technology experts – such as database administrators (DBAs) and data architects – which tools are recommended.  Then, select the LKMs that support the recommended tools.
  • If the out-of-the-box LKMs do not support the desired technology tool to perform a data upload operation, build your own LKM.  The ODI framework allows ODI users to build their own KMs – this is one of the biggest benefits of using ODI.  For a complete guide on how to develop knowledge modules in ODI, go to “Developing Knowledge Modules with Oracle Data Integrator.”
  • Select LKMs that support the native tools of your technology.  Typically, these LKMs have a broader number of options, and can easily be customized for additional tasks.  Also, when using LKMs that support native tools, the ODI agent can be located on any physical server, since the upload operation is done by the actual technology and not by the ODI agent.  For instance, when loading data from text files into the Oracle database, use the LKM File to Oracle (EXTERNAL TABLE).  This LKM uses the Oracle External Table technology – a native tool of the Oracle database – to upload data in parallel from text files into the Oracle database.  When using this LKM, the upload operation is done by the Oracle database and not by the ODI agent; thus, the ODI agent can reside on any computer.   This topic is discussed in greater details at “Understanding Where to Install the ODI Standalone Agent.”
  • LKMs can upload data from on-premises data servers to cloud data services.  To see examples of how to upload data from on-premises Oracle databases to Oracle Database Cloud Service (DBCS), go to “Using ODI Loading Knowledge Modules on the Oracle Database Cloud Service (DBCS).”  LKMs can also upload data into other non-Oracle cloud services such as the Amazon big data cloud service, Amazon Elastic Map Reduce (EMR).  To see examples of how to use LKMs with Amazon EMR, go to “Using Oracle Data Integrator (ODI) with Amazon Elastic MapReduce (EMR).”
  • Use the ODI 12c Exchange option to download additional LKMs – these LKMs are free of cost.  This option can be invoked from the ODI Studio, by selecting the Check for Updates option from the Help menu.  The ODI 12c Exchange option allows the ODI user-community to share KMs and other ODI objects through update centers.  The ODI 12c Exchange option offers both Oracle supported KMs and non-supported KMs.  For additional information on the ODI 12c Exchange option, go to “Introducing Oracle Data Integrator (ODI) Exchange.”
  • When using LKMs in ODI, take advantage of all the parallel features available in your technology, and orchestrate the data upload operations in parallel.  Most technologies have options to perform data uploads in parallel, and these parallel options may be available through the LKMs options – get familiar with the KM options and configure them accordingly.  Also, ODI 12c offers In-Session Parallelism – data upload operations can run in parallel if multiple execution units are defined in an ODI mapping.  An example of this strategy can be found in the following blog:  “Importing Data from SQL databases into Hadoop with Sqoop and Oracle Data Integrator (ODI).”  Section “Using ODI 12c In-Session Parallelism with Sqoop” of this blog discusses how to use the ODI In-Session Parallelism option.  The blog also discusses how to use ODI packages to load data in asynchronous (parallel) mode.
  • When configuring the ODI Physical Architecture, create a single ODI data server for all the schemas that are physically located on the same physical data server – this will eliminate the need of using LKMs in mappings.  For instance, if two schemas are located on the same Oracle database, create a single ODI data server to host these two schemas – do not create two ODI data servers, one for each schema.  LKMs are not required when the source schema and the target schema are both located on the same physical data server.  However, if the two schemas are located on different Oracle databases, then two ODI data servers are required, one for each schema, and a LKM will be required when performing data upload operations between these two ODI data servers.
  • Figure 8, below, illustrates two ODI data servers – Staging Area and Warehouse – which have been incorrectly configured for two schemas, MY_STG_AREA and MY_WAREHOUSE, respectively.  These two database schemas are located on the same Oracle database service – the JDBC URLs for these two ODI data servers are identical; they reference the same Oracle database service.

 


Figure 8 – ODI Physical Architecture – Two Data Servers Configuration

 

  • The ODI data server configuration on Figure 8, above, is incorrect because it forces ODI to use a LKM to perform a data upload operation between two schemas that are located on the same data server – there is no need to upload data that is already on the same data server.  Also, in this case, the use of a LKM generates additional code that is not needed.  This unnecessary code will be executed by the data server, and this will result in suboptimal performance of the mapping execution.  The unnecessary use of a LKM can be observed in the physical design of an ODI mapping – Figure 9, below, illustrates an example:

 


Figure 9 – ODI Mapping Physical Design – Loading Access Point

 

  • Figure 9, above, shows an ODI mapping, Dimensions.W_STATUS_D, with two datastores:  STATUS (the source datastore), and W_STATUS_D (the target datastore).  This ODI mapping uses the physical architecture defined on Figure 8, above.  The STATUS table is located on schema MY_STG_AREA, and the W_STATUS_D table is located on schema MY_WAREHOUSE.  In this example, ODI forces the use of a LKM because the schemas have been defined in separate ODI data servers.  This can be observed by exploring the loading access point called STATUS_AP.  This loading access point shows that a LKM – LKM Oracle to Oracle (datapump) – has been selected to upload data from the STATUS table.  To remove the loading access point for this mapping, the two schemas must be reconfigured under a single ODI data server.  Figure 10, below, shows the correct configuration:

 


Figure 10 – ODI Physical Architecture – One Data Server Configuration

 

  • Figure 11, below, shows the physical design of the same ODI mapping, Dimensions.W_STATUS_D.  ODI has removed both the loading access point and the LKM from the physical design; thus, the mapping will perform the data integration task without having to unnecessarily upload or stage the data from the source datastore.

 


Figure 11 – ODI Mapping Physical Design without a Loading Access Point

 

Conclusion

 

ODI Knowledge Modules are code templates that perform data integration tasks in ODI mappings.  Out of the box, ODI offers over 150 knowledge modules – users can select, modify, and create their own knowledge modules as well.  When selecting a LKM, select the LKM that supports the fastest method for uploading data between data servers.  Follow the best practices discussed in this article to optimize the overall performance of your data upload operations in ODI.

For more Oracle Data Integrator best practices, tips, tricks, and guidance that the A-Team members gain from real-world experiences working with customers and partners, visit Oracle A-Team Chronicles for Oracle Data Integrator (ODI).

ODI Related Articles

Oracle Data Integrator Best Practices: Using Reverse-Engineering on the Cloud and on Premises

Using ODI Loading Knowledge Modules on the Oracle Database Cloud Service (DBCS)

Using Oracle Data Integrator (ODI) with Amazon Elastic MapReduce (EMR)

Using Oracle Data Pump with Oracle Data Integrator (ODI)

Integrating Oracle Data Integrator (ODI) On-Premise with Cloud Services

Oracle Data Integrator (ODI) Knowledge Modules (KMs)

Developing Knowledge Modules with Oracle Data Integrator

Oracle External Tables

Importing Objects in Oracle Data Integrator

Understanding Where to Install the ODI Standalone Agent

ODI 12c Exchange

Introducing Oracle Data Integrator (ODI) Exchange

Importing Data from SQL databases into Hadoop with Sqoop and Oracle Data Integrator (ODI)

Using Oracle Data Integrator (ODI) to Load BI Cloud Service (BICS)


For other A-Team articles about BICS, click here

Introduction

Oracle Data Integrator (ODI) is a comprehensive data integration platform that covers most data integration scenarios.  It has long been possible to use ODI to load data into BI Cloud Service (BICS) environments that use Database as a Service (DBaaS) as the underlying database.

The recent 12.2.1.2.6 release of ODI added the ability to load data into BICS environments based on a Schema Service Database.  ODI does this by using the BICS REST API.

This article will walk through the following steps to set up ODI to load data into the BICS schema service database through this method:

  • Downloading the latest version of ODI
  • Configuring the physical and logical connection to BICS in ODI
  • Loading the BICS knowledge modules
  • Reverse engineering the BICS model
  • Creating a simple mapping
  • Importing the BICS certificate into the trust store for the standalone agent

This article will not cover the installation and setup of ODI.  The assumption is that a 12.2.1.2.6 environment has been stood up and is working correctly.  For details on how to install and configure ODI, see this document.

 

Main Article

Download The Latest Version of Oracle Data Integrator

Download and install the latest version of ODI from OTN through this link.

 

Configure and Test Connection to BICS

This article will walk through one (of the several) methods to set up the BICS connection with a Physical and Logical connection.  For more details on topology and other approaches, see this document.

1. In ODI studio, select the ‘Topology‘ tab, and expand out ‘Technologies‘ under the Physical Architecture section

Cursor_and_Windows7_x86

2. Scroll down to the ‘Oracle BI Cloud Service’ entry, right click and select ‘New Data Server’

Cursor_and_Windows7_x86

3. Give the Data Server a name, and enter the BICS Service URL, as well as the user credentials and Identity Domain.

The syntax for the URL is:

https://service-identity_domain.analytics.data_center.oraclecloud.com
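For example (hypothetical values used purely for illustration), a service named mybics in the identity domain mycompany, hosted in the us2 data center, would use:

https://mybics-mycompany.analytics.us2.oraclecloud.com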

This URL can be obtained from the BICS instance, by taking the first part of the URL up to ‘oraclecloud.com’

Oracle_BI_Cloud_Service

Note – the Data Loader path will default to /dataload/v1, leave this.
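Before continuing, it can be useful to confirm, from the machine that will run the ODI agent, that the service URL is reachable and that its certificate chain is trusted.  A minimal sketch using curl and the placeholder URL from above (substitute your own service name, identity domain, and data center):

# A 200 or 302 response confirms the host resolves and that the certificate
# presented by the service is trusted by this machine.
curl -sS -o /dev/null -w "HTTP status: %{http_code}\n" \
  https://service-identity_domain.analytics.data_center.oraclecloud.com/analytics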

4. Save the Data Server.  ODI will give you an informational warning about needing to register at least one physical schema.  Click ‘OK‘.

Cursor_and_Windows7_x86

5. Test the connection by selecting ‘Test Connection’

For the time being, use the ‘Local (No Agent)‘ option.

NOTE – Once configuration has been completed, the ODI Agent where the execution will be run should also be tested.  It is likely that additional configuration will need to be carried out – this is covered in the last section of this article ‘Importing the BICS certificate into the trust store for the standalone agent’.

Windows7_x86

If the credentials and URL have been entered correctly, a notification similar to the following should be displayed.  If an error is displayed, troubleshoot and resolve before continuing.

Cursor_and_Windows7_x86

TIP :  

ODI studio’s local agent uses the JDK’s certificate store, whereas the Standalone Agent does not.  It is therefore possible – and quite likely – that while the local agent will provide a successful Test Connection, the Standalone agent will produce an error similar to the following:

oracle.odi.runtime.agent.invocation.InvocationException: oracle.odi.core.exception.OdiRuntimeException: javax.ws.rs.ProcessingException: javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target

To resolve this, the BICS Certificate needs to be added to the trust store used by the Standalone agent.  These steps are covered later in this article in the section ‘Importing Certificate into Trust Store’.

 

6. Right click on the Data Server created in step 2, and select ‘New Physical Schema’

Cursor_and_Windows7_x86

ODI has the ability to load to both the Database Objects (Tables) in the Schema Service Database, and also Data Sets.

This loading option is chosen in the ‘Target Type’ dropdown.  The selection determines which REST operations ODI uses to connect.  Note – once the target type has been chosen and saved, it cannot be changed.

7. In this example the Target Type of Table is selected.

Windows7_x86

8. Save the Physical Schema.

Because we haven’t associated this with a Logical Architecture yet, the following warning will be shown.  Click OK to complete the save.

Windows7_x86

9. Expand out the Logical Architecture section of Topology, and then right click on ‘Oracle BI Cloud Service’ and create a ‘New Logical Schema’

Windows7_x86

10. In the configuration window, give the Logical Schema a descriptive name, and associate your context(s) with the physical schema that was created in steps 6-8.  Save the changes.

Windows7_x86

11. Repeat steps 6 through 10 if you need to create an additional connection to load Data Sets

 

Load BICS Knowledge Modules

ODI uses two different Knowledge Modules for BICS:

– a reverse knowledge module (RKM) called RKM Oracle BI Cloud Service, and

– an integration knowledge module (IKM) called IKM SQL to Oracle BI Cloud Service.

 

1. In ‘Designer’, expand your project and its Knowledge Modules section to see whether the KMs are already available.

Cursor_and_Windows7_x86

If they are, continue to the ‘Reverse Engineer BICS’ section of this article.

2. If the KMs are not shown, right click on the Knowledge Modules section and select ‘Import Knowledge Modules’

Windows7_x86

Browse to a path similar to this to find the import directory.

/u01/oracle/ODI12c/odi/sdk/xml-reference
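To confirm that the BICS knowledge module files are present before importing, the directory can be listed from the command line.  A minimal sketch, assuming the installation path above and that the KM file names contain the words ‘Cloud Service’ (exact file names can vary between ODI versions):

# List the knowledge module definition files related to Oracle BI Cloud Service
# in the ODI xml-reference directory (adjust the path to your own installation).
ls /u01/oracle/ODI12c/odi/sdk/xml-reference | grep -i "cloud service"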

3. In the import wizard, select the 2 BICS KMs, and then select ‘OK’ to load them.

Cursor_and_Windows7_x86

TIP :  

If you already have used ODI for other integration tasks, you may be tempted to use existing Knowledge Modules.  Please note that the IKM SQL to Oracle BI Cloud Service does not support loading the Oracle SDO_GEOMETRY data type column to the BICS target table.

Oracle BI Cloud Service cannot be used as the staging area, and does not support incremental update or flow/static check.  Therefore, the following KMs will not work with the Oracle BI Cloud Service technology:

  • RKM SQL (JYTHON)
  • LKM File to SQL
  • CKM SQL
  • IKM SQL Incremental Update
  • IKM SQL Control Append
  • LKM SQL to SQL (JYTHON)

More details can be found in this document.

 

Reverse Engineer BICS

Reverse-engineering is the process that populates the model in ODI, by retrieving metadata from the data server containing the data structures.

 

1. Create a new model in Designer, by selecting the ‘New Model‘ option as shown below

Cursor_and_Windows7_x86

2. In the Definition tab, give the model a name, select ‘Oracle BI Cloud Service’ as the technology, and select the Logical Schema created previously.

Cursor_and_Windows7_x86

3. In the Reverse Engineer tab, leave the logical agent set to ‘Local (No Agent)‘, and select the RKM Oracle BI Cloud Service knowledge module.  Then save the changes.

Cursor_and_Windows7_x86

TIP :  

At the time of writing this article, there is a bug in the reverse knowledge module that will present an error if tables in the BICS environment contain non-standard characters.

An error like the following may be generated:

ODI-1590: The execution of the script failed.
Caused By: org.apache.bsf.BSFException: exception from Groovy: oracle.odi.runtime.rest.SnpsRSInvocationException: ODI-30163: REST tool invocation failed with response code : 500. URL : https://businessintelltrialXXXX-usoracletrialXXXXX.analytics.us2.oraclecloud.com/dataload/v1/tables/APEX$TEAM_DEV_FILES

There is at least one Apex-related table within BICS environments whose name contains a non-standard character.  That table, as shown in the error above, is ‘APEX$TEAM_DEV_FILES’.

Until this issue is fixed, a workaround is required.

The simplest workaround is to go into the Apex environment attached to the BICS instance, temporarily rename the APEX$TEAM_DEV_FILES table, run the Reverse Engineer process, and then rename the table back.

Another method is to use the ‘Mask’ import option.  If there are specific tables you need to reverse engineer, enter the name followed by %

For instance, if there were 5 tables all starting ‘FACT….’, then a mask of ‘FACT%’ could be used to reverse engineer those 5 tables.

 

4. Select the ‘Reverse Engineer‘ action, and then ‘OK‘ to run the action.

Cursor_and_Windows7_x86

5. This will start a session that can be viewed in the Operator.

Cursor_and_Windows7_x86

6. Once the session has completed, expand the model to confirm that the database objects have been imported correctly.  As shown below, the tables in the BICS Schema Service database are now available as targets.

Cursor_and_Windows7_x86

7. Expand the BICS individual database objects that you will load, and confirm within the attributes that the Datatypes have been set correctly.  Adjust where necessary and save.

Cursor_and_Windows7_x86

 

Create Mapping

1. Within the ‘Mapping’ sub-menu of the project, select ‘New Mapping’

Windows7_x86

2. Drag in the source table from the source that will be loaded into BICS, and then the BICS target table, and link the two together.  For more information on how to create mappings, see this document.

TIP :  

The BICS API only allows data to be loaded, not ‘read’ or ‘selected’.  Because of this, BICS using the Schema Service Database CAN ONLY BE USED as a TARGET for ODI mappings.  It cannot be used as a SOURCE.

 

3. Make sure the Target is using the IKM SQL to Oracle BI Cloud Service:

Windows7_x86

and that an appropriate loading KM is used:

Cursor_and_Windows7_x86

4. Run the mapping, selecting the Local Agent

Windows7_x86

5. Confirm in the Operator that the mapping was successful.  Troubleshoot any errors you find and re-run.

Cursor_and_Windows7_x86

 

Importing Certificate into Trust Store

To operate, it is likely that the Standalone Agent will require the BICS certificate to be added to its trust store.

These instructions use Microsoft Internet Explorer, although other browsers offer similar functionality.

1. In a browser, open the BICS /analytics portal, then click on the padlock icon.  This will open an information box; select ‘View certificates’.

Cursor_and_Windows7_x86

2. In the ‘Details‘ tab, select the ‘Copy to File‘ option which will open an export wizard.

Windows7_x86

3. Select the ‘DER encoded binary’ format and then ‘Next’

Cursor_and_Windows7_x86

4. Choose a path and file name for the certificate, then ‘Next’, and on the final screen ‘Finish’ to export the certificate.

Cursor_and_Windows7_x86
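If a browser is not convenient, for example on a headless Linux host, the certificate can also be retrieved from the command line.  A minimal sketch using openssl, with the placeholder host name from earlier in this article and the same output path used in the keytool example later (adjust both to your environment):

# Fetch the certificate presented by the BICS service and save it in DER format.
openssl s_client -connect service-identity_domain.analytics.data_center.oraclecloud.com:443 \
  -servername service-identity_domain.analytics.data_center.oraclecloud.com </dev/null 2>/dev/null \
  | openssl x509 -outform DER -out /u01/oracle/Downloads/BICS.cer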

 

TIP :  

This article will go through the steps needed to add this certificate to the DemoTrust.jks key store.  This should *ONLY* be followed for demonstration or test environments.  For production environments, follow best practice guidelines as outlined in this document.

 

5. Copy the certificate file created in the previous steps to a file system accessible by the host running the standalone ODI agent.

6. Set JAVA_HOME to the installation directory of the JDK that was used when installing the standalone agent (the JDK home itself, not its bin subdirectory), for example

export JAVA_HOME=/u01/oracle/jdk1.8.0_111
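Optionally, put that JDK’s bin directory first on the PATH so the keytool invoked below is the one from the same JDK.  A minimal sketch:

# Resolve keytool from the JDK referenced by JAVA_HOME.
export PATH=$JAVA_HOME/bin:$PATH
which keytool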

7. Browse to the bin directory of the ODI Domain Home, in this test environment that path is as follows:

/u01/oracle/ODI12c/user_projects/domains/base_domain/bin

8. Run the ‘setODIDomainEnv‘ script.  In a linux environment this would be:

./setODIDomainEnv.sh
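Depending on the shell, it may be preferable to source the script rather than execute it, so that the environment variables it sets remain available in the current session.  A sketch (the exact variables set can vary by release):

# Source the script, then confirm the value used in the key store path below.
. ./setODIDomainEnv.sh
echo $ORACLE_HOME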

The DemoTrust.jks keystore used by the agent should be located in the following path:

$ORACLE_HOME/wlserver/server/lib
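If there is any doubt about which key store the agent uses (see the tip below), it can help to locate every copy first.  A minimal sketch, assuming the installation root used in this example:

# Locate every DemoTrust.jks under the ODI installation so that the correct
# key store is updated for the standalone agent.
find /u01/oracle/ODI12c -name DemoTrust.jks 2>/dev/null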

 

TIP :  

It is possible that there are a number of DemoTrust.jks key stores on the file system, so make sure the correct one is updated.  If this process fails to resolve the error with the Standalone Agent, search the file system and see if it is using a different trust store.

 

9. Browse to that directory and confirm the DemoTrust.jks file exists.  In that same directory, run the keytool command to import the certificate created earlier.

The syntax for the command is as follows, with $CERTIFICATE referencing the name/path of the certificate file downloaded from the BICS environment through the browser, $ALIAS being a name used to identify the certificate in the key store, and $KEYSTORE the name/path of the key store.

keytool -importcert -file $CERTIFICATE -alias $ALIAS -keystore $KEYSTORE

In this example, the command would be:

keytool -importcert -file /u01/oracle/Downloads/BICS.cer -alias BICS -keystore DemoTrust.jks

The default password is DemoTrustKeyStorePassPhrase.

10. Details of the certificate are displayed, along with a prompt to ‘Trust this certificate?’.  Type ‘yes’ and press Enter.

Cursor_and_Windows7_x86

If the import is successful, a confirmation that the certificate was added to the keystore is given.
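To double-check, the key store can be listed using the alias and demo key store from the example above:

# Verify that the BICS certificate is now present in the key store.
keytool -list -alias BICS -keystore DemoTrust.jks -storepass DemoTrustKeyStorePassPhrase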

11. Return to ODI and run the mapping, this time selecting the Standalone agent, and confirm it runs successfully.

Summary

This article walked through the steps to configure ODI to load data into the BICS schema service database through the BICS REST API.

For other A-Team articles about BICS, click here.
