
Let’s start by understanding Google BigQuery and then move on to the GCP BigQuery plug-in and the benefits of using it.

Google BigQuery

Google BigQuery is a serverless data warehousing platform where you can query and process vast amounts of data. The best part is that you can run multiple queries in a matter of seconds, even on relatively large datasets. It is a Platform-as-a-Service (PaaS) that supports querying with ANSI SQL and also offers built-in machine learning capabilities.
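For instance, a minimal ANSI SQL query such as the following (shown purely as an illustration, against one of Google’s publicly available sample datasets) aggregates millions of rows in a few seconds:

-- Illustrative query against the public usa_names sample dataset
SELECT name, SUM(number) AS total
FROM `bigquery-public-data.usa_names.usa_1910_2013`
GROUP BY name
ORDER BY total DESC
LIMIT 10;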

To empower your Workload Automation environment with these capabilities, download the Google BigQuery plug-in available on Automation Hub.

After downloading it, log in to the Dynamic Workload Console and go to the Workload Designer. Create a new job and select “Google BigQuery” in the Cloud section.

Figure 1: Job Definition Page

Connection

First, establish a connection to the Google Cloud server. In the Connection tab, you can choose between two options to connect to Google Cloud:

  1. GCP Default Credentials: choose this option if the VM already resides inside the GCP environment, so there is no need to provide credentials explicitly.
  2. GCP Server Credentials: choose this option to manually provide the GCS account and the project ID (a unique name associated with each project).

Then, you can test the connection to Google Cloud by clicking on Test Connection.

Figure 2: Connection Page

Figure 3: Connection page > Test Connection

Action

After successfully testing the connection, go to the Action tab and specify the bucket name and the operation you want to perform:

  • Job Name: specify the name of the job. It must be unique among running jobs.
  • Region: choose a BigQuery regional endpoint to deploy worker instances and store job metadata (for example, US, us-east1, etc.).
  • Standard SQL: choose it as the primary method to analyze data in BigQuery. It scans one or more tables or expressions and returns the computed result rows (for example, Select * From ‘TABLE NAME’ Limit 1000).
  • Script: choose it to manage BigQuery projects, upload new data, and execute queries. Select the script with the .sql extension and submit the job.
  • Procedure: choose it to take input arguments and return values as output. To call a stored procedure after creation, use the “CALL” statement.

For example:

“CALL `bigquerysandbox-342206.emp.my_proc`();”

This statement calls the my_proc stored procedure.
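If the procedure does not exist yet, you can create it in BigQuery beforehand. A minimal sketch, reusing the project and data set from the CALL example above with a hypothetical employees table and query body:

-- Hypothetical procedure body; the employees table is only an example
CREATE OR REPLACE PROCEDURE `bigquerysandbox-342206.emp.my_proc`()
BEGIN
  SELECT * FROM `bigquerysandbox-342206.emp.employees` LIMIT 10;
END;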

Figure 4: Action Page

  • Load Jobs:

Choose it to load data from one or more files into a single database table. The files can be in formats such as JSON, CSV, or Avro. Specify a name for the data set, a name for the table, and the source path in Google Cloud Storage (for example, “gs://cloud-samples-data/bigquery/us-states/us-states.csv”).
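For reference, the operation the plug-in performs is roughly equivalent to a BigQuery LOAD DATA statement; the data set and table names below are hypothetical:

-- Load the sample CSV into a hypothetical table, skipping the header row
LOAD DATA INTO my_dataset.us_states
FROM FILES (
  format = 'CSV',
  skip_leading_rows = 1,
  uris = ['gs://cloud-samples-data/bigquery/us-states/us-states.csv']
);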

Figure 5: Action page > Load Job

  • Copy Jobs:

Choose it to copy a data set within a region or from one region to another, without having to extract, move, and reload data into BigQuery. You can create the destination data set and table to transfer the data to beforehand; otherwise, the plug-in creates them automatically. Then, fill out the fields providing the data set name, the source table name, the destination data set name, and the destination table name.
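Behind the scenes this corresponds to a BigQuery table copy; a rough SQL equivalent, with hypothetical source and destination names, is:

-- Copy a table into a (hypothetical) destination data set
CREATE TABLE IF NOT EXISTS dest_dataset.us_states_copy
COPY my_dataset.us_states;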

Figure 6: Action page > Copy Jobs

  • Export Jobs:

Choose it to export table data from BigQuery. You can export up to 1 GB per file, and exporting data from the source is very fast. To export a file, provide the source data set name, the source table name, the GCS bucket name (for example, BigQuerysandbox-342206), the file name, and the format in which the file should be exported.
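For comparison, the same extraction can be expressed directly in BigQuery SQL with an EXPORT DATA statement; the bucket, path, and table names below are hypothetical:

-- Export a hypothetical table to CSV files in a Cloud Storage bucket
EXPORT DATA OPTIONS (
  uri = 'gs://my-bucket/exports/us_states_*.csv',
  format = 'CSV',
  overwrite = true,
  header = true
) AS
SELECT * FROM my_dataset.us_states;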

Figure 7: Action page > Export Jobs

Then, enable Save Function Output to File and specify the output file path.

Figure 8: Action page Contd.

Submitting your job

Now, it is time to submit the job in the current plan. Add the job to the job stream that automates your business process flow. Select the action menu in the top-left corner of the job definition panel and click Submit Job into Current Plan. A confirmation message is displayed, and you can switch to the Monitoring view to see what is going on.

Figure 9: Submitting job

Monitor Page:

Figure 10: Monitor Page

Job Logs:

Figure 11: Job log

The Google BigQuery plug-in is quick and easy to install. With it, you can orchestrate your cloud and big data automation processes and analyze huge amounts of data in a short time.

So, what are you waiting for? Download it now!

 
