This short series discusses many of the often-overlooked issues surrounding data preparation for business analysts. Toad Data Point is the perfect tool for making data preparation easy.

This blog will cover:

• Multiple database/data source connectivity
• Data cleansing/transformation
• Data subsetting and data maintenance

Data preparation is easy with Toad Data Point, check it out in the eStore.

Multiple database/data source connectivity

Toad Data Point is the best tool I’ve worked with for data preparation for the business analyst. Not only did Quest apply many of the best features of the popular Toad tool for use here, they greatly enhanced this tool’s ability in many ways.

First, I’ll show the multitude of database and data platforms supported by Toad Data Point.

Connecting to external data sounces for data preparation using Toad Data Point

Wide range of support allows Toad Data Point to connect to different data sources

Figure 1: Toad Data Point Connectivity Panel

This wide range of support allows Toad Data Point to connect to any of these data sources and include their data stores in a single query in either the Toad Data Point Query Builder window (my recommendation for cross-platform data access) or the Toad Data Point Editor window.

Notice that Toad Data Point can use Microsoft spreadsheets and/or Access databases as a data store! Yes, you can also load Microsoft data easily into Toad Data Point’s local storage.

Any data store that is not in this list but has the ability to be accessed via an ODBC connection is also supported by Toad Data Point.

***Note*** The author is using Toad Data Point version 5.3.0.333 for these illustrations. All of these features shown here should work the same in any version of Toad Data Point. These are not new features to Toad Data Point, but base functionality that has been a core feature of the tool all along.

Data Cleansing/Transformation

A powerful data preparation feature of Toad Data Point is the ability to easily change, correct and add data columns to an existing data extract.

This feature is accessed from the data grid using a right-mouse click and selecting Send to > Transformation and Cleanse.

Data cleansing feature helps to solve data preparation issues

Figure 2: Toad Data Point Data Send to Menu

Toad Data Point Data Menu Bar Find and Calculated Column Buttons

Figure 3: Toad Data Point Data Menu Bar Find and Calculated Column Buttons

This example demonstrates fixing the null values that business analysts often encounter and have to fix after they notice the math didn’t work. This example also shows how Toad Data Point enables enhancing the data by using a calculated column that adds the SAL and COMM fields together.

Toad Data Point Fixing Null Values helps solve data preparation issues

Figure 4: Toad Data Point Fixing Null Values

To perform this operation, I clicked on the COMM field, then the binocular button (Find) (you can also right click on the column and get the same menu items) then filled in the Find box using the drop down menu and putting a 0 in the Replace box. See illustration above. Click on ‘Apply Rule’ when done. You will notice the rule is applied and the data changed to reflect the rule.

These data preparation rules can be saved and applied again using other queries and during an automation process.

Now, I will click the ‘Calculated Column’ button. I often call this the ‘Add Column’ button because this allows you to run literally any function/data manipulation/concatenation, etc., and have the output of these functions appear as a new column.

Calculated column step

Figure 5: Toad Data Point Adding a Column

With the ‘Calculated Column’ button, you can build almost any calculation or apply almost any function using your mouse. I double clicked first on SUM in the center box that lists the columns (notice the columns feature is selected in the left column). I then clicked on the ‘+’ sign just above these boxes, then I double clicked on the COMM field name. I gave the column the name ‘Total_Compensation.’ When you click the ‘Apply Rule’ button, you will see the new column appear, populated by the correct data, i.e., our null values are fixed and the math works correctly here.

Try Toad Data Point for free

Learn how Toad Data Point can help you access and prepare data faster. Seamlessly access more than 50 data sources—both on premises and in the cloud—and switch between these data source with near zero transition times.

Get started with our free 30-day trial.

Already in a trial? If Toad Data Point is helping you connect, query and prepare data for faster business insights, buy it now or contact a sales representative.

 

Data subsetting and data maintenance

The query could have fixed the null value and done the simple math, but I wanted to show how easily Toad Data Point could fix the data as well. The query could have selected data for a subset to be used in other queries, perhaps using a date range. The query could also have a group by clause allowing the data to be summarized.

The output of any query can be saved to your local storage – a database that is internal to Toad Data Point or Toad Intelligence Central. Saving to your local storage allows you to reuse the data as long as you want. No one else can see it or use it. Saving the data to Toad Intelligence Central allows anyone assigned to your group, or anyone using Toad Intelligence Central (this is a simple permissions thing which is easy to assign in Toad Intelligence Central) to reuse the data.

I want to save this data to Toad Intelligence Central and have Toad Intelligence Central automatically refresh the data once a day at midnight. This way, the data is always ready to go when the business analysts arrive in the morning.

***Note*** This feature addresses the issue of having to work with IT staff to save your subsetted data.

Select the button ‘Publish Data’ found along the bottom of the data panel.

Publish to Toad Intelligence Central

Figure 6: Toad Data Point Publish to Intelligence Central Panel

This brings up the ‘Publish to Toad Intelligence Central’ panel as seen above. There isn’t space in this blog to go into Toad Intelligence Central setup, but you can make folders and store almost anything you want to share. I will use the default values here except for the Scheduling feature.

Notice the Sharing link. This link allows you to define whether this data is useable just by you, a selection of people, your group, or public (anyone with access to Toad Intelligence Central).

Also notice you can index this data store, another topic for another time but Toad Data Point allows for performance features to be included.

We want this data store to automatically refresh and the name to remain the same. I have clicked the ‘Overwrite Existing Object’ box and will click on the Scheduling link.

Toad Data Point Scheduling the Refresh

Figure 7: Toad Data Point Scheduling the Refresh

These refresh options. and anything selected here, can easily be changed using the Toad Intelligence Central console. I have selected that this data be created in Intelligence Central now and refreshed daily at midnight.

Make your changes to the schedule and click ‘Publish’ when ready to create the data store in Intelligence Central.

Toad Data Point viewing New Object in Intelligence Central

Figure 8: Toad Data Point Viewing New Object in Intelligence Central

Now, connect to Toad Intelligence Central and you will see your data store. I double clicked on it here to bring up the object viewer so we could see our data preparation changes (null values fixed) and our new column. This data store can be used just like any other data object now. Since it is in Intelligence Central, it can be used by anyone in the group (for example).

If you are subsetting/summarizing/fixing data, this task can be performed once, then automatically refreshed thereby saving the data analyst from having to perform these data preparation tasks, and allowing them to be more efficient in their data analysis.

Data preparation is easy with Toad Data Point and its rich features for cross-database support, single query data access (across same cross-database platforms), data cleansing/transformations, and saving the data for use by others. This saved data can then be automatically maintained using Toad Intelligence Central

Watch the video demo

 

Related links:

Blog / video: What is data preparation and why is it key to the business analyst?

Webinar: Answering Your Top Five Questions About Toad Data Point Automation

Ebook: The Four Roadblocks of Data prep

Visit Quest for data prep and provisioning with Toad Data Point and Toad Intelligence Central

IDC Datasheet: Quest Software Acquires erwin Inc.

Visit Quest erwin, for enterprise data governance

Blog: How to prepare data for machine learning

Blog: Easy data prep: Toad® Data Point v5+ new features

Quest blog: Data prep and Analysis: Using Data Efficiently in the Era of Dark Data

 

Help your colleagues

If you think your colleagues would benefit from this blog, share it now on social media with the buttons located at the top of this blog post. Thanks!

About the Author

Dan Hotka

Dan Hotka is an Author/Instructor/Expert/Oracle ACE Director Alumni who uses TOAD regularly in his web-based and on-line courses. He is well published with many books still available on Amazon or www.DanHotka.com. Dan is available for user groups and company educational events. He regularly speaks at user groups around the world. Check his website for course listings, references, and discount codes for his online courses based in Safari.

Start the discussion at forums.toadworld.com