View Datasets
Access any dataset you are authorized to view and check how the data are distributed.
A dataset is a simple collection of data, usually presented in a table. You can use a dataset as the basis for your story, and as a data source for Smart Predict.
In this area, you can see all datasets that you've created or that you are authorized to view.
During the initial data import, a data structure is created; the data type of each column is inferred. The results are displayed in the right-hand Details panel.
- Dataset Overview: lists all the available columns under separate Measures and Dimensions headings in the Output tab. You can view a list of all available columns in the Columns tab.
- Details: includes information for a given column, such as histograms (rectangles indicating the frequency of data items in successive numerical intervals) and the column's inferred Data Type.
Numerical or textual histograms are displayed under Data Distribution.
Text histograms are horizontal, and the values are clustered by count. The number of clusters can be adjusted by dragging the slider displayed above the histogram. When a cluster contains more than one value, the displayed count is the average count for the cluster. The count is prefaced by a tilde symbol (~) if there are multiple different occurrences. Expand the cluster for a more detailed view of the values in the cluster along with individual counts. Use the search tool to look up specific column values, and press Enter to initiate the search. When you select a value in the histogram, the column is sorted, and the value is highlighted in the grid.
Note: The displayed histogram is determined by the column's data type. A column containing numbers could still be considered as text if Data Type is set to String.
Numerical histograms are vertical and represent the range of values along the x-axis. Hover over any bar to show the Count, Min, and Max values for the data in the bar. The number of bars can also be adjusted by using the slider above the histogram. Toggling the Show Outliers box includes or removes outlier values from the histogram.
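To make the two histogram types more concrete, the following Python sketch approximates what each one summarizes. It is an illustration only, not the product's implementation: the function names `text_clusters` and `numeric_bars` are hypothetical, the clustering rule (rank values by count, then cut the ranking into contiguous chunks) is an assumption, and the tilde is read here as marking a cluster whose values have different counts.

```python
from collections import Counter
from statistics import mean


def text_clusters(values, n_clusters):
    """Group distinct text values into at most n_clusters clusters by count.

    Assumption: values are ranked by count and cut into contiguous chunks.
    """
    ranked = Counter(values).most_common()  # (value, count) pairs, highest count first
    size = max(1, -(-len(ranked) // n_clusters))  # ceiling division, never zero
    clusters = []
    for start in range(0, len(ranked), size):
        chunk = ranked[start:start + size]
        counts = [count for _, count in chunk]
        avg = round(mean(counts))
        # Prefix a tilde when the cluster mixes values with different counts.
        label = f"~{avg}" if len(set(counts)) > 1 else str(avg)
        clusters.append({"values": dict(chunk), "label": label})
    return clusters


def numeric_bars(values, n_bars):
    """Split numbers into n_bars equal-width intervals; report Count, Min, Max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bars or 1.0  # fall back to 1.0 when all values are equal
    bars = [[] for _ in range(n_bars)]
    for v in values:
        # The largest value sits on the upper edge, so clamp it into the last bar.
        idx = min(int((v - lo) / width), n_bars - 1)
        bars[idx].append(v)
    # Empty intervals are skipped; each remaining bar mirrors the hover tooltip.
    return [{"count": len(b), "min": min(b), "max": max(b)} for b in bars if b]
```

Calling `text_clusters` with a smaller `n_clusters` mimics moving the slider toward fewer clusters: more values share a cluster, and the displayed count becomes an average rather than an exact figure.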
Note: Below a numerical histogram, there is a box and whisker plot to visualize the histogram's distribution of values as well as Min, Median, and Max values.
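As a rough illustration of the values behind a box and whisker plot, the sketch below computes a five-number summary with Python's standard `statistics` module. The function name `box_plot_summary` is hypothetical, and the quartile method is an assumption; the application may compute quartiles and whiskers differently.

```python
from statistics import median, quantiles


def box_plot_summary(values):
    """Return Min, first quartile, Median, third quartile, and Max.

    Needs at least two values. The "inclusive" quartile method is an assumption.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return {
        "min": min(values),
        "q1": q1,
        "median": median(values),
        "q3": q3,
        "max": max(values),
    }
```

The box spans the first to third quartile with the median marked inside it, while the Min and Max values correspond to the extremes reported under the plot.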
If you cannot see the options to create a dataset, you do not have the required role or permission. For more information, see Standard Application Roles.