

10. The method according to claim 8, wherein determining the semantic understanding for a first one of the criterions comprises comparing signifiers in values for the first criterion to a known format associated with a particular criterion and, if the signifiers in the values for the first criterion are in the known format, determining the first criterion is the particular criterion.

9. The method according to claim 8, wherein determining the semantic understanding for a first one of the criterions comprises comparing signifiers in the values for the first criterion across a set of the data entries to identify patterns consistent with one of a plurality of required criterions for the data analysis tool.

8. The method according to claim 1 further comprising, if the first data format does not correspond to the signature characteristics associated with any of the plurality of known data formats, determining a semantic understanding for each of the criterions based on analysis of signifiers in the values for the criterion and determining a plurality of data manipulation operations based on the semantic understanding for each of the criterions in order to convert the first data set into the desired data format for the data analysis tool.

7. The method according to claim 1, wherein the data manipulation operations comprise one or more operations for splitting, joining, adding, deleting, reformatting, grouping, ungrouping or editing signifiers within at least one cell, column or row within the first data file.

6. The method according to claim 1, wherein determining if the first data format corresponds to the signature characteristics associated with one of the known data formats comprises identifying distinct markers in the first data file and comparing the distinct markers in the first data file to markers associated with the known data formats.
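
By way of non-limiting illustration only, the semantic fallback recited above (comparing the signifiers in a criterion's values to known formats, across a set of data entries) might be sketched as follows. The pattern table `CRITERION_FORMATS`, the function names and the 0.9 match threshold are all assumptions introduced for this sketch; none of them is taken from the claims.

```python
import re
from typing import Dict, List, Optional

# Hypothetical table of known value formats, keyed by the criterion they suggest.
CRITERION_FORMATS: Dict[str, "re.Pattern[str]"] = {
    "date": re.compile(r"\d{4}-\d{2}-\d{2}"),
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+"),
}


def infer_criterion(values: List[str], threshold: float = 0.9) -> Optional[str]:
    """Guess which criterion a column holds from the format of its values.

    A criterion is accepted when at least `threshold` of the non-empty
    values fully match its known format (the threshold is an assumption).
    """
    non_empty = [v.strip() for v in values if v and v.strip()]
    if not non_empty:
        return None
    for name, pattern in CRITERION_FORMATS.items():
        hits = sum(1 for v in non_empty if pattern.fullmatch(v))
        if hits / len(non_empty) >= threshold:
            return name
    return None


def infer_all(rows: List[Dict[str, str]]) -> Dict[str, Optional[str]]:
    """Apply infer_criterion to every column across all data entries."""
    columns: Dict[str, List[str]] = {}
    for row in rows:
        for header, value in row.items():
            columns.setdefault(header, []).append(value)
    return {header: infer_criterion(vals) for header, vals in columns.items()}
```

Matching across all entries of a column, rather than a single value, is what lets a pattern emerge even when the column header itself is uninformative.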
5. The method according to claim 1, wherein determining if the first data format corresponds to the signature characteristics associated with one of the known data formats comprises semantically analysing formatting of the values for one or more of the criterions and comparing the formatting of the values for the criterions in the first data file to formatting associated with the known data formats.

4. The method according to claim 1, wherein determining if the first data format corresponds to the signature characteristics associated with one of the known data formats comprises semantically analysing one or more criterion indications associated with the criterions in the first data file and comparing the criterion indications in the first data file to criterion indications associated with the known data formats.

3. The method according to claim 1 further comprising determining if the criterions in the first data set are in the desired data format for the data analysis tool and wherein the determining if the first data format corresponds to the signature characteristics associated with one of the known data formats is in response to the criterions in the first data set not being in the desired data format for the data analysis tool.
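
By way of non-limiting illustration only, comparing criterion indications (here assumed to be column headers) and value formatting against a known data format might be sketched as follows. The `KNOWN` table, the format name `vendor_a`, the header normalisation rule and the date pattern are hypothetical and serve only to make the sketch runnable.

```python
import re
from typing import Dict, List

# Hypothetical description of one known data format.
KNOWN = {
    "vendor_a": {
        "headers": {"patient id", "date of birth"},
        "value_patterns": {"date of birth": re.compile(r"\d{2}/\d{2}/\d{4}")},
    },
}


def normalise(header: str) -> str:
    """Fold case and treat spaces, underscores and hyphens alike."""
    return re.sub(r"[\s_\-]+", " ", header).strip().lower()


def headers_match(rows: List[Dict[str, str]], fmt: dict) -> bool:
    """True if every header expected by the known format is present."""
    if not rows:
        return False
    present = {normalise(h) for h in rows[0]}
    return fmt["headers"] <= present


def values_match(rows: List[Dict[str, str]], fmt: dict) -> bool:
    """True if every value with a known pattern is formatted as expected."""
    for row in rows:
        for header, value in row.items():
            pattern = fmt["value_patterns"].get(normalise(header))
            if pattern is not None and not pattern.fullmatch(value):
                return False
    return True
```

In this sketch a data set would be treated as corresponding to `vendor_a` only when both checks pass; an implementation could equally weigh the two checks or use either one alone.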

2. The method according to claim 1 further comprising accessing a database that stores one or more data manipulation operations associated with each of the known data formats required to convert the corresponding known data format to the desired data format for the data analysis tool.
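
By way of non-limiting illustration only, a database that stores the data manipulation operations associated with each known data format might be sketched with an in-memory SQLite table. The table name `format_operations`, the format name `vendor_a` and the JSON encoding of each operation are assumptions made for this sketch and are not drawn from the claims.

```python
import json
import sqlite3
from typing import List


def build_operations_db() -> sqlite3.Connection:
    """Create an in-memory database mapping a known format to its operations."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE format_operations ("
        "format_name TEXT PRIMARY KEY, operations TEXT NOT NULL)"
    )
    # Hypothetical operations for a hypothetical known format.
    operations = [
        {"op": "split", "column": "full_name", "into": ["first_name", "last_name"]},
        {"op": "delete", "column": "notes"},
    ]
    conn.execute(
        "INSERT INTO format_operations VALUES (?, ?)",
        ("vendor_a", json.dumps(operations)),
    )
    conn.commit()
    return conn


def operations_for(conn: sqlite3.Connection, format_name: str) -> List[dict]:
    """Return the stored operations for a format, or an empty list if unknown."""
    row = conn.execute(
        "SELECT operations FROM format_operations WHERE format_name = ?",
        (format_name,),
    ).fetchone()
    return json.loads(row[0]) if row else []
```

Storing the operations as ordered data rather than code keeps the conversion recipe for each known format inspectable and editable without redeploying the tool.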

1. A method implemented by a computing apparatus for shaping data into a desired data format for a data analysis tool, the method comprising: receiving a first data set comprising a plurality of data entries organized in a first data format, each of the data entries comprising a value for each of a plurality of criterions; determining if the first data format corresponds to one or more signature characteristics associated with one of a plurality of known data formats; if the first data format corresponds to the signature characteristics associated with one of the plurality of known data formats, determining one or more data manipulation operations required to convert the corresponding known data format into the desired data format for the data analysis tool; and conducting the plurality of data manipulation operations on the first data set.
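
By way of non-limiting illustration only, the steps recited above (receiving a data set, matching it against the signature characteristics of known formats, determining the associated operations and conducting them) might be sketched as follows. The representation of data entries as dictionaries, the use of header sets as signatures, the format name `vendor_a` and the `split_full_name` operation are all assumptions made for this sketch.

```python
from typing import Callable, Dict, FrozenSet, List, Optional, Tuple

Row = Dict[str, str]
Operation = Callable[[List[Row]], List[Row]]


def split_full_name(rows: List[Row]) -> List[Row]:
    """Hypothetical operation: split 'full_name' into first and last name."""
    out = []
    for row in rows:
        first, _, last = row["full_name"].partition(" ")
        new_row = {k: v for k, v in row.items() if k != "full_name"}
        new_row["first_name"] = first
        new_row["last_name"] = last
        out.append(new_row)
    return out


# Hypothetical registry: format name -> (signature headers, operations).
KNOWN_FORMATS: Dict[str, Tuple[FrozenSet[str], List[Operation]]] = {
    "vendor_a": (frozenset({"full_name", "dob"}), [split_full_name]),
}


def match_known_format(rows: List[Row]) -> Optional[str]:
    """Return the first known format whose signature headers are all present."""
    if not rows:
        return None
    headers = frozenset(rows[0].keys())
    for name, (signature, _ops) in KNOWN_FORMATS.items():
        if signature <= headers:
            return name
    return None


def shape(rows: List[Row]) -> List[Row]:
    """Convert a data set in a known format into the desired format."""
    name = match_known_format(rows)
    if name is None:
        raise LookupError("no known data format matched")
    _signature, operations = KNOWN_FORMATS[name]
    for operation in operations:
        rows = operation(rows)
    return rows
```

In this sketch an unmatched data set simply raises an error; handling of data sets that match no known format is outside what the sketch attempts.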
