|
|
|
|
Import data from a number of popular data formats. |
![]() |
GRC ToolsTM can process data effectively from almost any country. |
![]() |
Country coding - GRC ToolsTM works on a country-by-country and language region basis. |
![]() |
Choose the processes that you would like GRC ToolsTM to run on each country. |
![]() |
Choose the fields over which to run each process. |
![]() |
GRC ToolsTM will keep you updated of progress through the data. |
The speed at which GRCToolsTM runs depends on the specifications of the computer upon which it is being run, the number and length of fields being processed, the field contents, the number of countries being processed in a single pass and the number of records in the lookup table(s) (which differs by country).
This test shows processing numbers per hour for a single field per record per process. This test was run on a Dell Dimension 8100 Pentium 4 1.48 GHz computer running Windows XP. The test file contained 63875 records with data from 101 countries. The timings shown are for guidance only. They are processing times - the program run time will include additional unmonitored activities such as the opening and closing and encryption and decryption of data and lookup tables.
| Process | Records per hour (1 field processed per record) |
| Remove non-numeric characters | 88000 |
| Remove punctuation | 105900 |
| Remove double spaces | 118500 |
| Remove postal code country code (GB-, B-, CH- etc. preceding a postal code) | 143300 |
| Check postal code validity (in terms of length, disallowed characters etc.) | 16800 |
| Locate and parse postal codes | 131000 to locate them, 2200 to move them. |
| Assign language region codes (in multilingual countries) | 91800 |
| Remove accents and replace them with their correct non-accented equivalents | 94100 |
| Remove quotation mark pairs | 93900 |
| Add missing apostrophes for French-language strings | 99600 |
| Make data into upper case, taking account of correct equivalents for accented characters | 87500 |
| Make data into mixed case, taking account of correct equivalents for accented characters and words which must not start with upper-case letters and other exceptions | 73100 |
| Move articles ("the") to the front or back of a company name in a number of formats | 61500 |
| Standardize "and" strings | 121500 |
| Standardize abbreviations and acronyms | 84500 |
| Standardize company legal types and other indicators (Ltd, SA, BV etc.) | 38400 |
| Locate, parse and standardize postbox strings and numbers | 79500 |
| Standardize thoroughfare types (street, rue, via, straße etc.) - works for over 300000 thoroughfare strings! | 2300 |
| Move house numbers to a new field | 72900 |
| Add or remove commas after/before house numbers | 905100 |
| Standardise house number and letter format | 19100000 |
| Parse and standardise the sorting code found after a postal town name (cédex etc.) | 11600 |
| Parse settlement names to a new field, correcting/standardising them at the same time. Works for over 20 million postal code/settlement name combinations! | 836700 |
| Standardize settlement names (Munich to München, Turin to Torino etc.) Works for over 20 million postal code/settlement name combinations! | 800000 |
| Parse and/or assign provinces and/or regions | 20000-28000 |
| Parse and standardize personal name forms of address (Mr, Dr, Mme etc.) | 695400 |
GRC Database Information
Nieuwe Prinsengracht 80-hs
1018 VV AMSTERDAM
The Netherlands