|
Charsep Download Releases Help Disclaimer You will easily find by yourself how the tool work. But here are some basic hints. The tool allows to load a file, display/edit it in a matrix, and save it. Each time you open a file you provide parameters, like the file path, separator character, the charset, etc... and each time you save it you provide similar parameters (like in a "Save as..." dialog in common applications). The application always starts by opening an "open file" dialog, then displays the main screen, showing a menu and a toolbar on top, the matrix, and an event log at the bottom. You can find on my YouTube channel, some videos that provide an
overview of features of Charsep. The two first ones show fundamental
concepts, the other ones go deeper into specific topics.
Help on specific topics |

|
Renumber rows in grid First column of matrix is not coming from the source file but displays the row position. It stays stable when a sorting is applied so you can revert back to file-order of rows. This “renumber” option allows you to "renumber" the rows of the grid, after a processing occurred - insertion and deletion or reordering of rows. Grid Information The "Grid information" command provides information on the content of the current matrix : number of rows, number of columns, number of selected rows, number of characters... For each column it also provides the minimum, maximum and average number of chars for not null values - and the number of values for this column. The last column displays the row position (Or number of rows if more than one) on which this column has a max or min size. Since you can sort this information or search (Contextual search with right-click) you can discover quite a lot of information on your file... Double-clicking information about a specific column provides the “profile” of this column – distribution factor and list of distinct values, and for numeric columns sum, standard deviation, mean and median value.. Similar information can be obtained through the “Get selected cols profile” button to get information across more than one column. You can either load from a file (enter path and file
name or click on '...' to browse your volumes) or user your clipboard
content to create a new file. Selecting the "encoding" allows to load
files stored using different charsets (If you have no idea - most files
are now using UTF8). The "separator" allows you to define which
character in the file is separating the different columns. By ticking
'1st row is Headers', you indicate to Charsep that the headers of the
matrix will be based on the first row of the file - if unticked, the
headers will just indicate a column sequence number. By ticking
'Double-quotes can encapsulate separator', the file will load as a
'csv' file should do, that is, separators wrapped in double-quotes will
not be counted and will be displayed in cell content.
When processing large files, it may be interesting to filter out some
information and load only partially the file.
These filtering options allow to view and process huge
files - but of course if you then
"save" your work, only this partial information will get saved.
Very similar to the "Open file" dialog options - it will allow you to
change
the separator or the character set of a file in a snap.
The "Save file" also allows to save the matrix content as a text in the
clipboard
(In such a case UTF-8 is the standard character set that cannot be
changed).
This
option opens the “Generate Random Grid” dialog. This allows you to
generate rapidly mock-data / dummy data, for testing purpose for
example. Left-side of the dialog allows you to define each column –
selecting a header label, and a data type :
Before adding a
column you can select the option to include some empty values in the
column – with the approximate percentage of empty values expected. The Random grid generation also propose the direct-to-file generation to build big files in very rapid timeframe - this function called "Export direct file" allows to generate either a CSV file or a JSON structure based on the selected columns and patterns. Direct streaming occurs therefore files of multiple millions of rows are not a problem to be generated. |
| The
toolbar commands The toolbar provides a quick search tool, that allows to search either a value in a cell of the grid or a value in the headers of the grid. It will move focus to position on the first found occurence - if you click again on "Search" or "Search headers" it will move focus to the next occurence. This search can be case-sensitive or not. In the toolbar, there is also a combo-box that allows to display the sequence number of the column, or a letter - in place of the header loaded from file. The "row selection" function (also available in contextual menu - see "Selection functions"), provides various ways to select rows from the grid : all rows, rows up to a row position or from a row position until the last one, or only "odd" or "even" rows. There is also a possibility to select an exact number of rows - evenly distributed in the grid (for instance, if you select 200 evenly distributed rows in a grid of 1000 rows, every 5th row will get selected - rows 1, 6, 11, 16,, 21, ...). The "process" function opens a dialog to configure and execute file-based processing. It also provides a convenient way to configure command-line configuration files ("cfg file") that can be used in command-line processing. File-based processing allows for ultra-fast processing of very large files without in-memory limitations. For details on type of file-based processing, check Command-line processing help. |