×
Instructions for Generating Network
We chose the CSV(Comma-Separated Values), format to import the table because it has a simple structure and makes it possible to create a lightweight file, even with a large volume of data. Furthermore, in this format the user is free to include data from other works or databases in their network and not just data from their research.
Attention: In the .csv format, columns are separated by certain delimiters, usually by commas ( , ) as the name suggests, but depending on the way these files are created, other characters can also be used for this, such as semicolons ( ; ) or tab (/t). To avoid errors, make sure your table uses a comma ( , ) as a delimiter. It is possible to determine this when creating these files in certain IDEs such as RStudio or Visual Studio Code or even creating the table in Excel and exporting it in this format or exporting it in XLSX and converting it later on online sites, such as "convertio".
Data strings (text) and column titles may or may not be enclosed in double quotation marks ("), but avoid mixing the two formatting options. In the template provided, the information is enclosed in double quotation marks.
The table must have three columns with the following titles: “nodes_id”, “nodes_color” and “connected_nodes”, RESPECTIVELY. The “nodes_id” column contains the titles of all nodes in the network, which can be names of diseases, genes, mutations, proteins, etc. In the “nodes_color” column, the color of the node is defined, where you can choose the following colors:
⬤ lime
⬤ yellow
⬤ orange
⬤ red
⬤ salmon
⬤ pink
⬤ violet
⬤ purple
⬤ blue
⬤ aqua
⬤ green
⬤ olive
The “connected_nodes” column will add the edges of the graph, in it you place the nodes connected to the node in the “nodes_id” column. You can place one connected node per line, so if there is more than one connection to the same node, you must repeat the same line, replacing only the value in “connected_nodes” or you can place all connected nodes on the same line, but separated by tabulation ( | ), if you choose this method, the table delimiters cannot under any circumstances be this character ( | ).
Attention: In the “nodes_color” column it is possible to add 12 different colors for the nodes, if there are more types of nodes in your network, you can choose one of the available colors and place it on all lines.
Attention: In the “connected_nodes” column it is not necessary to repeat the connections, that is, if you have already defined in one line: the node “disease-1” in “nodes_id” and “variant-1” in “connected_nodes”, it is not necessary put “disease-1” in the “connected_nodes” column of the “variant-1” node row, leaving the value blank in this case. Note that in our models, we only place values in the “connected_nodes” column in rows with nodes from the “green” category in the “nodes_color” column, as there are only two categories of nodes and the central node, connected to all other nodes in this example.
After finishing the CSV, import the file in the card import area on the side and click "generate" to process the file and create the network visualization.
In the future in this tool, we may include the option to submit VCF (Variant Call Format) files, in addition to the CSV option.