Introduction and Installation¶
Purpose¶
Kunteksto (ˈkänˌteksto) [1] is a tool to translate simple CSV formatted data files into computable, semantically enhanced knowledge representations. As a foundation for crowd sourced, automated knowledge base construction; it provides a path for existing data sets to be used in conjuntion with the emerging datacentric, model first approach in analysis, general artificial intelligence and decision support systems. This approach opens the door for the change to a datacentric world as opposed to the application-centric one we have now. This new approach enables automatic interoperability avoiding the data quality issues created through data cleaning and massaging.
The importance of how this simplifies query and analysis capabilities and improves data quality is discussed in foundational S3Model documentation and references. However, detailed understanding of S3Model is not required to understand and use the power of Kunteksto. Addtional information on the datacentric movement can be found below [2]
Target Audience¶
Kunteksto design is based around the ability for domain experts from any field, with very little programming ability to quickly annotate data extracts to improve the usability of the data. Data engineers and data scientists can also benefit from Kunteksto in the same ways as domain experts. It just takes a bit more research to discover the semantics that the original domain experts usually have at hand.
Installation¶
Cross-Platform on Anaconda¶
This is the preferred environment for a tool like Kunteksto because it integrates easily with systems for domain experts, data engineers and data scientists.
- Download and install Anaconda Python 3.6+ for your platform.
- More detailed Anaconda instructions are here if you prefer.
- If you are using Linux or Mac OSX, follow the instructions for your platform for lxml installation.
- Open a terminal window and create a conda environment. On Windows it is best to open an Anaconda Prompt terminal from the Anaconda menu:
conda create -p kunteksto python=3 pycurl ujson
- Change to the directory
cd kunteksto
- Activate the environment according to the instructions shown by Anaconda in the terminal window.
Windows
activate <path/to/directory>
or Linux/MacOSX
source activate <path/to/directory>
- install Kunteksto
pip install kunteksto
These four quick steps creates a virtual environment in the subdirectory kunteksto. When it is created conda displays how to activate the environment. Once activated then Kunteksto is installed in the subdirectoy along with the environment.
The last step is to do the Tutorial/Demo
Windows¶
Details coming soon. Preferably use the Anaconda instructions above.
Mac OSX¶
Follow the instructions for your platform for lxml installation.
Details coming soon. Preferably use the Anaconda instructions above.
Linux¶
Install Python 3.6.x See: Downloads
Follow the instructions for your platform for lxml installation.
Now that that is settled. The rest is easy.
Note
Option 1:
Use the Anaconda instructions above.
Note
Option 2:
Just create a virtual environment for Python 3.6 or later, using your favorite tool; conda, virtualenv, etc.
conda create -p kunteksto python=3
Activate your virtual environment and navigate to the kunteksto directory. Then install kunteksto using pip.
pip install kunteksto
After several minutes of installing all the cool stuff, you’ll be ready to begin the tutorial.
Go to the Tutorial/Demo
Update to a New Version¶
When you want to upgrade to a new version of Kunteksto you can just use the line below in your terminal where you have activated the virtual environment.
pip install kunteksto --upgrade --no-cache-dir
What are all the files for?¶
Depending on how and where you installed Kunteksto you will see a varying number of files and subdirectories. Many of them may be part of the Anaconda environment so we won’t be covering those.
Referenced from the kunteksto directory created at install time:
Files¶
- README.md
- A brief explaination of Kunteksto’s purpose and links to background information.
- kunteksto.conf
- The required configuration file for Kunteksto.
- LICENSE.txt
- A copy of the copyright notice and license.
Directories¶
- example_data
- Where you will find information and sample data files for the tutorials. There is also an example of the completed Demo database, model and data in the archive Demo_with_semantics.zip.
- output
- The default location of all generated output from Kunteksto. This can be changed either via the configuration file or via the commandline.
- catalogs
- Where Kunteksto places XML catalog files that are used in in the data validation process.
- s3model
- Support files required for operation.
- s3model.owl - the core S3Model ontology.
- s3model_3_0_0.xsd - the reference model schema version 3.0.0
- s3model_3_0_0.rdf - the extracted semantics from the reference model schema version 3.0.0
- s3model_3_0_0.xsl - a stylesheet providing visualization in a browsser of the reference model schema version 3.0.0
- dm-description.xsl - a stylesheet providing visualization in a browser of any S3Model data model. Just place it in the same directory with the dm-{uuid}.xsd file and open the schema in a browser.
Footnotes
[1] | S3Model is called the Esperanto of information management. Kunteksto is the Esperanto translation for Context. See Wikipedia for more information about the Esperanto language. |
[2] |