author     Matthias Baumgartner <dev@igsor.net>    2023-03-05 19:22:58 +0100
committer  Matthias Baumgartner <dev@igsor.net>    2023-03-05 19:22:58 +0100
commit     a35b33f4f1ddcf6f1bb8ab0f41b87bf2b847f11d (patch)
tree       fb220da28bb7248ebf37ce09af5de88f2c1aaad4
parent     7582c280ad5324a2f0427999911c7e7abc14a6ab (diff)
parent     af81318ae9311fd0b0e16949cef3cfaf7996970b (diff)
Merge branch 'develop' (HEAD, v0.23.03, release, main)
-rw-r--r--  .gitignore | 5
-rw-r--r--  .pylintrc | 17
-rw-r--r--  CHANGELOG.md | 20
-rw-r--r--  MANIFEST.in | 1
-rw-r--r--  README | 56
-rw-r--r--  README.md | 88
-rwxr-xr-x  bsie.app | 49
-rw-r--r--  bsie.toml | 11
-rw-r--r--  bsie/__init__.py | 6
-rw-r--r--  bsie/apps/__init__.py | 46
-rw-r--r--  bsie/apps/_loader.py | 47
-rw-r--r--  bsie/apps/default_config.yaml | 19
-rw-r--r--  bsie/apps/index.py | 65
-rw-r--r--  bsie/apps/info.py | 49
-rw-r--r--  bsie/base/__init__.py | 24
-rw-r--r--  bsie/extractor/__init__.py | 14
-rw-r--r--  bsie/extractor/base.py (renamed from bsie/base/extractor.py) | 35
-rw-r--r--  bsie/extractor/builder.py | 72
-rw-r--r--  bsie/extractor/generic/__init__.py | 5
-rw-r--r--  bsie/extractor/generic/constant.py | 14
-rw-r--r--  bsie/extractor/generic/path.py | 17
-rw-r--r--  bsie/extractor/generic/stat.py | 18
-rw-r--r--  bsie/extractor/image/__init__.py | 8
-rw-r--r--  bsie/extractor/image/colors_spatial.py | 150
-rw-r--r--  bsie/extractor/image/photometrics.py | 211
-rw-r--r--  bsie/extractor/preview.py | 96
-rw-r--r--  bsie/lib/__init__.py | 10
-rw-r--r--  bsie/lib/bsie.py | 21
-rw-r--r--  bsie/lib/builder.py | 75
-rw-r--r--  bsie/lib/naming_policy.py | 115
-rw-r--r--  bsie/lib/pipeline.py (renamed from bsie/tools/pipeline.py) | 44
-rw-r--r--  bsie/reader/__init__.py | 19
-rw-r--r--  bsie/reader/base.py (renamed from bsie/base/reader.py) | 13
-rw-r--r--  bsie/reader/builder.py | 73
-rw-r--r--  bsie/reader/chain.py | 86
-rw-r--r--  bsie/reader/exif.py | 44
-rw-r--r--  bsie/reader/image/__init__.py | 31
-rw-r--r--  bsie/reader/image/_pillow.py | 34
-rw-r--r--  bsie/reader/image/_raw.py | 56
-rw-r--r--  bsie/reader/path.py | 12
-rw-r--r--  bsie/reader/preview/__init__.py | 34
-rw-r--r--  bsie/reader/preview/_pg.py | 81
-rw-r--r--  bsie/reader/preview/_pillow.py | 39
-rw-r--r--  bsie/reader/preview/_rawpy.py | 61
-rw-r--r--  bsie/reader/preview/utils.py | 34
-rw-r--r--  bsie/reader/stat.py | 13
-rw-r--r--  bsie/tools/__init__.py | 20
-rw-r--r--  bsie/tools/builder.py | 226
-rw-r--r--  bsie/utils/__init__.py | 11
-rw-r--r--  bsie/utils/bsfs.py | 6
-rw-r--r--  bsie/utils/errors.py (renamed from bsie/base/errors.py) | 12
-rw-r--r--  bsie/utils/filematcher/__init__.py | 15
-rw-r--r--  bsie/utils/filematcher/matcher.py | 174
-rw-r--r--  bsie/utils/filematcher/parser.py | 141
-rw-r--r--  bsie/utils/loading.py | 49
-rw-r--r--  bsie/utils/namespaces.py | 33
-rw-r--r--  bsie/utils/node.py | 35
-rw-r--r--  doc/Makefile | 20
-rw-r--r--  doc/make.bat | 35
-rw-r--r--  doc/source/architecture.rst | 71
-rw-r--r--  doc/source/conf.py | 37
-rw-r--r--  doc/source/index.rst | 26
-rw-r--r--  doc/source/installation.rst | 75
-rw-r--r--  setup.py | 71
-rw-r--r--  test/apps/test_index.py | 372
-rw-r--r--  test/apps/test_info.py | 80
-rw-r--r--  test/apps/test_loader.py | 83
-rw-r--r--  test/apps/test_main.py | 57
-rw-r--r--  test/apps/testdir/testimage.jpg | bin 0 -> 349264 bytes
-rw-r--r--  test/extractor/generic/test_constant.py | 29
-rw-r--r--  test/extractor/generic/test_path.py | 25
-rw-r--r--  test/extractor/generic/test_stat.py | 25
-rw-r--r--  test/extractor/image/__init__.py (renamed from test/base/__init__.py) | 0
-rw-r--r--  test/extractor/image/test_colors_spatial.py | 95
-rw-r--r--  test/extractor/image/test_photometrics.py | 143
-rw-r--r--  test/extractor/image/testimage.jpg | bin 0 -> 349264 bytes
-rw-r--r--  test/extractor/test_base.py (renamed from test/base/test_extractor.py) | 27
-rw-r--r--  test/extractor/test_builder.py | 98
-rw-r--r--  test/extractor/test_preview.py | 123
-rw-r--r--  test/extractor/testimage.jpg | bin 0 -> 6476 bytes
-rw-r--r--  test/lib/test_bsie.py | 83
-rw-r--r--  test/lib/test_builder.py | 101
-rw-r--r--  test/lib/test_naming_policy.py | 115
-rw-r--r--  test/lib/test_pipeline.py (renamed from test/tools/test_pipeline.py) | 58
-rw-r--r--  test/reader/image/__init__.py (renamed from test/tools/__init__.py) | 0
-rw-r--r--  test/reader/image/load_nef.py | 23
-rw-r--r--  test/reader/image/test_image.py | 49
-rw-r--r--  test/reader/image/test_pillow.py | 39
-rw-r--r--  test/reader/image/test_raw_image.py | 48
-rw-r--r--  test/reader/image/testimage.jpg | bin 0 -> 518 bytes
-rw-r--r--  test/reader/preview/__init__.py | 0
-rw-r--r--  test/reader/preview/invalid.foo | 0
-rw-r--r--  test/reader/preview/invalid.jpg | 0
-rw-r--r--  test/reader/preview/load_nef.py | 23
-rw-r--r--  test/reader/preview/test_pg.py | 78
-rw-r--r--  test/reader/preview/test_pillow.py | 49
-rw-r--r--  test/reader/preview/test_preview.py | 72
-rw-r--r--  test/reader/preview/test_rawpy.py | 54
-rw-r--r--  test/reader/preview/test_utils.py | 39
-rw-r--r--  test/reader/preview/testfile.pdf | bin 0 -> 7295 bytes
-rw-r--r--  test/reader/preview/testimage.jpg | bin 0 -> 6476 bytes
-rw-r--r--  test/reader/test_base.py (renamed from test/base/test_reader.py) | 11
-rw-r--r--  test/reader/test_builder.py | 49
-rw-r--r--  test/reader/test_chain.py | 80
-rw-r--r--  test/reader/test_exif.py | 52
-rw-r--r--  test/reader/test_path.py | 7
-rw-r--r--  test/reader/test_stat.py | 9
-rw-r--r--  test/reader/testimage_exif.jpg | bin 0 -> 719 bytes
-rw-r--r--  test/reader/testimage_exif_corrupted.jpg | bin 0 -> 551 bytes
-rw-r--r--  test/tools/test_builder.py | 246
-rw-r--r--  test/tools/testfile.t | 1
-rw-r--r--  test/utils/filematcher/__init__.py | 0
-rw-r--r--  test/utils/filematcher/empty | 0
-rw-r--r--  test/utils/filematcher/test_matcher.py | 227
-rw-r--r--  test/utils/filematcher/test_parser.py | 141
-rw-r--r--  test/utils/filematcher/testimage.jpg | bin 0 -> 518 bytes
-rw-r--r--  test/utils/filematcher/textfile.t | 4
-rw-r--r--  test/utils/test_loading.py | 43
-rw-r--r--  test/utils/test_node.py | 79
119 files changed, 4836 insertions, 1196 deletions
diff --git a/.gitignore b/.gitignore
index e45b114..d2785ad 100644
--- a/.gitignore
+++ b/.gitignore
@@ -21,5 +21,10 @@ build/
# doc builds
doc/build/
+doc/source/api
+
+# testing data
+test/reader/image/testimage.nef*
+test/reader/preview/testimage.nef*
## EOF ##
diff --git a/.pylintrc b/.pylintrc
index 1b34854..576e81a 100644
--- a/.pylintrc
+++ b/.pylintrc
@@ -145,6 +145,21 @@ logging-format-style=old
+[MESSAGES CONTROL]
+
+# disable similarities check
+disable=raw-checker-failed,
+ bad-inline-option,
+ locally-disabled,
+ file-ignored,
+ suppressed-message,
+ useless-suppression,
+ deprecated-pragma,
+ use-symbolic-message-instead,
+ duplicate-code
+
+
+
[MISCELLANEOUS]
# List of note tags to take in consideration, separated by a comma.
@@ -188,6 +203,4 @@ ignore-none=no
callbacks=clbk,callback
-
-
# Disable: R1735 (use-dict-literal)
diff --git a/CHANGELOG.md b/CHANGELOG.md
new file mode 100644
index 0000000..1240091
--- /dev/null
+++ b/CHANGELOG.md
@@ -0,0 +1,20 @@
+
+# Changelog
+
+## 0.23.03 (Initial release)
+
+### Added
+
+- Information extraction pipeline essentials
+- Filematcher to check the file type and format
+- Index app to run the IE pipeline
+- Initial documentation
+- Basic extractors
+ - Constant
+ - Filename
+ - Filesize
+ - Previews
+- Image extractors
+ - Exif
+ - Regionally dominant colors
+
diff --git a/MANIFEST.in b/MANIFEST.in
new file mode 100644
index 0000000..c4b7734
--- /dev/null
+++ b/MANIFEST.in
@@ -0,0 +1 @@
+include bsie/apps/default_config.yaml
diff --git a/README b/README
deleted file mode 100644
index 3326196..0000000
--- a/README
+++ /dev/null
@@ -1,56 +0,0 @@
-
-Black Star Information Extraction
-=================================
-
-
-### Developer tools setup
-
-#### Test coverage (coverage)
-
-Resources:
-* https://coverage.readthedocs.io/en/6.5.0/index.html
-* https://nedbatchelder.com/blog/200710/flaws_in_coverage_measurement.html
-
-Commands:
-$ pip install coverage
-$ coverage run ; coverage html ; xdg-open .htmlcov/index.html
-
-
-
-#### Static code analysis (pylint)
-
-Resources:
-* https://github.com/PyCQA/pylint
-* https://pylint.org/
-* https://pylint.pycqa.org/en/latest/user_guide/messages/messages_overview.html#messages-overview
-
-Commands:
-$ pip install pylint
-$ pylint bsie
-
-
-
-#### Type analysis (mypy)
-
-Resources:
-* https://github.com/python/mypy
-* https://mypy.readthedocs.io/en/stable/
-
-Commands:
-$ pip install mypy
-$ mypy
-
-
-
-#### Documentation (sphinx)
-
-Resources:
-*
-*
-
-Commands:
-$ pip install ...
-$
-
-
-
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..eee19f7
--- /dev/null
+++ b/README.md
@@ -0,0 +1,88 @@
+
+# Black Star Information Extraction
+
+The Black Star Information Extraction (BSIE) package provides a pipeline
+to extract metadata and content-derived features from files and stores
+that information in a BSFS storage.
+
+## Installation
+
+You can install BSIE via pip. BSIE comes with support for various file formats,
+which requires a number of external packages. BSIE lets you control
+which of these you want to install. Note that if you choose not to install
+support for some file types, BSIE will show a warning and skip those files.
+All other formats will be processed normally.
+
+To install only the minimally required software, use:
+
+ $ pip install --extra-index-url https://pip.bsfs.io bsie
+
+To install all dependencies, use the following shortcut:
+
+ $ pip install --extra-index-url https://pip.bsfs.io bsie[all]
+
+To install a subset of all dependencies, modify the extras part (``[image,preview]``)
+of the following command to your liking:
+
+ $ pip install --extra-index-url https://pip.bsfs.io bsie[image,preview]
+
+Currently, BSIE provides the following extra flags:
+
+* image: Read data from image files.
+ Note that you may also have to install ``exiftool`` through your system's
+ package manager (e.g. ``sudo apt install exiftool``).
+* preview: Create previews from a variety of files.
+ Note that support for various file formats also depends on what
+ system packages you've installed. You should at least install ``imagemagick``
+ through your system's package manager (e.g. ``sudo apt install imagemagick``).
+ See [Preview Generator](https://github.com/algoo/preview-generator) for
+ more detailed instructions.
+* features: Extract feature vectors from images.
+
+
+## Development
+
+Set up a virtual environment:
+
+ $ virtualenv env
+ $ source env/bin/activate
+
+Install bsie as editable from the git repository:
+
+ $ git clone https://git.bsfs.io/bsie.git
+ $ cd bsie
+ $ pip install -e .[all]
+
+If you want to develop (*dev*), run the tests (*test*), edit the
+documentation (*doc*), or build a distributable (*build*),
+install bsie with the respective extras (in addition to file format extras):
+
+ $ pip install -e .[dev,doc,build,test]
+
+Or, you can manually install the following packages besides BSIE:
+
+ $ pip install coverage mypy pylint
+ $ pip install rdflib requests types-PyYAML
+ $ pip install sphinx sphinx-copybutton furo
+ $ pip install build
+
+To ensure code style discipline, run the following commands:
+
+ $ coverage run ; coverage html ; xdg-open .htmlcov/index.html
+ $ pylint bsie
+ $ mypy
+
+To build the package, do:
+
+ $ python -m build
+
+To run only the tests (without coverage), run the following command from the **test folder**:
+
+ $ python -m unittest
+
+To build the documentation, run the following commands from the **doc folder**:
+
+ $ sphinx-apidoc -f -o source/api ../bsie/ --module-first -d 1 --separate
+ $ make html
+ $ xdg-open build/html/index.html
+
diff --git a/bsie.app b/bsie.app
index ba9cee7..0f6f7bc 100755
--- a/bsie.app
+++ b/bsie.app
@@ -1,49 +1,6 @@
-"""BSIE tools.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
-import argparse
-import typing
-
-# module imports
-import bsie
-import bsie.apps
-
-# exports
-__all__: typing.Sequence[str] = (
- 'main',
- )
-
-# config
-apps = {
- 'index' : bsie.apps.index,
- 'info' : bsie.apps.info,
- }
-
-
-## code ##
-
-def main(argv):
- """Black Star File System maintenance tools."""
- parser = argparse.ArgumentParser(description=main.__doc__, prog='bsie')
- parser.add_argument('--version', action='version',
- version='%(prog)s version {}.{}.{}'.format(*bsie.version_info))
- parser.add_argument('app', choices=apps.keys(),
- help='Select the application to run.')
- parser.add_argument('rest', nargs=argparse.REMAINDER)
- # parse
- args = parser.parse_args()
- # run application
- apps[args.app](args.rest)
-
-
-## main ##
-
+#!/usr/bin/env python3
if __name__ == '__main__':
+ import bsie.apps
import sys
- main(sys.argv[1:])
+ bsie.apps.main(sys.argv[1:])
-## EOF ##
diff --git a/bsie.toml b/bsie.toml
deleted file mode 100644
index 10b0f37..0000000
--- a/bsie.toml
+++ /dev/null
@@ -1,11 +0,0 @@
-[project]
-name = "bsie"
-description = "Extract information from files and store them in a BSFS."
-version = "0.0.1"
-license = {text = "BSD 3-Clause License"}
-authors = [{name='Matthias Baumgartner', email="dev@igsor.net"}]
-dependencies = [
- "rdflib",
- "bsfs",
-]
-requires-python = ">=3.7"
diff --git a/bsie/__init__.py b/bsie/__init__.py
index 8d2308c..f6f2ff2 100644
--- a/bsie/__init__.py
+++ b/bsie/__init__.py
@@ -1,10 +1,6 @@
"""The BSIE module extracts triples from files for insertion into a BSFS storage.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import collections
import typing
diff --git a/bsie/apps/__init__.py b/bsie/apps/__init__.py
index a548c3c..2fe4795 100644
--- a/bsie/apps/__init__.py
+++ b/bsie/apps/__init__.py
@@ -1,12 +1,13 @@
+#!/usr/bin/env python3
+"""BSIE tools.
"""
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
+import argparse
import typing
+# bsie imports
+import bsie
+
# inner-module imports
from .index import main as index
from .info import main as info
@@ -15,6 +16,39 @@ from .info import main as info
__all__: typing.Sequence[str] = (
'index',
'info',
+ 'main',
)
+# config
+apps = {
+ 'index' : index,
+ 'info' : info,
+ }
+
+
+## code ##
+
+def main(argv=None):
+ """Black Star File System maintenance tools."""
+ parser = argparse.ArgumentParser(description=main.__doc__, prog='bsie')
+ # version
+ parser.add_argument('--version', action='version',
+ version='%(prog)s version {}.{}.{}'.format(*bsie.version_info)) # pylint: disable=C0209
+ # application selection
+ parser.add_argument('app', choices=apps.keys(),
+ help='Select the application to run.')
+ # dangling args
+ parser.add_argument('rest', nargs=argparse.REMAINDER)
+ # parse
+ args = parser.parse_args(argv)
+ # run application
+ apps[args.app](args.rest)
+
+
+## main ##
+
+if __name__ == '__main__':
+ import sys
+ main(sys.argv[1:])
+
## EOF ##
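
With the dispatcher moved into `bsie.apps`, the script above becomes a thin wrapper around `bsie.apps.main`. A minimal sketch of calling the dispatcher programmatically, using only what the hunk above defines (`main` and the `info` sub-app):

    # Sketch: dispatch to the 'info' sub-app, which prints the extractable predicates.
    from bsie.apps import main

    main(['info', 'predicates'])
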
diff --git a/bsie/apps/_loader.py b/bsie/apps/_loader.py
new file mode 100644
index 0000000..6411f10
--- /dev/null
+++ b/bsie/apps/_loader.py
@@ -0,0 +1,47 @@
+
+# standard imports
+import typing
+
+# external imports
+import yaml
+
+# bsie imports
+from bsie.extractor import ExtractorBuilder
+from bsie.lib import PipelineBuilder
+from bsie.lib.pipeline import Pipeline
+from bsie.reader import ReaderBuilder
+
+# constants
+DEFAULT_CONFIG_FILE = 'default_config.yaml'
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'DEFAULT_CONFIG_FILE',
+ 'load_pipeline',
+ )
+
+
+## code ##
+
+def load_pipeline(path: str) -> Pipeline:
+ """Load a pipeline according to a config at *path*."""
+ # load config file
+ with open(path, 'rt', encoding='utf-8') as ifile:
+ cfg = yaml.safe_load(ifile)
+
+ # reader builder
+ rbuild = ReaderBuilder(cfg['ReaderBuilder'])
+ # extractor builder
+ ebuild = ExtractorBuilder(cfg['ExtractorBuilder'])
+ # pipeline builder
+ pbuild = PipelineBuilder(
+ rbuild,
+ ebuild,
+ )
+ # build pipeline
+ pipeline = pbuild.build()
+
+ # return pipeline
+ return pipeline
+
+## EOF ##
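
The new loader is the single place where a YAML config becomes a ready-to-use Pipeline. A minimal usage sketch, assuming the default config that ships with the package (this mirrors what the info app further below does):

    # Sketch: load the default pipeline and list the predicates it can produce.
    import os

    from bsie.apps import _loader

    config = os.path.join(os.path.dirname(_loader.__file__), _loader.DEFAULT_CONFIG_FILE)
    pipeline = _loader.load_pipeline(config)
    for pred in pipeline.schema.predicates():
        print(pred.uri)
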
diff --git a/bsie/apps/default_config.yaml b/bsie/apps/default_config.yaml
new file mode 100644
index 0000000..a59b0f3
--- /dev/null
+++ b/bsie/apps/default_config.yaml
@@ -0,0 +1,19 @@
+
+ReaderBuilder: {}
+
+ExtractorBuilder:
+
+ - bsie.extractor.preview.Preview:
+ max_sides: [50, 100, 200,400]
+
+ - bsie.extractor.generic.path.Path: {}
+
+ - bsie.extractor.generic.stat.Stat: {}
+
+ - bsie.extractor.image.colors_spatial.ColorsSpatial:
+ width: 32
+ height: 32
+ exp: 4
+
+ - bsie.extractor.image.photometrics.Exif: {}
+
diff --git a/bsie/apps/index.py b/bsie/apps/index.py
index 1dbfdd8..d64e8c2 100644
--- a/bsie/apps/index.py
+++ b/bsie/apps/index.py
@@ -1,19 +1,15 @@
-"""
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import argparse
import os
import typing
# bsie imports
-from bsie.base import errors
-from bsie.lib import BSIE
-from bsie.tools import builder
-from bsie.utils import bsfs
+from bsie.lib import BSIE, DefaultNamingPolicy
+from bsie.utils import bsfs, errors, node as node_
+
+# inner-module imports
+from . import _loader
# exports
__all__: typing.Sequence[str] = (
@@ -26,7 +22,12 @@ __all__: typing.Sequence[str] = (
def main(argv):
"""Index files or directories into BSFS."""
parser = argparse.ArgumentParser(description=main.__doc__, prog='index')
- parser.add_argument('--user', type=bsfs.URI, default=bsfs.URI('http://example.com/me'),
+ parser.add_argument('--config', type=str,
+ default=os.path.join(os.path.dirname(__file__), _loader.DEFAULT_CONFIG_FILE),
+ help='Path to the config file.')
+ parser.add_argument('--host', type=bsfs.URI, default=bsfs.URI('http://example.com'),
+ help='')
+ parser.add_argument('--user', type=str, default='me',
help='')
parser.add_argument('--collect', action='append', default=[],
help='')
@@ -42,35 +43,15 @@ def main(argv):
help='')
args = parser.parse_args(argv)
- # FIXME: Read reader/extractor configs from a config file
- # reader builder
- rbuild = builder.ReaderBuilder({})
- # extractor builder
- ebuild = builder.ExtractorBuilder([
- {'bsie.extractor.generic.path.Path': {}},
- {'bsie.extractor.generic.stat.Stat': {}},
- {'bsie.extractor.generic.constant.Constant': dict(
- tuples=[('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I')],
- schema='''
- bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:string ;
- bsfs:unique "true"^^xsd:boolean .
- ''',
- )},
- ])
- # pipeline builder
- pbuild = builder.PipelineBuilder(
- bsfs.Namespace(args.user + ('/' if not args.user.endswith('/') else '')),
- rbuild,
- ebuild,
- )
-
# build pipeline
- pipeline = pbuild.build()
+ pipeline = _loader.load_pipeline(args.config)
+ # build the naming policy
+ naming_policy = DefaultNamingPolicy(
+ host=args.host,
+ user=args.user,
+ )
# build BSIE frontend
- bsie = BSIE(pipeline, args.collect, args.discard)
-
+ bsie = BSIE(pipeline, naming_policy, args.collect, args.discard)
def walk(handle):
"""Walk through given input files."""
@@ -78,11 +59,12 @@ def main(argv):
# FIXME: simplify code (below but maybe also above)
# FIXME: How to handle dependencies between data?
# E.g. do I still want to link to a tag despite not being permitted to set its label?
- # FIXME: node renaming?
# index input paths
for path in args.input_file:
- if os.path.isdir(path) and args.recursive:
+ if not os.path.exists(path):
+ pass # FIXME: notify the user
+ elif os.path.isdir(path) and args.recursive:
for dirpath, _, filenames in os.walk(path, topdown=True, followlinks=args.follow):
for filename in filenames:
for node, pred, value in bsie.from_file(os.path.join(dirpath, filename)):
@@ -105,13 +87,14 @@ def main(argv):
store.migrate(bsie.schema)
# process files
def handle(node, pred, value):
+ if isinstance(value, node_.Node):
+ value = store.node(value.node_type, value.uri)
store.node(node.node_type, node.uri).set(pred.uri, value)
walk(handle)
# return store
return store
-
## main ##
if __name__ == '__main__':
diff --git a/bsie/apps/info.py b/bsie/apps/info.py
index eaf1f71..e27b70b 100644
--- a/bsie/apps/info.py
+++ b/bsie/apps/info.py
@@ -1,18 +1,15 @@
-"""
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import argparse
+import os
import sys
import typing
# bsie imports
-from bsie.base import errors
-from bsie.tools import builder
-from bsie.utils import bsfs
+from bsie.utils import bsfs, errors
+
+# inner-module imports
+from . import _loader
# exports
__all__: typing.Sequence[str] = (
@@ -25,42 +22,24 @@ __all__: typing.Sequence[str] = (
def main(argv):
"""Show information from BSIE."""
parser = argparse.ArgumentParser(description=main.__doc__, prog='info')
- parser.add_argument('what', choices=('predicates', ),
+ parser.add_argument('--config', type=str,
+ default=os.path.join(os.path.dirname(__file__), _loader.DEFAULT_CONFIG_FILE),
+ help='Path to the config file.')
+ parser.add_argument('what', choices=('predicates', 'schema'),
help='Select what information to show.')
args = parser.parse_args(argv)
- # FIXME: Read reader/extractor configs from a config file
- # reader builder
- rbuild = builder.ReaderBuilder({})
- # extractor builder
- ebuild = builder.ExtractorBuilder([
- {'bsie.extractor.generic.path.Path': {}},
- {'bsie.extractor.generic.stat.Stat': {}},
- {'bsie.extractor.generic.constant.Constant': dict(
- tuples=[('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I')],
- schema='''
- bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:string ;
- bsfs:unique "true"^^xsd:boolean .
- ''',
- )},
- ])
- # pipeline builder
- pbuild = builder.PipelineBuilder(
- bsfs.Namespace('http://example.com/me/'), # not actually used
- rbuild,
- ebuild,
- )
-
# build pipeline
- pipeline = pbuild.build()
+ pipeline = _loader.load_pipeline(args.config)
# show info
if args.what == 'predicates':
# show predicates
for pred in pipeline.schema.predicates():
print(pred.uri)
+ elif args.what == 'schema':
+ # show schema
+ print(bsfs.schema.to_string(pipeline.schema))
else:
# args.what is already checked by argparse
raise errors.UnreachableError()
diff --git a/bsie/base/__init__.py b/bsie/base/__init__.py
deleted file mode 100644
index 0d362cd..0000000
--- a/bsie/base/__init__.py
+++ /dev/null
@@ -1,24 +0,0 @@
-"""The base module defines the BSIE interfaces.
-
-You'll mostly find abstract classes here.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
-import typing
-
-# inner-module imports
-from . import errors
-from .extractor import Extractor
-from .reader import Reader
-
-# exports
-__all__: typing.Sequence[str] = (
- 'Extractor',
- 'Reader',
- 'errors',
- )
-
-## EOF ##
diff --git a/bsie/extractor/__init__.py b/bsie/extractor/__init__.py
index ef31343..36fa9ba 100644
--- a/bsie/extractor/__init__.py
+++ b/bsie/extractor/__init__.py
@@ -2,14 +2,18 @@
Each Extractor class is linked to the Reader class whose content it requires.
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
+# inner-module imports
+from .base import Extractor
+from .builder import ExtractorBuilder
+
# exports
-__all__: typing.Sequence[str] = []
+__all__: typing.Sequence[str] = (
+ 'Extractor',
+ 'ExtractorBuilder',
+ )
## EOF ##
diff --git a/bsie/base/extractor.py b/bsie/extractor/base.py
index c44021b..f92d7cc 100644
--- a/bsie/base/extractor.py
+++ b/bsie/extractor/base.py
@@ -1,10 +1,6 @@
"""The Extractor classes transform content into triples.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import abc
import typing
@@ -28,16 +24,32 @@ SCHEMA_PREAMBLE = '''
prefix schema: <http://schema.org/>
# common bsfs prefixes
- prefix bsfs: <http://bsfs.ai/schema/>
- prefix bse: <http://bsfs.ai/schema/Entity#>
+ prefix bsfs: <https://schema.bsfs.io/core/>
+ prefix bsl: <https://schema.bsfs.io/core/Literal/>
+ prefix bsa: <https://schema.bsfs.io/core/Literal/Array/>
+ prefix bsd: <https://schema.bsfs.io/core/distance#>
+
+ prefix bsie: <https://schema.bsfs.io/ie/>
+ prefix bsn: <https://schema.bsfs.io/ie/Node/>
+ prefix bse: <https://schema.bsfs.io/ie/Node/Entity#>
+ prefix bsp: <https://schema.bsfs.io/ie/Node/Preview#>
+
+ # default definitions
+ bsl:Array rdfs:subClassOf bsfs:Literal .
+ bsl:Number rdfs:subClassOf bsfs:Literal .
+ bsl:Time rdfs:subClassOf bsfs:Literal .
+ bsa:Feature rdfs:subClassOf bsl:Array ;
+ bsfs:dimension "1"^^xsd:integer ;
+ bsfs:dtype <https://schema.bsfs.io/core/dtype#f16> ;
+ bsfs:distance bsd:euclidean .
# essential nodes
- bsfs:Entity rdfs:subClassOf bsfs:Node .
- bsfs:File rdfs:subClassOf bsfs:Entity .
+ bsn:Entity rdfs:subClassOf bsfs:Node .
# common definitions
xsd:string rdfs:subClassOf bsfs:Literal .
- xsd:integer rdfs:subClassOf bsfs:Literal .
+ xsd:integer rdfs:subClassOf bsl:Number .
+ xsd:float rdfs:subClassOf bsl:Number .
'''
@@ -83,7 +95,7 @@ class Extractor(abc.ABC):
@property
def principals(self) -> typing.Iterator[bsfs.schema.Predicate]:
"""Return the principal predicates, i.e., relations from/to the extraction subject."""
- ent = self.schema.node(ns.bsfs.Entity)
+ ent = self.schema.node(ns.bsn.Entity)
return (
pred
for pred
@@ -99,5 +111,6 @@ class Extractor(abc.ABC):
principals: typing.Iterable[bsfs.schema.Predicate],
) -> typing.Iterator[typing.Tuple[node.Node, bsfs.schema.Predicate, typing.Any]]:
"""Return (node, predicate, value) triples."""
+ # FIXME: type annotation could be more strict: value is Hashable
## EOF ##
diff --git a/bsie/extractor/builder.py b/bsie/extractor/builder.py
new file mode 100644
index 0000000..d691b0e
--- /dev/null
+++ b/bsie/extractor/builder.py
@@ -0,0 +1,72 @@
+
+# standard imports
+import typing
+
+# bsie imports
+from bsie.utils import bsfs, errors, safe_load, unpack_qualified_name
+
+# inner-module imports
+from . import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'ExtractorBuilder',
+ )
+
+
+## code ##
+
+class ExtractorBuilder():
+ """Build `bsie.base.Extractor instances.
+
+ It is permissible to build multiple instances of the same extractor
+ (typically with different arguments), hence the ExtractorBuilder
+ receives a list of build specifications. Each specification is
+ a dict with a single key (extractor's qualified name) and a dict
+ to be used as keyword arguments.
+ Example: [{'bsie.extractor.generic.path.Path': {}}, ]
+
+ """
+
+ # build specifications
+ _specs: typing.List[typing.Dict[str, typing.Dict[str, typing.Any]]]
+
+ def __init__(self, specs: typing.List[typing.Dict[str, typing.Dict[str, typing.Any]]]):
+ self._specs = specs
+
+ def __iter__(self) -> typing.Iterator[int]:
+ """Iterate over extractor specifications."""
+ return iter(range(len(self._specs)))
+
+ def build(self, index: int) -> base.Extractor:
+ """Return an instance of the n'th extractor (n=*index*)."""
+ # get build instructions
+ specs = self._specs[index]
+
+ # check specs structure; expecting [{name: {kwargs}}]
+ if not isinstance(specs, dict):
+ raise TypeError(f'expected a dict, found {bsfs.typename(specs)}')
+ if len(specs) != 1:
+ raise TypeError(f'expected a dict of length one, found {len(specs)}')
+
+ # get name and args from specs
+ name = next(iter(specs.keys()))
+ kwargs = specs[name]
+
+ # check kwargs structure
+ if not isinstance(kwargs, dict):
+ raise TypeError(f'expected a dict, found {bsfs.typename(kwargs)}')
+
+ # check name and get module/class components
+ module_name, class_name = unpack_qualified_name(name)
+
+ # import extractor class
+ cls = safe_load(module_name, class_name)
+
+ try: # build and return instance
+ return cls(**kwargs)
+
+ except Exception as err:
+ raise errors.BuilderError(f'failed to build extractor {name} due to {bsfs.typename(err)}: {err}') from err
+
+## EOF ##
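
Each build specification is a single-key dict mapping an extractor's qualified name to its keyword arguments, as the docstring above notes. A minimal sketch of driving the builder (the two specs are taken from the docstring example and the default config):

    # Sketch: build one extractor per specification; __iter__ yields the spec indices.
    from bsie.extractor import ExtractorBuilder

    ebuild = ExtractorBuilder([
        {'bsie.extractor.generic.path.Path': {}},
        {'bsie.extractor.generic.stat.Stat': {}},
    ])
    extractors = [ebuild.build(index) for index in ebuild]
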
diff --git a/bsie/extractor/generic/__init__.py b/bsie/extractor/generic/__init__.py
index 0cb7e7f..46a4bd6 100644
--- a/bsie/extractor/generic/__init__.py
+++ b/bsie/extractor/generic/__init__.py
@@ -3,11 +3,8 @@ files. Examples include file system information (file name and size, mime type,
etc.) and information that is independent of the actual file (constant triples,
host platform infos, current time, etc.).
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# exports
diff --git a/bsie/extractor/generic/constant.py b/bsie/extractor/generic/constant.py
index 11384e6..7acbe95 100644
--- a/bsie/extractor/generic/constant.py
+++ b/bsie/extractor/generic/constant.py
@@ -1,16 +1,14 @@
"""The Constant extractor produces pre-specified triples.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# bsie imports
-from bsie.base import extractor
from bsie.utils import bsfs, node
+# inner-module imports
+from .. import base
+
# exports
__all__: typing.Sequence[str] = (
'Constant',
@@ -19,7 +17,7 @@ __all__: typing.Sequence[str] = (
## code ##
-class Constant(extractor.Extractor):
+class Constant(base.Extractor):
"""Extract information from file's path."""
CONTENT_READER = None
@@ -32,7 +30,7 @@ class Constant(extractor.Extractor):
schema: str,
tuples: typing.Iterable[typing.Tuple[bsfs.URI, typing.Any]],
):
- super().__init__(bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + schema))
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + schema))
# NOTE: Raises a KeyError if the predicate is not part of the schema
self._tuples = tuple((self.schema.predicate(p_uri), value) for p_uri, value in tuples)
# TODO: use schema instance for value checking
diff --git a/bsie/extractor/generic/path.py b/bsie/extractor/generic/path.py
index 7018e12..00c1121 100644
--- a/bsie/extractor/generic/path.py
+++ b/bsie/extractor/generic/path.py
@@ -1,15 +1,10 @@
-"""
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import os
import typing
# bsie imports
-from bsie.base import extractor
+from bsie.extractor import base
from bsie.utils import bsfs, node, ns
# exports
@@ -20,7 +15,7 @@ __all__: typing.Sequence[str] = (
## code ##
-class Path(extractor.Extractor):
+class Path(base.Extractor):
"""Extract information from file's path."""
CONTENT_READER = 'bsie.reader.path.Path'
@@ -29,13 +24,13 @@ class Path(extractor.Extractor):
_callmap: typing.Dict[bsfs.schema.Predicate, typing.Callable[[str], typing.Any]]
def __init__(self):
- super().__init__(bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
bse:filename rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
rdfs:label "File name"^^xsd:string ;
schema:description "Filename of entity in some filesystem."^^xsd:string ;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
'''))
self._callmap = {
self.schema.predicate(ns.bse.filename): self.__filename,
diff --git a/bsie/extractor/generic/stat.py b/bsie/extractor/generic/stat.py
index 0b9ce29..92b51f3 100644
--- a/bsie/extractor/generic/stat.py
+++ b/bsie/extractor/generic/stat.py
@@ -1,17 +1,15 @@
"""Extract information from the file system, such as filesize.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import os
import typing
# bsie imports
-from bsie.base import extractor
from bsie.utils import bsfs, node, ns
+# inner-module imports
+from .. import base
+
# exports
__all__: typing.Sequence[str] = (
'Stat',
@@ -20,7 +18,7 @@ __all__: typing.Sequence[str] = (
## code ##
-class Stat(extractor.Extractor):
+class Stat(base.Extractor):
"""Extract information from the file system."""
CONTENT_READER = 'bsie.reader.stat.Stat'
@@ -29,13 +27,13 @@ class Stat(extractor.Extractor):
_callmap: typing.Dict[bsfs.schema.Predicate, typing.Callable[[os.stat_result], typing.Any]]
def __init__(self):
- super().__init__(bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
bse:filesize rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer ;
rdfs:label "File size"^^xsd:string ;
schema:description "File size of entity in some filesystem."^^xsd:string ;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
'''))
self._callmap = {
self.schema.predicate(ns.bse.filesize): self.__filesize,
diff --git a/bsie/extractor/image/__init__.py b/bsie/extractor/image/__init__.py
new file mode 100644
index 0000000..f82424a
--- /dev/null
+++ b/bsie/extractor/image/__init__.py
@@ -0,0 +1,8 @@
+
+# standard imports
+import typing
+
+# exports
+__all__: typing.Sequence[str] = []
+
+## EOF ##
diff --git a/bsie/extractor/image/colors_spatial.py b/bsie/extractor/image/colors_spatial.py
new file mode 100644
index 0000000..e6661a9
--- /dev/null
+++ b/bsie/extractor/image/colors_spatial.py
@@ -0,0 +1,150 @@
+"""Spatial color features.
+"""
+# standard imports
+import typing
+
+# external imports
+import PIL.Image
+import numpy as np
+
+# bsie imports
+from bsie.utils import bsfs, node, ns
+
+# inner-module imports
+from .. import base
+
+# constants
+FEATURE_NAME = ns.bsf.ColorsSpatial()
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'ColorsSpatial',
+ )
+
+
+## code ##
+
+class ColorsSpatial(base.Extractor):
+ """Determine dominant colors of subregions in the image.
+
+ Computes the dominant color of increasingly smaller subregions of the image.
+ """
+
+ CONTENT_READER = 'bsie.reader.image.Image'
+
+ # Initial subregion width.
+ width: int
+
+ # Initial subregion height.
+ height: int
+
+ # Decrement exponent.
+ exp: float
+
+ # Principal predicate's URI.
+ _predicate_name: bsfs.URI
+
+ def __init__(
+ self,
+ width: int = 32,
+ height: int = 32,
+ exp: float = 4.,
+ ):
+ # instance identifier
+ uuid = bsfs.uuid.UCID.from_dict({
+ 'width': width,
+ 'height': height,
+ 'exp': exp,
+ })
+ # determine symbol names
+ instance_name = getattr(FEATURE_NAME, uuid)
+ predicate_name = getattr(ns.bse, 'colors_spatial_' + uuid)
+ # get vector dimension
+ dimension = self.dimension(width, height, exp)
+ # initialize parent with the schema
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + f'''
+ <{FEATURE_NAME}> rdfs:subClassOf bsa:Feature ;
+ # annotations
+ rdfs:label "Spatially dominant colors"^^xsd:string ;
+ schema:description "Domiant colors of subregions in an image."^^xsd:string ;
+ bsfs:distance <https://schema.bsfs.io/core/distance#euclidean> ;
+ bsfs:dtype xsd:integer .
+
+ <{instance_name}> rdfs:subClassOf <{FEATURE_NAME}> ;
+ bsfs:dimension "{dimension}"^^xsd:integer ;
+ # annotations
+ <{FEATURE_NAME}/args#width> "{width}"^^xsd:integer ;
+ <{FEATURE_NAME}/args#height> "{height}"^^xsd:integer ;
+ <{FEATURE_NAME}/args#exp> "{exp}"^^xsd:float .
+
+ <{predicate_name}> rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range <{instance_name}> ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ '''))
+ # assign extra members
+ self.width = width
+ self.height = height
+ self.exp = exp
+ self._predicate_name = predicate_name
+
+ def __repr__(self) -> str:
+ return f'{bsfs.typename(self)}({self.width}, {self.height}, {self.exp})'
+
+ def __eq__(self, other: typing.Any) -> bool:
+ return super().__eq__(other) \
+ and self.width == other.width \
+ and self.height == other.height \
+ and self.exp == other.exp
+
+ def __hash__(self) -> int:
+ return hash((super().__hash__(), self.width, self.height, self.exp))
+
+ @staticmethod
+ def dimension(width: int, height: int, exp: float) -> int:
+ """Return the feature vector dimension."""
+ # FIXME: replace with a proper formula
+ dim = 0
+ while width >= 1 and height >= 1:
+ dim += width * height
+ width = np.floor(width / exp)
+ height = np.floor(height / exp)
+ dim *= 3 # per band
+ return int(dim)
+
+ def extract(
+ self,
+ subject: node.Node,
+ content: PIL.Image.Image,
+ principals: typing.Iterable[bsfs.schema.Predicate],
+ ) -> typing.Iterator[typing.Tuple[node.Node, bsfs.schema.Predicate, typing.Any]]:
+ # check principals
+ if self.schema.predicate(self._predicate_name) not in principals:
+ # nothing to do; abort
+ return
+
+ # convert to HSV
+ content = content.convert('HSV')
+
+ # get dimensions
+ width, height = self.width, self.height
+ num_bands = len(content.getbands()) # it's three since we converted to HSV before
+
+ features = []
+ while width >= 1 and height >= 1:
+ # downsample
+ img = content.resize((width, height), resample=PIL.Image.Resampling.BOX)
+ # feature vector
+ features.append(
+ np.array(img.getdata()).reshape((width * height, num_bands)))
+ # iterate
+ width = int(np.floor(width / self.exp))
+ height = int(np.floor(height / self.exp))
+
+ # combine bands and convert features to tuple
+ value = tuple(np.vstack(features).reshape(-1))
+ # return triple with feature vector as value
+ yield subject, self.schema.predicate(self._predicate_name), value
+
+## EOF ##
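
For the defaults (width=32, height=32, exp=4) the `dimension` loop above sums three grids before the subregion collapses. A worked example of that loop, not additional functionality:

    # Worked example of ColorsSpatial.dimension(32, 32, 4):
    # 32*32 = 1024, then 8*8 = 64, then 2*2 = 4; the next step falls below 1x1.
    # (1024 + 64 + 4) * 3 bands = 3276 feature dimensions.
    from bsie.extractor.image.colors_spatial import ColorsSpatial

    assert ColorsSpatial.dimension(32, 32, 4) == 3276
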
diff --git a/bsie/extractor/image/photometrics.py b/bsie/extractor/image/photometrics.py
new file mode 100644
index 0000000..42eb3c8
--- /dev/null
+++ b/bsie/extractor/image/photometrics.py
@@ -0,0 +1,211 @@
+
+# standard imports
+from fractions import Fraction
+import typing
+
+# bsie imports
+from bsie.utils import bsfs, node, ns
+
+# inner-module imports
+from .. import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Exif',
+ )
+
+
+## code ##
+
+def _gps_to_dec(coords: typing.Tuple[float, float, float]) -> float:
+ """Convert GPS coordinates from exif to float."""
+ # unpack args
+ deg, min, sec = coords # pylint: disable=redefined-builtin # min
+ # convert to float
+ deg = float(Fraction(deg))
+ min = float(Fraction(min))
+ sec = float(Fraction(sec))
+
+ if float(sec) > 0:
+ # format is deg+min+sec
+ return (float(deg) * 3600 + float(min) * 60 + float(sec)) / 3600
+ # format is deg+min
+ return float(deg) + float(min) / 60
+
+
+class Exif(base.Extractor):
+ """Extract information from EXIF/IPTC tags of an image file."""
+
+ CONTENT_READER = 'bsie.reader.exif.Exif'
+
+ def __init__(self):
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
+ #bse:t_capture rdfs:subClassOf bsfs:Predicate ;
+ # rdfs:domain bsn:Entity ;
+ # rdfs:range xsd:float ;
+ # bsfs:unique "true"^^xsd:boolean .
+ bse:exposure rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:float ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:aperture rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:float ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:iso rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:focal_length rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:float ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:width rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:height rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:orientation rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:orientation_label rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:altitude rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:float ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:latitude rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:float ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:longitude rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:float ;
+ bsfs:unique "true"^^xsd:boolean .
+ '''))
+ # initialize mapping from predicate to callback
+ self._callmap = {
+ #self.schema.predicate(ns.bse.t_capture): self._date,
+ self.schema.predicate(ns.bse.exposure): self._exposure,
+ self.schema.predicate(ns.bse.aperture): self._aperture,
+ self.schema.predicate(ns.bse.iso): self._iso,
+ self.schema.predicate(ns.bse.focal_length): self._focal_length,
+ self.schema.predicate(ns.bse.width): self._width,
+ self.schema.predicate(ns.bse.height): self._height,
+ self.schema.predicate(ns.bse.orientation): self._orientation,
+ self.schema.predicate(ns.bse.orientation_label): self._orientation_label,
+ self.schema.predicate(ns.bse.altitude): self._altitude,
+ self.schema.predicate(ns.bse.latitude): self._latitude,
+ self.schema.predicate(ns.bse.longitude): self._longitude,
+ }
+
+ def extract(
+ self,
+ subject: node.Node,
+ content: dict,
+ principals: typing.Iterable[bsfs.schema.Predicate],
+ ) -> typing.Iterator[typing.Tuple[node.Node, bsfs.schema.Predicate, typing.Any]]:
+ for pred in principals:
+ # find callback
+ clbk = self._callmap.get(pred)
+ if clbk is None:
+ continue
+ # get value
+ value = clbk(content)
+ if value is None:
+ continue
+ # produce triple
+ yield subject, pred, value
+
+ #def _date(self, content: dict): # FIXME: Return type annotation
+ # date_keys = (
+ # 'Exif.Photo.DateTimeOriginal',
+ # 'Exif.Photo.DateTimeDigitized',
+ # 'Exif.Image.DateTime',
+ # )
+ # for key in date_keys:
+ # if key in content:
+ # dt = content[key].value
+ # if dt.tzinfo is None:
+ # dt = dt.replace(tzinfo=ttime.NoTimeZone)
+ # return dt
+ # return None
+
+
+ ## photometrics
+
+ def _exposure(self, content: dict) -> typing.Optional[float]:
+ if 'Exif.Photo.ExposureTime' in content:
+ return 1.0 / float(Fraction(content['Exif.Photo.ExposureTime']))
+ return None
+
+ def _aperture(self, content: dict) -> typing.Optional[float]:
+ if 'Exif.Photo.FNumber' in content:
+ return float(Fraction(content['Exif.Photo.FNumber']))
+ return None
+
+ def _iso(self, content: dict) -> typing.Optional[int]:
+ if 'Exif.Photo.ISOSpeedRatings' in content:
+ return int(content['Exif.Photo.ISOSpeedRatings'])
+ return None
+
+ def _focal_length(self, content: dict) -> typing.Optional[float]:
+ if 'Exif.Photo.FocalLength' in content:
+ return float(Fraction(content['Exif.Photo.FocalLength']))
+ return None
+
+
+ ## image dimensions
+
+ def _width(self, content: dict) -> typing.Optional[int]:
+ # FIXME: consider orientation!
+ if 'Exif.Photo.PixelXDimension' in content:
+ return int(content['Exif.Photo.PixelXDimension'])
+ return None
+
+ def _height(self, content: dict) -> typing.Optional[int]:
+ # FIXME: consider orientation!
+ if 'Exif.Photo.PixelYDimension' in content:
+ return int(content['Exif.Photo.PixelYDimension'])
+ return None
+
+ def _orientation(self, content: dict) -> typing.Optional[int]:
+ if 'Exif.Image.Orientation' in content:
+ return int(content['Exif.Image.Orientation'])
+ return None
+
+ def _orientation_label(self, content: dict) -> typing.Optional[str]:
+ width = self._width(content)
+ height = self._height(content)
+ ori = self._orientation(content)
+ if width is not None and height is not None and ori is not None:
+ if ori <= 4:
+ return 'landscape' if width >= height else 'portrait'
+ return 'portrait' if width >= height else 'landscape'
+ return None
+
+
+ ## location
+
+ def _altitude(self, content: dict) -> typing.Optional[float]:
+ if 'Exif.GPSInfo.GPSAltitude' in content:
+ return float(Fraction(content['Exif.GPSInfo.GPSAltitude']))
+ return None
+
+ def _latitude(self, content: dict) -> typing.Optional[float]:
+ if 'Exif.GPSInfo.GPSLatitude' in content:
+ return _gps_to_dec(content['Exif.GPSInfo.GPSLatitude'].split())
+ return None
+
+ def _longitude(self, content: dict) -> typing.Optional[float]:
+ if 'Exif.GPSInfo.GPSLongitude' in content:
+ return _gps_to_dec(content['Exif.GPSInfo.GPSLongitude'].split())
+ return None
+
+## EOF ##
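
The `_gps_to_dec` helper above folds degree/minute/second fractions into a single decimal coordinate. A small worked example with made-up Exif values:

    # Worked example: 46 deg, 56 min, 30/2 sec
    # -> (46*3600 + 56*60 + 15) / 3600 = 46.9375 degrees.
    from bsie.extractor.image.photometrics import _gps_to_dec

    assert _gps_to_dec(('46/1', '56/1', '30/2')) == 46.9375
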
diff --git a/bsie/extractor/preview.py b/bsie/extractor/preview.py
new file mode 100644
index 0000000..145a01a
--- /dev/null
+++ b/bsie/extractor/preview.py
@@ -0,0 +1,96 @@
+
+# imports
+import io
+import typing
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import bsfs, node, ns
+
+# inner-module imports
+from . import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Preview',
+ )
+
+
+## code ##
+
+class Preview(base.Extractor):
+ """Extract previews."""
+
+ CONTENT_READER = 'bsie.reader.preview.Preview'
+
+ def __init__(self, max_sides: typing.Iterable[int]):
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
+
+
+
+ bsn:Preview rdfs:subClassOf bsfs:Node .
+ bsl:BinaryBlob rdfs:subClassOf bsfs:Literal .
+ <https://schema.bsfs.io/ie/Literal/BinaryBlob/JPEG> rdfs:subClassOf bsl:BinaryBlob .
+
+ bse:preview rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range bsn:Preview ;
+ bsfs:unique "false"^^xsd:boolean .
+
+ bsp:width rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Preview ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ bsp:height rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Preview ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ bsp:asset rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Preview ;
+ rdfs:range <https://schema.bsfs.io/ie/Literal/BinaryBlob/JPEG> ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ '''))
+ # initialize extra args
+ self.max_sides = set(max_sides)
+
+ def __eq__(self, other: typing.Any) -> bool:
+ return super().__eq__(other) \
+ and self.max_sides == other.max_sides
+
+ def __hash__(self) -> int:
+ return hash((super().__hash__(), tuple(sorted(self.max_sides))))
+
+ def extract(
+ self,
+ subject: node.Node,
+ content: typing.Callable[[int], PIL.Image.Image],
+ principals: typing.Iterable[bsfs.schema.Predicate],
+ ) -> typing.Iterator[typing.Tuple[node.Node, bsfs.schema.Predicate, typing.Any]]:
+ # check principals
+ if self.schema.predicate(ns.bse.preview) not in principals:
+ return
+
+ for max_side in self.max_sides:
+ # get the preview in the right resolution
+ img = content(max_side)
+ # convert the preview to jpeg
+ buffer = io.BytesIO()
+ img.save(buffer, format='jpeg')
+ # create a preview node
+ preview = node.Node(ns.bsn.Preview,
+ ucid=bsfs.uuid.UCID.from_bytes(buffer.getvalue()),
+ size=max_side,
+ source=subject,
+ )
+ # yield triples
+ yield subject, self.schema.predicate(ns.bse.preview), preview
+ yield preview, self.schema.predicate(ns.bsp.width), img.width
+ yield preview, self.schema.predicate(ns.bsp.height), img.height
+ yield preview, self.schema.predicate(ns.bsp.asset), buffer.getvalue()
+
+## EOF ##
diff --git a/bsie/lib/__init__.py b/bsie/lib/__init__.py
index 578c2c4..f44fb74 100644
--- a/bsie/lib/__init__.py
+++ b/bsie/lib/__init__.py
@@ -1,18 +1,16 @@
-"""
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import typing
# inner-module imports
from .bsie import BSIE
+from .builder import PipelineBuilder
+from .naming_policy import DefaultNamingPolicy
# exports
__all__: typing.Sequence[str] = (
'BSIE',
+ 'PipelineBuilder',
)
## EOF ##
diff --git a/bsie/lib/bsie.py b/bsie/lib/bsie.py
index e087fa9..b02e707 100644
--- a/bsie/lib/bsie.py
+++ b/bsie/lib/bsie.py
@@ -1,16 +1,14 @@
-"""
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import typing
# bsie imports
-from bsie.tools import Pipeline
from bsie.utils import bsfs, node, ns
+# inner-module imports
+from .naming_policy import NamingPolicy
+from .pipeline import Pipeline
+
# exports
__all__: typing.Sequence[str] = (
'BSIE',
@@ -39,15 +37,18 @@ class BSIE():
def __init__(
self,
- # pipeline builder.
+ # pipeline.
pipeline: Pipeline,
+ # naming policy
+ naming_policy: NamingPolicy,
# principals to extract at most. None implies all available w.r.t. extractors.
collect: typing.Optional[typing.Iterable[bsfs.URI]] = None,
# principals to discard.
discard: typing.Optional[typing.Iterable[bsfs.URI]] = None,
):
- # store pipeline
+ # store pipeline and naming policy
self._pipeline = pipeline
+ self._naming_policy = naming_policy
# start off with available principals
self._principals = {pred.uri for pred in self._pipeline.principals}
# limit principals to specified ones by argument.
@@ -87,6 +88,6 @@ class BSIE():
# predicate lookup
principals = {self.schema.predicate(pred) for pred in principals}
# invoke pipeline
- yield from self._pipeline(path, principals)
+ yield from self._naming_policy(self._pipeline(path, principals))
## EOF ##
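
The BSIE frontend now pairs a pipeline with a naming policy and yields store-ready triples. A minimal sketch following the index app, with placeholder paths and host/user values:

    # Sketch: wire a pipeline and a naming policy into the BSIE frontend.
    from bsie.apps import _loader
    from bsie.lib import BSIE, DefaultNamingPolicy
    from bsie.utils import bsfs

    pipeline = _loader.load_pipeline('config.yaml')  # placeholder config path
    policy = DefaultNamingPolicy(host=bsfs.URI('http://example.com'), user='me')
    frontend = BSIE(pipeline, policy)  # collect=None keeps all available principals
    for node, pred, value in frontend.from_file('photo.jpg'):  # placeholder file
        print(node.uri, pred.uri, value)
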
diff --git a/bsie/lib/builder.py b/bsie/lib/builder.py
new file mode 100644
index 0000000..3a15311
--- /dev/null
+++ b/bsie/lib/builder.py
@@ -0,0 +1,75 @@
+
+# standard imports
+import logging
+import typing
+
+# bsie imports
+from bsie.extractor import ExtractorBuilder
+from bsie.reader import ReaderBuilder
+from bsie.utils import errors
+
+# inner-module imports
+from . import pipeline
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'PipelineBuilder',
+ )
+
+
+## code ##
+
+logger = logging.getLogger(__name__)
+
+class PipelineBuilder():
+ """Build `bsie.tools.pipeline.Pipeline` instances."""
+
+ # builder for Readers.
+ rbuild: ReaderBuilder
+
+ # builder for Extractors.
+ ebuild: ExtractorBuilder
+
+ def __init__(
+ self,
+ reader_builder: ReaderBuilder,
+ extractor_builder: ExtractorBuilder,
+ ):
+ self.rbuild = reader_builder
+ self.ebuild = extractor_builder
+
+ def build(self) -> pipeline.Pipeline:
+ """Return a Pipeline instance."""
+ ext2rdr = {}
+
+ for eidx in self.ebuild:
+ # build extractor
+ try:
+ ext = self.ebuild.build(eidx)
+
+ except errors.LoaderError as err: # failed to load extractor; skip
+ logger.error('failed to load extractor: %s', err)
+ continue
+
+ except errors.BuilderError as err: # failed to build instance; skip
+ logger.error(str(err))
+ continue
+
+ try:
+ # get reader required by extractor
+ if ext.CONTENT_READER is not None:
+ rdr = self.rbuild.build(ext.CONTENT_READER)
+ else:
+ rdr = None
+ # store extractor
+ ext2rdr[ext] = rdr
+
+ except errors.LoaderError as err: # failed to load reader
+ logger.error('failed to load reader: %s', err)
+
+ except errors.BuilderError as err: # failed to build reader
+ logger.error(str(err))
+
+ return pipeline.Pipeline(ext2rdr)
+
+## EOF ##
diff --git a/bsie/lib/naming_policy.py b/bsie/lib/naming_policy.py
new file mode 100644
index 0000000..9b9a45d
--- /dev/null
+++ b/bsie/lib/naming_policy.py
@@ -0,0 +1,115 @@
+
+# standard imports
+import abc
+import os
+import typing
+
+# bsie imports
+from bsie.utils import bsfs, errors, ns
+from bsie.utils.node import Node
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'DefaultNamingPolicy',
+ )
+
+
+## code ##
+
+class NamingPolicy():
+ """Determine node uri's from node hints."""
+ def __call__(
+ self,
+ iterable: typing.Iterable[typing.Tuple[Node, bsfs.URI, typing.Any]],
+ ):
+ """Apply the policy on a triple iterator."""
+ return NamingPolicyIterator(self, iterable)
+
+ @abc.abstractmethod
+ def handle_node(self, node: Node) -> Node:
+ """Apply the policy on a node."""
+
+
+class NamingPolicyIterator():
+ """Iterates over triples, determines uris according to a *policy* as it goes."""
+
+ # source triple iterator.
+ _iterable: typing.Iterable[typing.Tuple[Node, bsfs.URI, typing.Any]]
+
+ # naming policy
+ _policy: NamingPolicy
+
+ def __init__(
+ self,
+ policy: NamingPolicy,
+ iterable: typing.Iterable[typing.Tuple[Node, bsfs.URI, typing.Any]],
+ ):
+ self._iterable = iterable
+ self._policy = policy
+
+ def __iter__(self):
+ for node, pred, value in self._iterable:
+ # handle subject
+ self._policy.handle_node(node)
+ # handle value
+ if isinstance(value, Node):
+ self._policy.handle_node(value)
+ # yield triple
+ yield node, pred, value
+
+
+class DefaultNamingPolicy(NamingPolicy):
+ """Compose URIs as <host/user/node_type#fragment>
+
+ What information is used as fragment depends on the node type.
+ Typically, the default is to use the "ucid" hint.
+ The fallback in all cases is to generate a random uuid.
+
+ Never changes previously assigned URIs. Sets URIs in place.
+
+ """
+
+ def __init__(
+ self,
+ host: bsfs.URI,
+ user: str,
+ ):
+ self._prefix = bsfs.Namespace(os.path.join(host, user))
+ self._uuid = bsfs.uuid.UUID()
+
+ def handle_node(self, node: Node) -> Node:
+ if node.uri is not None:
+ return node
+ if node.node_type == ns.bsn.Entity:
+ return self.name_file(node)
+ if node.node_type == ns.bsn.Preview:
+ return self.name_preview(node)
+ raise errors.ProgrammingError(f'no naming policy available for {node.node_type}')
+
+ def name_file(self, node: Node) -> Node:
+ """Set a bsfs:File node's uri fragment to its ucid."""
+ if 'ucid' in node.hints: # content id
+ fragment = node.hints['ucid']
+ else: # random name
+ fragment = self._uuid()
+ node.uri = getattr(self._prefix.file(), fragment)
+ return node
+
+ def name_preview(self, node: Node) -> Node:
+ """Set a bsfs:Preview node's uri fragment to its ucid.
+ Uses its source fragment as fallback. Appends the size if provided.
+ """
+ fragment = None
+ if 'ucid' in node.hints: # content id
+ fragment = node.hints['ucid']
+ if fragment is None and 'source' in node.hints: # source id
+ self.handle_node(node.hints['source'])
+ fragment = node.hints['source'].uri.get('fragment', None)
+ if fragment is None: # random name
+ fragment = self._uuid()
+ if 'size' in node.hints: # append size
+ fragment += '_s' + str(node.hints['size'])
+ node.uri = getattr(self._prefix.preview(), fragment)
+ return node
+
+## EOF ##
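
DefaultNamingPolicy composes URIs as <host/user/node_type#fragment>, preferring a node's "ucid" hint over a random uuid. A minimal sketch with made-up host, user, and content id:

    # Sketch: assign a URI to a fresh Entity node from its content-id hint.
    from bsie.lib import DefaultNamingPolicy
    from bsie.utils import bsfs, node, ns

    policy = DefaultNamingPolicy(host=bsfs.URI('http://example.com'), user='me')
    entity = node.Node(ns.bsn.Entity, ucid='0123abcd')  # hypothetical content id
    policy.handle_node(entity)
    print(entity.uri)  # roughly http://example.com/me/file#0123abcd
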
diff --git a/bsie/tools/pipeline.py b/bsie/lib/pipeline.py
index 20e8ddf..30fd6fd 100644
--- a/bsie/tools/pipeline.py
+++ b/bsie/lib/pipeline.py
@@ -1,25 +1,19 @@
-"""
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
from collections import defaultdict
import logging
import typing
# bsie imports
-from bsie import base
-from bsie.utils import bsfs, node, ns
+from bsie.extractor import Extractor
+from bsie.reader import Reader
+from bsie.utils import bsfs, errors, node, ns
# exports
__all__: typing.Sequence[str] = (
'Pipeline',
)
-# constants
-FILE_PREFIX = 'file#'
## code ##
@@ -39,19 +33,14 @@ class Pipeline():
# combined extractor schemas.
_schema: bsfs.schema.Schema
- # node prefix.
- _prefix: bsfs.Namespace
-
# extractor -> reader mapping
- _ext2rdr: typing.Dict[base.extractor.Extractor, typing.Optional[base.reader.Reader]]
+ _ext2rdr: typing.Dict[Extractor, typing.Optional[Reader]]
def __init__(
self,
- prefix: bsfs.Namespace,
- ext2rdr: typing.Dict[base.extractor.Extractor, typing.Optional[base.reader.Reader]]
+ ext2rdr: typing.Dict[Extractor, typing.Optional[Reader]]
):
# store core members
- self._prefix = prefix + FILE_PREFIX
self._ext2rdr = ext2rdr
# compile schema from all extractors
self._schema = bsfs.schema.Schema.Union(ext.schema for ext in ext2rdr)
@@ -63,12 +52,11 @@ class Pipeline():
return f'{bsfs.typename(self)}(...)'
def __hash__(self) -> int:
- return hash((type(self), self._prefix, self._schema, tuple(self._ext2rdr), tuple(self._ext2rdr.values())))
+ return hash((type(self), self._schema, tuple(self._ext2rdr), tuple(self._ext2rdr.values())))
def __eq__(self, other: typing.Any) -> bool:
return isinstance(other, type(self)) \
and self._schema == other._schema \
- and self._prefix == other._prefix \
and self._ext2rdr == other._ext2rdr
@property
@@ -116,27 +104,33 @@ class Pipeline():
rdr2ext[rdr].add(ext)
# create subject for file
- uuid = bsfs.uuid.UCID.from_path(path)
- subject = node.Node(ns.bsfs.File, self._prefix[uuid])
+ subject = node.Node(ns.bsn.Entity,
+ ucid=bsfs.uuid.UCID.from_path(path),
+ )
# extract information
for rdr, extrs in rdr2ext.items():
try:
# get content
content = rdr(path) if rdr is not None else None
+ #logger.info('extracted %s from %s', rdr, path)
# apply extractors on this content
for ext in extrs:
try:
# get predicate/value tuples
- for subject, pred, value in ext.extract(subject, content, principals):
- yield subject, pred, value
+ yield from ext.extract(subject, content, principals)
- except base.errors.ExtractorError as err:
+ except errors.ExtractorError as err:
# critical extractor failure.
logger.error('%s failed to extract triples from content: %s', ext, err)
- except base.errors.ReaderError as err:
+ except errors.UnsupportedFileFormatError:
+ # failed to read the file format. skip.
+ #logger.warning('%s could not process the file format of %s', rdr, err)
+ pass
+
+ except errors.ReaderError as err:
# failed to read any content. skip.
logger.error('%s failed to read content: %s', rdr, err)
diff --git a/bsie/reader/__init__.py b/bsie/reader/__init__.py
index a45f22b..a1c38a9 100644
--- a/bsie/reader/__init__.py
+++ b/bsie/reader/__init__.py
@@ -1,8 +1,8 @@
"""The Reader classes return high-level content structures from files.
The Reader fulfills two purposes:
- First, it brokers between multiple libraries and file formats.
- Second, it separates multiple aspects of a file into distinct content types.
+First, it brokers between multiple libraries and file formats.
+Second, it separates multiple aspects of a file into distinct content types.
Often, different libraries focus on reading different types of content from a
file. E.g. one would use different modules to read file system infos than to
@@ -11,9 +11,18 @@ type. Each distinct type can be implemented in a file or submodule that
provides a Reader implementation. Through utilization of submodules, different
file formats can be supported.
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
+# standard imports
+import typing
+
+# inner-module imports
+from .base import Reader
+from .builder import ReaderBuilder
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Reader',
+ 'ReaderBuilder',
+ )
## EOF ##
diff --git a/bsie/base/reader.py b/bsie/reader/base.py
index cbabd36..a775701 100644
--- a/bsie/base/reader.py
+++ b/bsie/reader/base.py
@@ -1,14 +1,5 @@
-"""The Reader classes return high-level content structures from files.
-The Reader fulfills two purposes:
- First, it brokers between multiple libraries and file formats.
- Second, it separates multiple aspects of a file into distinct content types.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import abc
import typing
@@ -39,7 +30,7 @@ class Reader(abc.ABC):
return hash(type(self))
@abc.abstractmethod
- def __call__(self, path: bsfs.URI) -> typing.Any:
+ def __call__(self, path: str) -> typing.Any:
"""Return some content of the file at *path*.
Raises a `ReaderError` if the reader cannot make sense of the file format.
"""
diff --git a/bsie/reader/builder.py b/bsie/reader/builder.py
new file mode 100644
index 0000000..d32700b
--- /dev/null
+++ b/bsie/reader/builder.py
@@ -0,0 +1,73 @@
+
+# standard imports
+import typing
+
+# bsie imports
+from bsie.utils import bsfs, errors, safe_load, unpack_qualified_name
+
+# inner-module imports
+from . import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'ReaderBuilder',
+ )
+
+
+## code ##
+
+class ReaderBuilder():
+ """Build `bsie.base.Reader` instances.
+
+ Readers are defined via their qualified class name
+ (e.g., bsie.reader.path.Path) and optional keyword
+ arguments that are passed to the constructor via
+ the *kwargs* argument (name as key, kwargs as value).
+    The ReaderBuilder keeps a cache of previously built
+    reader instances, since repeated requests for the same
+    name would use identical keyword arguments anyway.
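+
+    Example (illustrative; the Path reader shown here takes no constructor arguments)::
+
+        builder = ReaderBuilder({'bsie.reader.path.Path': {}})
+        reader = builder.build('bsie.reader.path.Path')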
+
+ """
+
+ # keyword arguments
+ _kwargs: typing.Dict[str, typing.Dict[str, typing.Any]]
+
+ # cached readers
+ _cache: typing.Dict[str, base.Reader]
+
+ def __init__(
+ self,
+ kwargs: typing.Optional[typing.Dict[str, typing.Dict[str, typing.Any]]] = None):
+ if kwargs is None:
+ kwargs = {}
+ self._kwargs = kwargs
+ self._cache = {}
+
+ def build(self, name: str) -> base.Reader:
+ """Return an instance for the qualified class name."""
+ # return cached instance
+ if name in self._cache:
+ return self._cache[name]
+
+ # check name and get module/class components
+ module_name, class_name = unpack_qualified_name(name)
+
+ # import reader class
+ cls = safe_load(module_name, class_name)
+
+ # get kwargs
+ kwargs = self._kwargs.get(name, {})
+ if not isinstance(kwargs, dict):
+ raise TypeError(f'expected a kwargs dict, found {bsfs.typename(kwargs)}')
+
+ try: # build, cache, and return instance
+ obj = cls(**kwargs)
+ # cache instance
+ self._cache[name] = obj
+ # return instance
+ return obj
+
+ except Exception as err:
+ raise errors.BuilderError(f'failed to build reader {name} due to {bsfs.typename(err)}: {err}') from err
+
+## EOF ##
diff --git a/bsie/reader/chain.py b/bsie/reader/chain.py
new file mode 100644
index 0000000..79b44b4
--- /dev/null
+++ b/bsie/reader/chain.py
@@ -0,0 +1,86 @@
+
+# standard imports
+import logging
+import typing
+
+# bsie imports
+from bsie.utils import bsfs, errors
+
+# inner-module imports
+from . import base
+from . import builder
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'ReaderChain',
+ )
+
+
+## code ##
+
+logger = logging.getLogger(__name__)
+
+# Content type.
+T_CONTENT = typing.TypeVar('T_CONTENT') # pylint: disable=invalid-name
+
+class ReaderChain(base.Reader, typing.Generic[T_CONTENT]):
+ """Read an image."""
+
+ # sub-readers for specific file formats.
+ _children: typing.Tuple[base.Reader, ...]
+
+ def __init__(
+ self,
+ subreader_names: typing.Iterable[str],
+ cfg: typing.Optional[typing.Any] = None,
+ ):
+ rbuild = builder.ReaderBuilder(cfg)
+ children = []
+ for name in subreader_names:
+ try:
+ # build sub-reader
+ children.append(rbuild.build(name))
+ except (ValueError,
+ TypeError,
+ errors.LoaderError,
+ errors.BuilderError) as err:
+ # failed to build a child; skip and notify
+ logger.warning('failed to load reader: %s', err)
+
+ if len(children) == 0:
+ logger.warning('%s failed to load any sub-readers.', bsfs.typename(self))
+
+ # copy children to member
+ self._children = tuple(children)
+
+ def __str__(self) -> str:
+ substr = ', '.join(str(child) for child in self._children)
+ return f'{bsfs.typename(self)}({substr})'
+
+ def __repr__(self) -> str:
+ return f'{bsfs.typename(self)}({self._children})'
+
+ def __eq__(self, other: typing.Any) -> bool:
+ return super().__eq__(other) \
+ and self._children == other._children
+
+ def __hash__(self) -> int:
+ return hash((super().__hash__(), self._children))
+
+ def __call__(self, path: str) -> T_CONTENT:
+ raise_error = False
+ for child in self._children:
+ try:
+ return child(path)
+ except errors.UnsupportedFileFormatError:
+ # child cannot read the file, skip.
+ pass
+ except errors.ReaderError:
+ # child failed to read the file, skip.
+ raise_error = True
+
+ if raise_error:
+ raise errors.ReaderError(path)
+ raise errors.UnsupportedFileFormatError(path)
+
+## EOF ##
diff --git a/bsie/reader/exif.py b/bsie/reader/exif.py
new file mode 100644
index 0000000..2d0428b
--- /dev/null
+++ b/bsie/reader/exif.py
@@ -0,0 +1,44 @@
+
+# standard imports
+import typing
+
+# external imports
+import pyexiv2
+
+# bsie imports
+from bsie.utils import errors, filematcher
+
+# inner-module imports
+from . import base
+
+# constants
+MATCH_RULE = 'mime=image/jpeg'
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Exif',
+ )
+
+
+## code ##
+
+class Exif(base.Reader):
+ """Use pyexiv2 to read exif metadata from image files."""
+
+ def __init__(self):
+ self._match = filematcher.parse(MATCH_RULE)
+
+ def __call__(self, path: str) -> dict:
+ # perform quick checks first
+ if not self._match(path):
+ raise errors.UnsupportedFileFormatError(path)
+
+ try:
+ # open the file
+ img = pyexiv2.Image(path)
+ # read metadata
+ return img.read_exif()
+ except (TypeError, OSError, RuntimeError) as err:
+ raise errors.ReaderError(path) from err
+
+## EOF ##
diff --git a/bsie/reader/image/__init__.py b/bsie/reader/image/__init__.py
new file mode 100644
index 0000000..89642f2
--- /dev/null
+++ b/bsie/reader/image/__init__.py
@@ -0,0 +1,31 @@
+
+# standard imports
+import typing
+
+# external imports
+import PIL.Image
+
+# inner-module imports
+from .. import chain
+
+# constants
+_FILE_FORMAT_READERS: typing.Sequence[str] = (
+ __package__ + '._raw.RawImage',
+ __package__ + '._pillow.PillowImage',
+ )
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Image',
+ )
+
+
+## code ##
+
+class Image(chain.ReaderChain[PIL.Image.Image]): # pylint: disable=too-few-public-methods
+ """Read an image file."""
+
+ def __init__(self, cfg: typing.Optional[typing.Any] = None):
+ super().__init__(_FILE_FORMAT_READERS, cfg)
+
+## EOF ##
diff --git a/bsie/reader/image/_pillow.py b/bsie/reader/image/_pillow.py
new file mode 100644
index 0000000..0611d3c
--- /dev/null
+++ b/bsie/reader/image/_pillow.py
@@ -0,0 +1,34 @@
+
+# standard imports
+import typing
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# inner-module imports
+from .. import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'PillowImage',
+ )
+
+
+## code ##
+
+class PillowImage(base.Reader):
+ """Use PIL to read content of a variety of image file types."""
+
+ def __call__(self, path: str) -> PIL.Image.Image:
+ try:
+ # open file with PIL
+ return PIL.Image.open(path)
+ except PIL.UnidentifiedImageError as err:
+ raise errors.UnsupportedFileFormatError(path) from err
+ except IOError as err:
+ raise errors.ReaderError(path) from err
+
## EOF ##
diff --git a/bsie/reader/image/_raw.py b/bsie/reader/image/_raw.py
new file mode 100644
index 0000000..e5745aa
--- /dev/null
+++ b/bsie/reader/image/_raw.py
@@ -0,0 +1,56 @@
+
+# standard imports
+import typing
+
+# external imports
+import PIL.Image
+import rawpy
+
+# bsie imports
+from bsie.utils import errors, filematcher
+
+# inner-module imports
+from .. import base
+
+# constants
+MATCH_RULE = 'mime={image/x-nikon-nef} | extension={nef}'
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'RawImage',
+ )
+
+
+## code ##
+
+class RawImage(base.Reader):
+ """Use rawpy to read content of raw image file types."""
+
+ # file matcher
+ _match: filematcher.Matcher
+
+ # additional kwargs to rawpy's postprocess
+ _rawpy_kwargs: typing.Dict[str, typing.Any]
+
+ def __init__(self, **rawpy_kwargs):
+ match_rule = rawpy_kwargs.pop('file_match_rule', MATCH_RULE)
+ self._match = filematcher.parse(match_rule)
+ self._rawpy_kwargs = rawpy_kwargs
+
+ def __call__(self, path: str) -> PIL.Image.Image:
+ # perform quick checks first
+ if not self._match(path):
+ raise errors.UnsupportedFileFormatError(path)
+
+ try:
+ # open file with rawpy
+ ary = rawpy.imread(path).postprocess(**self._rawpy_kwargs)
+ # convert to PIL.Image
+ return PIL.Image.fromarray(ary)
+ except (rawpy.LibRawFatalError, # pylint: disable=no-member # pylint doesn't find the errors
+ rawpy.NotSupportedError, # pylint: disable=no-member
+ rawpy.LibRawNonFatalError, # pylint: disable=no-member
+ ) as err:
+ raise errors.ReaderError(path) from err
+
+## EOF ##
diff --git a/bsie/reader/path.py b/bsie/reader/path.py
index d60f187..45eb127 100644
--- a/bsie/reader/path.py
+++ b/bsie/reader/path.py
@@ -1,14 +1,10 @@
"""The Path reader produces a file path.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
-# bsie imports
-from bsie.base import reader
+# inner-module imports
+from . import base
# exports
__all__: typing.Sequence[str] = (
@@ -18,7 +14,7 @@ __all__: typing.Sequence[str] = (
## code ##
-class Path(reader.Reader):
+class Path(base.Reader):
"""Return the path."""
def __call__(self, path: str) -> str:
diff --git a/bsie/reader/preview/__init__.py b/bsie/reader/preview/__init__.py
new file mode 100644
index 0000000..791a133
--- /dev/null
+++ b/bsie/reader/preview/__init__.py
@@ -0,0 +1,34 @@
+
+# imports
+import typing
+
+# external imports
+import PIL.Image
+
+# inner-module imports
+from .. import chain
+
+# constants
+_FILE_FORMAT_READERS: typing.Sequence[str] = (
+ # native image formats
+ __package__ + '._pillow.PillowPreviewReader',
+ __package__ + '._rawpy.RawpyPreviewReader',
+ # multiformat readers
+ __package__ + '._pg.PreviewGeneratorReader',
+ )
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Preview',
+ )
+
+
+## code ##
+
+class Preview(chain.ReaderChain[typing.Callable[[int], PIL.Image.Image]]): # pylint: disable=too-few-public-methods
+ """Create a preview from a file."""
+
+ def __init__(self, cfg: typing.Optional[typing.Any] = None):
+ super().__init__(_FILE_FORMAT_READERS, cfg)
+
+## EOF ##
diff --git a/bsie/reader/preview/_pg.py b/bsie/reader/preview/_pg.py
new file mode 100644
index 0000000..401b33d
--- /dev/null
+++ b/bsie/reader/preview/_pg.py
@@ -0,0 +1,81 @@
+
+# standard imports
+from functools import partial
+import contextlib
+import io
+import os
+import shutil
+import tempfile
+import typing
+
+# external imports
+from preview_generator.manager import PreviewManager
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# inner-module imports
+from .. import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'PreviewGeneratorReader',
+ )
+
+
+## code ##
+
+class PreviewGeneratorReader(base.Reader):
+ """Uses preview_generator to create previews for various data formats.
+    See https://github.com/algoo/preview-generator for details.
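+
+    Calling the reader returns a callback; invoking that callback with a maximum
+    side length (e.g. ``reader(path)(512)``) yields a ``PIL.Image.Image`` preview.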
+ """
+
+ # PreviewManager instance.
+ _mngr: PreviewManager
+
+ # Set of mime types supported by PreviewManager.
+ _supported_mimetypes: typing.Set[str]
+
+ # PreviewManager cache.
+ _cache: str
+
+ # Determines whether the cache directory should be deleted after use.
+ _cleanup: bool
+
+ def __init__(self, cache: typing.Optional[str] = None):
+ # initialize cache directory
+ # TODO: initialize in memory, e.g., via PyFilesystem
+ if cache is None:
+ self._cache = tempfile.mkdtemp(prefix='bsie-preview-cache-')
+ self._cleanup = True
+ else:
+ self._cache = cache
+ self._cleanup = False
+ # create preview generator
+ with contextlib.redirect_stderr(io.StringIO()):
+ self._mngr = PreviewManager(self._cache, create_folder=True)
+ self._supported_mimetypes = set(self._mngr.get_supported_mimetypes())
+
+ def __del__(self):
+ if self._cleanup:
+ shutil.rmtree(self._cache, ignore_errors=True)
+
+ def __call__(self, path: str) -> typing.Callable[[int], PIL.Image.Image]:
+ if not os.path.exists(path):
+ raise errors.ReaderError(path)
+ if self._mngr.get_mimetype(path) not in self._supported_mimetypes:
+ raise errors.UnsupportedFileFormatError(path)
+ return partial(self._preview_callback, path)
+
+ def _preview_callback(self, path: str, max_side: int) -> PIL.Image.Image:
+ """Produce a jpeg preview of *path* with at most *max_side* side length."""
+ try:
+ # generate the preview
+ preview_path = self._mngr.get_jpeg_preview(path, width=max_side, height=max_side)
+ # open the preview and return
+ return PIL.Image.open(preview_path)
+ except Exception as err: # FIXME: less generic exception!
+ raise errors.ReaderError(path) from err
+
+## EOF ##
diff --git a/bsie/reader/preview/_pillow.py b/bsie/reader/preview/_pillow.py
new file mode 100644
index 0000000..2b797c6
--- /dev/null
+++ b/bsie/reader/preview/_pillow.py
@@ -0,0 +1,39 @@
+
+# standard imports
+from functools import partial
+import typing
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# inner-module imports
+from . import utils
+from .. import base
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'PillowPreviewReader',
+ )
+
+
+## code ##
+
+class PillowPreviewReader(base.Reader):
+ """Produce previews for image files using the Pillow library."""
+
+ def __call__(self, path: str) -> typing.Callable[[int], PIL.Image.Image]:
+ try:
+ # open file with PIL
+ img = PIL.Image.open(path)
+ # return callback
+ return partial(utils.resize, img)
+ except PIL.UnidentifiedImageError as err:
+ # failed to open, skip file
+ raise errors.UnsupportedFileFormatError(path) from err
+ except OSError as err:
+ raise errors.ReaderError(path) from err
+
## EOF ##
diff --git a/bsie/reader/preview/_rawpy.py b/bsie/reader/preview/_rawpy.py
new file mode 100644
index 0000000..16e8675
--- /dev/null
+++ b/bsie/reader/preview/_rawpy.py
@@ -0,0 +1,61 @@
+
+# standard imports
+from functools import partial
+import typing
+
+# external imports
+import PIL.Image
+import rawpy
+
+# bsie imports
+from bsie.utils import errors, filematcher
+
+# inner-module imports
+from . import utils
+from .. import base
+
+# constants
+MATCH_RULE = 'mime={image/x-nikon-nef} | extension={nef}'
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'RawpyPreviewReader',
+ )
+
+
+## code ##
+
+class RawpyPreviewReader(base.Reader):
+ """Produce previews for raw image files using the rawpy library."""
+
+ # file matcher
+ _match: filematcher.Matcher
+
+ # additional kwargs to rawpy's postprocess
+ _rawpy_kwargs: typing.Dict[str, typing.Any]
+
+ def __init__(self, **rawpy_kwargs):
+ match_rule = rawpy_kwargs.pop('file_match_rule', MATCH_RULE)
+ self._match = filematcher.parse(match_rule)
+ self._rawpy_kwargs = rawpy_kwargs
+
+ def __call__(self, path: str) -> typing.Callable[[int], PIL.Image.Image]:
+ # perform quick checks first
+ if not self._match(path):
+ raise errors.UnsupportedFileFormatError(path)
+
+ try:
+ # open file with rawpy
+ ary = rawpy.imread(path).postprocess(**self._rawpy_kwargs)
+ # convert to PIL.Image
+ img = PIL.Image.fromarray(ary)
+ # return callback
+ return partial(utils.resize, img)
+
+ except (rawpy.LibRawFatalError, # pylint: disable=no-member # pylint doesn't find the errors
+ rawpy.NotSupportedError, # pylint: disable=no-member
+ rawpy.LibRawNonFatalError, # pylint: disable=no-member
+ ) as err:
+ raise errors.ReaderError(path) from err
+
+## EOF ##
diff --git a/bsie/reader/preview/utils.py b/bsie/reader/preview/utils.py
new file mode 100644
index 0000000..82ecc31
--- /dev/null
+++ b/bsie/reader/preview/utils.py
@@ -0,0 +1,34 @@
+
+# standard imports
+import typing
+
+# external imports
+import PIL.Image
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'resize',
+ )
+
+
+## code ##
+
+def resize(
+ img: PIL.Image.Image,
+ max_size: int,
+ ) -> PIL.Image.Image:
+ """Resize an image to a given maximum side length."""
+ # determine target dimensions
+ ratio = img.width / img.height
+ if img.width > img.height:
+ width, height = max_size, round(max_size / ratio)
+ else:
+ width, height = round(ratio * max_size), max_size
+ # rescale and return
+ return img.resize(
+ (width, height),
+ resample=PIL.Image.Resampling.LANCZOS, # create high-quality image
+ reducing_gap=3.0, # optimize computation via fast size reduction
+ )
+
+## EOF ##
diff --git a/bsie/reader/stat.py b/bsie/reader/stat.py
index fc5fb24..f42e7fb 100644
--- a/bsie/reader/stat.py
+++ b/bsie/reader/stat.py
@@ -1,15 +1,14 @@
"""The Stat reader produces filesystem stat information.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import os
import typing
# bsie imports
-from bsie.base import errors, reader
+from bsie.utils import errors
+
+# inner-module imports
+from . import base
# exports
__all__: typing.Sequence[str] = (
@@ -19,7 +18,7 @@ __all__: typing.Sequence[str] = (
## code ##
-class Stat(reader.Reader):
+class Stat(base.Reader):
"""Read and return the filesystem's stat infos."""
def __call__(self, path: str) -> os.stat_result:
diff --git a/bsie/tools/__init__.py b/bsie/tools/__init__.py
deleted file mode 100644
index 803c321..0000000
--- a/bsie/tools/__init__.py
+++ /dev/null
@@ -1,20 +0,0 @@
-"""
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
-import typing
-
-# inner-module imports
-from . import builder
-from .pipeline import Pipeline
-
-# exports
-__all__: typing.Sequence[str] = (
- 'builder',
- 'Pipeline',
- )
-
-## EOF ##
diff --git a/bsie/tools/builder.py b/bsie/tools/builder.py
deleted file mode 100644
index 190d9bf..0000000
--- a/bsie/tools/builder.py
+++ /dev/null
@@ -1,226 +0,0 @@
-"""
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
-import importlib
-import logging
-import typing
-
-# bsie imports
-from bsie import base
-from bsie.base import errors
-from bsie.utils import bsfs
-
-# inner-module imports
-from . import pipeline
-
-# exports
-__all__: typing.Sequence[str] = (
- 'ExtractorBuilder',
- 'PipelineBuilder',
- 'ReaderBuilder',
- )
-
-
-## code ##
-
-logger = logging.getLogger(__name__)
-
-def _safe_load(module_name: str, class_name: str):
- """Get a class from a module. Raise BuilderError if anything goes wrong."""
- try:
- # load the module
- module = importlib.import_module(module_name)
- except Exception as err:
- # cannot import module
- raise errors.LoaderError(f'cannot load module {module_name}') from err
-
- try:
- # get the class from the module
- cls = getattr(module, class_name)
- except Exception as err:
- # cannot find the class
- raise errors.LoaderError(f'cannot load class {class_name} from module {module_name}') from err
-
- return cls
-
-
-def _unpack_name(name):
- """Split a name into its module and class component (dot-separated)."""
- if not isinstance(name, str):
- raise TypeError(name)
- if '.' not in name:
- raise ValueError('name must be a qualified class name.')
- module_name, class_name = name[:name.rfind('.')], name[name.rfind('.')+1:]
- if module_name == '':
- raise ValueError('name must be a qualified class name.')
- return module_name, class_name
-
-
-class ReaderBuilder():
- """Build `bsie.base.Reader` instances.
-
- Readers are defined via their qualified class name
- (e.g., bsie.reader.path.Path) and optional keyword
- arguments that are passed to the constructor via
- the *kwargs* argument (name as key, kwargs as value).
- The ReaderBuilder keeps a cache of previously built
- reader instances, as they are anyway built with
- identical keyword arguments.
-
- """
-
- # keyword arguments
- _kwargs: typing.Dict[str, typing.Dict[str, typing.Any]]
-
- # cached readers
- _cache: typing.Dict[str, base.Reader]
-
- def __init__(self, kwargs: typing.Dict[str, typing.Dict[str, typing.Any]]):
- self._kwargs = kwargs
- self._cache = {}
-
- def build(self, name: str) -> base.Reader:
- """Return an instance for the qualified class name."""
- # return cached instance
- if name in self._cache:
- return self._cache[name]
-
- # check name and get module/class components
- module_name, class_name = _unpack_name(name)
-
- # import reader class
- cls = _safe_load(module_name, class_name)
-
- # get kwargs
- kwargs = self._kwargs.get(name, {})
- if not isinstance(kwargs, dict):
- raise TypeError(f'expected a kwargs dict, found {bsfs.typename(kwargs)}')
-
- try: # build, cache, and return instance
- obj = cls(**kwargs)
- # cache instance
- self._cache[name] = obj
- # return instance
- return obj
-
- except Exception as err:
- raise errors.BuilderError(f'failed to build reader {name} due to {bsfs.typename(err)}: {err}') from err
-
-
-class ExtractorBuilder():
- """Build `bsie.base.Extractor instances.
-
- It is permissible to build multiple instances of the same extractor
- (typically with different arguments), hence the ExtractorBuilder
- receives a list of build specifications. Each specification is
- a dict with a single key (extractor's qualified name) and a dict
- to be used as keyword arguments.
- Example: [{'bsie.extractor.generic.path.Path': {}}, ]
-
- """
-
- # build specifications
- _specs: typing.List[typing.Dict[str, typing.Dict[str, typing.Any]]]
-
- def __init__(self, specs: typing.List[typing.Dict[str, typing.Dict[str, typing.Any]]]):
- self._specs = specs
-
- def __iter__(self) -> typing.Iterator[int]:
- """Iterate over extractor specifications."""
- return iter(range(len(self._specs)))
-
- def build(self, index: int) -> base.Extractor:
- """Return an instance of the n'th extractor (n=*index*)."""
- # get build instructions
- specs = self._specs[index]
-
- # check specs structure. expecting[{name: {kwargs}}]
- if not isinstance(specs, dict):
- raise TypeError(f'expected a dict, found {bsfs.typename(specs)}')
- if len(specs) != 1:
- raise TypeError(f'expected a dict of length one, found {len(specs)}')
-
- # get name and args from specs
- name = next(iter(specs.keys()))
- kwargs = specs[name]
-
- # check kwargs structure
- if not isinstance(kwargs, dict):
- raise TypeError(f'expected a dict, found {bsfs.typename(kwargs)}')
-
- # check name and get module/class components
- module_name, class_name = _unpack_name(name)
-
- # import extractor class
- cls = _safe_load(module_name, class_name)
-
- try: # build and return instance
- return cls(**kwargs)
-
- except Exception as err:
- raise errors.BuilderError(f'failed to build extractor {name} due to {bsfs.typename(err)}: {err}') from err
-
-
-class PipelineBuilder():
- """Build `bsie.tools.pipeline.Pipeline` instances."""
-
- # Prefix to be used in the Pipeline.
- prefix: bsfs.Namespace
-
- # builder for Readers.
- rbuild: ReaderBuilder
-
- # builder for Extractors.
- ebuild: ExtractorBuilder
-
- def __init__(
- self,
- prefix: bsfs.Namespace,
- reader_builder: ReaderBuilder,
- extractor_builder: ExtractorBuilder,
- ):
- self.prefix = prefix
- self.rbuild = reader_builder
- self.ebuild = extractor_builder
-
- def build(self) -> pipeline.Pipeline:
- """Return a Pipeline instance."""
- ext2rdr = {}
-
- for eidx in self.ebuild:
- # build extractor
- try:
- ext = self.ebuild.build(eidx)
-
- except errors.LoaderError as err: # failed to load extractor; skip
- logger.error('failed to load extractor: %s', err)
- continue
-
- except errors.BuilderError as err: # failed to build instance; skip
- logger.error(str(err))
- continue
-
- try:
- # get reader required by extractor
- if ext.CONTENT_READER is not None:
- rdr = self.rbuild.build(ext.CONTENT_READER)
- else:
- rdr = None
- # store extractor
- ext2rdr[ext] = rdr
-
- except errors.LoaderError as err: # failed to load reader
- logger.error('failed to load reader: %s', err)
-
- except errors.BuilderError as err: # failed to build reader
- logger.error(str(err))
-
- return pipeline.Pipeline(self.prefix, ext2rdr)
-
-
-
-## EOF ##
diff --git a/bsie/utils/__init__.py b/bsie/utils/__init__.py
index bd22236..18c8db7 100644
--- a/bsie/utils/__init__.py
+++ b/bsie/utils/__init__.py
@@ -1,22 +1,23 @@
"""Common tools and definitions.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# inner-module imports
from . import bsfs
+from . import filematcher
from . import namespaces as ns
from . import node
+from .loading import safe_load, unpack_qualified_name
# exports
__all__: typing.Sequence[str] = (
'bsfs',
+ 'filematcher',
'node',
'ns',
+ 'safe_load',
+ 'unpack_qualified_name',
)
## EOF ##
diff --git a/bsie/utils/bsfs.py b/bsie/utils/bsfs.py
index 0b88479..fc045cc 100644
--- a/bsie/utils/bsfs.py
+++ b/bsie/utils/bsfs.py
@@ -1,10 +1,6 @@
"""BSFS bridge, provides BSFS bindings for BSIE.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# bsfs imports
diff --git a/bsie/base/errors.py b/bsie/utils/errors.py
index dc3c30e..7c7e6ed 100644
--- a/bsie/base/errors.py
+++ b/bsie/utils/errors.py
@@ -1,10 +1,6 @@
"""Common BSIE exceptions.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# exports
@@ -39,4 +35,10 @@ class ProgrammingError(_BSIEError):
class UnreachableError(ProgrammingError):
"""Bravo, you've reached a point in code that should logically not be reachable."""
+class ParserError(_BSIEError):
+ """Failed to parse due to invalid syntax or structures."""
+
+class UnsupportedFileFormatError(_BSIEError):
+ """Failed to read a file format."""
+
## EOF ##
diff --git a/bsie/utils/filematcher/__init__.py b/bsie/utils/filematcher/__init__.py
new file mode 100644
index 0000000..908de78
--- /dev/null
+++ b/bsie/utils/filematcher/__init__.py
@@ -0,0 +1,15 @@
+
+# standard imports
+import typing
+
+# inner-module imports
+from .matcher import Matcher
+from .parser import parse
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'Matcher',
+ 'parse',
+ )
+
+## EOF ##
diff --git a/bsie/utils/filematcher/matcher.py b/bsie/utils/filematcher/matcher.py
new file mode 100644
index 0000000..1fa308e
--- /dev/null
+++ b/bsie/utils/filematcher/matcher.py
@@ -0,0 +1,174 @@
+
+# standard imports
+from collections.abc import Callable, Collection, Hashable
+import abc
+import os
+import typing
+
+# external imports
+import magic
+
+# exports
+__all__: typing.Sequence[str] = []
+
+
+## code ##
+
+# abstract nodes
+
+class Matcher(abc.ABC, Hashable, Callable, Collection): # type: ignore [misc] # Invalid base class Callable
+ """Matcher node base class."""
+
+ # child expressions or terminals
+ _childs: typing.Set[typing.Any]
+
+ def __init__(self, *childs: typing.Any):
+ if len(childs) == 1 and isinstance(childs[0], (list, tuple, set)):
+ self._childs = set(childs[0])
+ else:
+ self._childs = set(childs)
+
+ def __contains__(self, needle: typing.Any) -> bool:
+ return needle in self._childs
+
+ def __iter__(self) -> typing.Iterator[typing.Any]:
+ return iter(self._childs)
+
+ def __len__(self) -> int:
+ return len(self._childs)
+
+ def __repr__(self) -> str:
+ return f'{type(self).__name__}({self._childs})'
+
+ def __hash__(self) -> int:
+        return hash((type(self), frozenset(self._childs)))  # frozenset keeps the hash independent of iteration order
+
+ def __eq__(self, other: typing.Any) -> bool:
+ return isinstance(other, type(self)) \
+ and self._childs == other._childs
+
+ @abc.abstractmethod
+ def __call__(self, path: str) -> bool: # pylint: disable=arguments-differ
+ """Check if *path* satisfies the conditions set by the Matcher instance."""
+
+class NOT(Matcher):
+ """Invert a matcher result."""
+ def __init__(self, expr: Matcher):
+ super().__init__(expr)
+ def __call__(self, path: str) -> bool:
+ return not next(iter(self._childs))(path)
+
+# aggregate nodes
+
+class Aggregate(Matcher): # pylint: disable=too-few-public-methods # Yeah, it's an interface...
+ """Aggregation function base class (And, Or)."""
+
+class And(Aggregate):
+ """Accept only if all conditions are satisfied."""
+ def __call__(self, path: str) -> bool:
+ for itm in self:
+ if not itm(path):
+ return False
+ return True
+
+class Or(Aggregate):
+ """Accept only if at least one condition is satisfied."""
+ def __call__(self, path: str) -> bool:
+ for itm in self:
+ if itm(path):
+ return True
+ return False
+
+
+# criteria nodes
+
+class Criterion(Matcher):
+ """Criterion base class. Limits acceptance to certain values."""
+ def accepted(self) -> typing.Set[typing.Any]:
+ """Return a set of accepted values."""
+ return self._childs
+
+# criteria w/o value (valueless)
+
+class Any(Criterion):
+ """Accepts anything."""
+ def __call__(self, path: str) -> bool:
+ return True
+
+class Nothing(Criterion):
+ """Accepts nothing."""
+ def __call__(self, path: str) -> bool:
+ return False
+
+class Exists(Criterion):
+ """Filters by existence."""
+ def __call__(self, path: str) -> bool:
+ return os.path.exists(path)
+
+class IsFile(Criterion):
+ """Checks if the path is a regular file."""
+ def __call__(self, path: str) -> bool:
+ return os.path.isfile(path)
+
+class IsDir(Criterion):
+ """Checks if the path is a directory."""
+ def __call__(self, path: str) -> bool:
+ return os.path.isdir(path)
+
+class IsLink(Criterion):
+ """Checks if the path is a link."""
+ def __call__(self, path: str) -> bool:
+ return os.path.islink(path)
+
+class IsAbs(Criterion):
+ """Checks if the path is an absolute path."""
+ def __call__(self, path: str) -> bool:
+ return os.path.isabs(path)
+
+class IsRel(Criterion):
+ """Checks if the path is a relative path."""
+ def __call__(self, path: str) -> bool:
+ return not os.path.isabs(path)
+
+class IsMount(Criterion):
+ """Checks if the path is a mount point."""
+ def __call__(self, path: str) -> bool:
+ return os.path.ismount(path)
+
+class IsEmpty(Criterion):
+ """Checks if the path is an empty file."""
+ def __call__(self, path: str) -> bool:
+ return os.path.exists(path) and os.stat(path).st_size == 0
+
+class IsReadable(Criterion):
+ """Checks if the path is readable."""
+ def __call__(self, path: str) -> bool:
+ return os.path.exists(path) and os.access(path, os.R_OK)
+
+class IsWritable(Criterion):
+ """Checks if the path is writable."""
+ def __call__(self, path: str) -> bool:
+ return os.path.exists(path) and os.access(path, os.W_OK)
+
+class IsExecutable(Criterion):
+ """Checks if the path is executable."""
+ def __call__(self, path: str) -> bool:
+ return os.path.exists(path) and os.access(path, os.X_OK)
+
+# criteria w/ value
+
+class Extension(Criterion):
+ """Filters by file extension (without the dot)."""
+ def __call__(self, path: str) -> bool:
+ _, ext = os.path.splitext(path)
+ return ext[1:] in self.accepted()
+
+class Mime(Criterion):
+ """Filters by mime type."""
+ def __call__(self, path: str) -> bool:
+ try:
+ return magic.from_file(path, mime=True).lower() in self.accepted()
+ except FileNotFoundError:
+ return False
+
+## EOF ##
diff --git a/bsie/utils/filematcher/parser.py b/bsie/utils/filematcher/parser.py
new file mode 100644
index 0000000..dc28a0d
--- /dev/null
+++ b/bsie/utils/filematcher/parser.py
@@ -0,0 +1,141 @@
+
+# standard imports
+import typing
+
+# external imports
+import pyparsing
+from pyparsing import printables, alphas8bit, punc8bit, QuotedString, Word, \
+ delimitedList, Or, CaselessKeyword, Group, oneOf, Optional
+
+# inner-module imports
+from . import matcher
+from .. import errors
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'parse',
+ )
+
+
+## code ##
+
+class FileMatcherParser():
+ """
+    EXPR := RULESET | RULESET "|" RULESET
+ RULESET := RULE | RULE, RULE
+ RULE := CRITERION OP VALUE | CRITERION OP {VALUES} | VALUELESS
+ OP := != | =
+ VALUES := VALUE | VALUE, VALUE
+ VALUE := [word]
+ CRITERION := mime | extension | ...
+ """
+
+ # criteria matcher nodes w/ arguments
+ _CRITERIA: typing.Dict[str, typing.Type[matcher.Matcher]] = {
+ 'extension': matcher.Extension,
+ 'mime': matcher.Mime,
+ }
+
+ # criteria matcher nodes w/o arguments
+ _VALUELESS: typing.Dict[str, typing.Type[matcher.Matcher]] = {
+ 'any': matcher.Any,
+ 'nothing': matcher.Nothing,
+ 'exists': matcher.Exists,
+ 'isfile': matcher.IsFile,
+ 'isdir': matcher.IsDir,
+ 'islink': matcher.IsLink,
+ 'isabs': matcher.IsAbs,
+ 'isrel': matcher.IsRel,
+ 'ismount': matcher.IsMount,
+        'empty': matcher.IsEmpty,
+ 'readable': matcher.IsReadable,
+ 'writable': matcher.IsWritable,
+ 'executable': matcher.IsExecutable,
+ }
+
+ # pyparsing parser instance.
+ _parser: pyparsing.ParseExpression
+
+ def __init__(self):
+ # build the parser
+ # VALUE := [word]
+ alphabet = (printables + alphas8bit + punc8bit).translate(str.maketrans('', '', ',{}|='))
+ value = QuotedString(quoteChar='"', escChar='\\') ^ Word(alphabet)
+ # CRITERION := mime | extension | ...
+ criterion = Or([CaselessKeyword(p) for p in self._CRITERIA]).setResultsName('criterion')
+ valueless = Or([CaselessKeyword(p) for p in self._VALUELESS]).setResultsName('criterion')
+ # VALUES := VALUE | VALUE, VALUE
+ values = delimitedList(value, delim=',').setResultsName('value')
+ # OP := '=' | '!='
+ eqop = oneOf('= !=').setResultsName('op')
+ # RULE := CRITERION OP VALUE | CRITERION OP {VALUES} | VALUELESS
+ rule_none = Group(Optional('!').setResultsName('op') + valueless).setResultsName('rule_none')
+ rule_one = Group(criterion + eqop + value.setResultsName('value')).setResultsName('rule_one')
+ rule_few = Group(criterion + eqop + '{' + values + '}').setResultsName('rule_few')
+ # RULESET := RULE | RULE, RULE
+ ruleset = Group(delimitedList(rule_none ^ rule_one ^ rule_few, delim=','))
+ # EXPR := RULESET | RULESET \| RULESET
+ self._parser = delimitedList(ruleset, delim='|')
+
+ def parse(self, query: str) -> matcher.Matcher: # pylint: disable=too-many-branches
+ """Build a file matcher from a rule definition."""
+ # preprocess the query
+ query = query.strip()
+
+ # empty query
+ if len(query) == 0:
+ return matcher.Any()
+
+ try:
+ parsed = self._parser.parseString(query, parseAll=True)
+ except pyparsing.ParseException as err:
+            raise errors.ParserError(f'cannot parse query: {err}') from err
+
+ # convert to Matcher
+ rules = []
+ for exp in parsed:
+ tokens = []
+ for rule in exp:
+ # fetch accepted values
+ if rule.getName() == 'rule_none':
+ accepted = []
+ elif rule.getName() == 'rule_one':
+ accepted = [rule.value]
+ elif rule.getName() == 'rule_few':
+ accepted = list(rule.value)
+ else: # prevented by grammar
+ raise errors.UnreachableError('Invalid rule definition')
+
+ # build criterion
+ if rule.criterion in self._VALUELESS:
+ cls = self._VALUELESS[rule.criterion]
+ if rule.op == '!':
+ tokens.append(matcher.NOT(cls()))
+ else:
+ tokens.append(cls())
+ elif rule.criterion in self._CRITERIA:
+ cls = self._CRITERIA[rule.criterion]
+ if rule.op == '!=':
+ tokens.append(matcher.NOT(cls(accepted)))
+ else:
+ tokens.append(cls(accepted))
+ else: # prevented by grammar
+ raise errors.UnreachableError(f'Invalid condition "{rule.criterion}"')
+
+ # And-aggregate rules in one ruleset (if needed)
+ tokens = matcher.And(tokens) if len(tokens) > 1 else tokens[0]
+ rules.append(tokens)
+
+ # Or-aggregate rulesets
+ expr = matcher.Or(rules) if len(rules) > 1 else rules[0]
+
+ return expr
+
+# build default instance
+file_match_parser = FileMatcherParser()
+
+def parse(query: str) -> matcher.Matcher:
+ """Shortcut for FileMatcherParser()(query)."""
+ return file_match_parser.parse(query)
+
+## EOF ##
diff --git a/bsie/utils/loading.py b/bsie/utils/loading.py
new file mode 100644
index 0000000..58202d1
--- /dev/null
+++ b/bsie/utils/loading.py
@@ -0,0 +1,49 @@
+
+# standard imports
+import importlib
+import typing
+
+# inner-module imports
+from . import errors
+
+# exports
+__all__: typing.Sequence[str] = (
+ 'safe_load',
+ 'unpack_qualified_name',
+ )
+
+
+## code ##
+
+def safe_load(module_name: str, class_name: str):
+ """Get a class from a module. Raise BuilderError if anything goes wrong."""
+ try:
+ # load the module
+ module = importlib.import_module(module_name)
+ except Exception as err:
+ # cannot import module
+ raise errors.LoaderError(f'cannot load module {module_name} ({err})') from err
+
+ try:
+ # get the class from the module
+ cls = getattr(module, class_name)
+ except Exception as err:
+ # cannot find the class
+ raise errors.LoaderError(f'cannot load class {class_name} from module {module_name} ({err})') from err
+
+ return cls
+
+
+def unpack_qualified_name(name):
+ """Split a name into its module and class component (dot-separated)."""
+ if not isinstance(name, str):
+ raise TypeError(name)
+ if '.' not in name:
+ raise ValueError('name must be a qualified class name.')
+ module_name, class_name = name[:name.rfind('.')], name[name.rfind('.')+1:]
+ if module_name == '':
+ raise ValueError('name must be a qualified class name.')
+ return module_name, class_name
+
+
+## EOF ##
diff --git a/bsie/utils/namespaces.py b/bsie/utils/namespaces.py
index a29fc1b..4a66048 100644
--- a/bsie/utils/namespaces.py
+++ b/bsie/utils/namespaces.py
@@ -1,26 +1,37 @@
"""Default namespaces used throughout BSIE.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# inner-module imports
from . import bsfs as _bsfs
-# constants
-bse = _bsfs.Namespace('http://bsfs.ai/schema/Entity')
-bsfs = _bsfs.Namespace('http://bsfs.ai/schema', fsep='/')
-bsm = _bsfs.Namespace('http://bsfs.ai/schema/Meta')
-xsd = _bsfs.Namespace('http://www.w3.org/2001/XMLSchema')
+# generic namespaces
+xsd = _bsfs.Namespace('http://www.w3.org/2001/XMLSchema')()
+
+# core bsfs/bsie namespaces
+bsfs = _bsfs.Namespace('https://schema.bsfs.io/core')
+bsie = _bsfs.Namespace('https://schema.bsfs.io/ie')
+
+# auxiliary namespaces
+bsd = bsie.distance()
+bse = bsie.Node.Entity()
+bsf = bsie.Literal.Array.Feature
+bsl = bsfs.Literal
+bsn = bsie.Node
+bsp = bsie.Node.Preview()
# export
__all__: typing.Sequence[str] = (
+ 'bsd',
'bse',
+ 'bsf',
'bsfs',
- 'bsm',
+ 'bsie',
+ 'bsl',
+ 'bsn',
+ 'bsp',
'xsd',
)
diff --git a/bsie/utils/node.py b/bsie/utils/node.py
index ecf39cd..fa34b2e 100644
--- a/bsie/utils/node.py
+++ b/bsie/utils/node.py
@@ -1,10 +1,6 @@
"""Lighweight Node to bridge to BSFS.
-
-Part of the bsie module.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
"""
-# imports
+# standard imports
import typing
# bsie imports
@@ -19,30 +15,47 @@ __all__: typing.Sequence[str] = (
## code ##
class Node():
- """Lightweight Node, disconnected from any bsfs structures."""
+ """Lightweight Node, disconnected from any bsfs structures.
+
+ In most cases, provide *hints* and leave setting the uri to a node
+    naming policy. Only provide a *uri* if it is absolutely determined.
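+
+    A brief usage sketch (the constructor call mirrors the pipeline in this patch)::
+
+        Node(ns.bsn.Entity, ucid=bsfs.uuid.UCID.from_path(path))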
+
+ """
# node type.
node_type: bsfs.URI
# node URI.
- uri: bsfs.URI
+ uri: typing.Optional[bsfs.URI]
+
+ # node naming hints.
+    hints: dict
def __init__(
self,
node_type: bsfs.URI,
- uri: bsfs.URI,
+ uri: typing.Optional[bsfs.URI] = None,
+ **uri_hints,
):
# assign members
self.node_type = bsfs.URI(node_type)
- self.uri = bsfs.URI(uri)
+ self.hints = uri_hints
+ self.uri = uri
def __eq__(self, other: typing.Any) -> bool:
+ """Compare two Node instances based on type and uri.
+        Compares hints only if the uri is not yet specified.
+ """
return isinstance(other, Node) \
and other.node_type == self.node_type \
- and other.uri == self.uri
+ and other.uri == self.uri \
+ and (self.uri is not None or self.hints == other.hints)
def __hash__(self) -> int:
- return hash((type(self), self.node_type, self.uri))
+ identifier = self.uri
+ if identifier is None:
+ identifier = tuple((key, self.hints[key]) for key in sorted(self.hints))
+ return hash((type(self), self.node_type, identifier))
def __str__(self) -> str:
return f'{bsfs.typename(self)}({self.node_type}, {self.uri})'
diff --git a/doc/Makefile b/doc/Makefile
new file mode 100644
index 0000000..d0c3cbf
--- /dev/null
+++ b/doc/Makefile
@@ -0,0 +1,20 @@
+# Minimal makefile for Sphinx documentation
+#
+
+# You can set these variables from the command line, and also
+# from the environment for the first two.
+SPHINXOPTS ?=
+SPHINXBUILD ?= sphinx-build
+SOURCEDIR = source
+BUILDDIR = build
+
+# Put it first so that "make" without argument is like "make help".
+help:
+ @$(SPHINXBUILD) -M help "$(SOURCEDIR)" "$(BUILDDIR)" $(SPHINXOPTS) $(O)
+
+.PHONY: help Makefile
+
+# Catch-all target: route all unknown targets to Sphinx using the new
+# "make mode" option. $(O) is meant as a shortcut for $(SPHINXOPTS).
+%: Makefile
+ @$(SPHINXBUILD) -M $@ "$(SOURCEDIR)" "$(BUILDDIR)" $(SPHINXOPTS) $(O)
diff --git a/doc/make.bat b/doc/make.bat
new file mode 100644
index 0000000..747ffb7
--- /dev/null
+++ b/doc/make.bat
@@ -0,0 +1,35 @@
+@ECHO OFF
+
+pushd %~dp0
+
+REM Command file for Sphinx documentation
+
+if "%SPHINXBUILD%" == "" (
+ set SPHINXBUILD=sphinx-build
+)
+set SOURCEDIR=source
+set BUILDDIR=build
+
+%SPHINXBUILD% >NUL 2>NUL
+if errorlevel 9009 (
+ echo.
+ echo.The 'sphinx-build' command was not found. Make sure you have Sphinx
+ echo.installed, then set the SPHINXBUILD environment variable to point
+ echo.to the full path of the 'sphinx-build' executable. Alternatively you
+ echo.may add the Sphinx directory to PATH.
+ echo.
+ echo.If you don't have Sphinx installed, grab it from
+ echo.https://www.sphinx-doc.org/
+ exit /b 1
+)
+
+if "%1" == "" goto help
+
+%SPHINXBUILD% -M %1 %SOURCEDIR% %BUILDDIR% %SPHINXOPTS% %O%
+goto end
+
+:help
+%SPHINXBUILD% -M help %SOURCEDIR% %BUILDDIR% %SPHINXOPTS% %O%
+
+:end
+popd
diff --git a/doc/source/architecture.rst b/doc/source/architecture.rst
new file mode 100644
index 0000000..750319e
--- /dev/null
+++ b/doc/source/architecture.rst
@@ -0,0 +1,71 @@
+
+Architecture
+============
+
+
+The information extraction pipeline traverses through three stages of abstraction:
+
+1. File format
+2. Content
+3. Predicate-value pairs
+
+For example, an image can be stored in various file formats (JPEG, TIFF, PNG).
+In turn, a file format can store different kinds of information such as the image data (pixels) and additional metadata (image dimensions, EXIF tags).
+Finally, we translate the information read from the file into predicate-value pairs that can be attached to a file node in BSFS, e.g., ``(bse:filesize, 8150000)``, ``(bse:width, 6000)``, ``(bse:height, 4000)``, ``(bse:iso, 100)``, etc.
+
+The extraction pipeline is thus divided into
+:mod:`Readers <bsie.reader>` that abstract from file formats and content types,
+and :mod:`Extractors <bsie.extractor>` which produce predicate-value pairs from content artifacts.
+
+
+Readers
+-------
+
+:mod:`Readers <bsie.reader>` read the actual file (considering different file formats)
+and isolate specific content artifacts therein.
+The content artifact (in an internal representation)
+is then passed to an Extractor for further processing.
+
+For example, the :class:`Image <bsie.reader.image.Image>` reader aims at reading the content (pixels) of an image file.
+It automatically detects which python package (e.g., `rawpy`_, `pillow`_)
+to use when faced with the various existing image file formats.
+The image data is then converted into a PIL.Image instance
+(irrespective of which package was used to read the data),
+and passed on to the extractor.
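+
+A minimal usage sketch (class and error names as introduced in this patch; the
+file name is a placeholder)::
+
+    from bsie.reader.image import Image
+
+    reader = Image()
+    # Returns a PIL.Image.Image; unreadable or unsupported files raise a
+    # ReaderError or UnsupportedFileFormatError, respectively.
+    picture = reader('photo.nef')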
+
+
+Extractors
+----------
+
+:mod:`Extractors <bsie.extractor>` turn content artifacts into
+predicate-value pairs that can be inserted into a BSFS storage.
+The predicate is defined by each extractor, as prescribed by BSFS' schema handling.
+
+For example, the class :class:`ColorsSpatial <bsie.extractor.image.colors_spatial.ColorsSpatial>`
+determines regionally dominant colors from given pixel data.
+It then produces a feature vector and attaches it to the image file via the appropriate predicate.
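+
+Schematically, an extractor turns a subject node and a content artifact into
+triples (the signature below is the one used by the pipeline in this patch)::
+
+    for subject, predicate, value in extractor.extract(subject, content, principals):
+        ...  # hand the triples over to BSFS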
+
+
+BSIE lib and apps
+-----------------
+
+The advantage of separating the reading and extraction steps is that multiple extractors
+can consume the same content, avoiding multiple re-reads of the same data.
+This close interaction between readers and extractors is encapsulated
+within the :class:`Pipeline <bsie.lib.pipeline.Pipeline>` class.
+
+Also, having to deal with various file formats and content artifacts
+potentially pulls in a large number of dependencies.
+To make matters worse, many of those might not be needed in a specific scenario,
+e.g., if a user only works with a limited set of file formats.
+BSIE therefore implements a best-effort approach,
+that is, modules that cannot be imported due to missing dependencies are ignored.
+
+With these two concerns taken care of,
+BSIE offers a few :mod:`end-user applications <bsie.apps>`
+that reduce the complexity of the task to a relatively simple command.
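+
+For instance, the index application can be invoked programmatically (flags and
+module path as exercised by the test suite; host, user, and path are
+placeholders, and a ``--config`` file may be passed as well)::
+
+    from bsie.apps.index import main
+
+    main(['--host', 'http://example.com', '--user', 'me', '-r', '/path/to/files'])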
+
+
+
+.. _pillow: https://python-pillow.org/
+.. _rawpy: https://github.com/letmaik/rawpy
diff --git a/doc/source/conf.py b/doc/source/conf.py
new file mode 100644
index 0000000..017e036
--- /dev/null
+++ b/doc/source/conf.py
@@ -0,0 +1,37 @@
+# Configuration file for the Sphinx documentation builder.
+#
+# For the full list of built-in configuration values, see the documentation:
+# https://www.sphinx-doc.org/en/master/usage/configuration.html
+
+# -- Project information -----------------------------------------------------
+# https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information
+
+project = 'Black Star Information Extraction'
+copyright = '2023, Matthias Baumgartner'
+author = 'Matthias Baumgartner'
+release = '0.5'
+
+# -- General configuration ---------------------------------------------------
+# https://www.sphinx-doc.org/en/master/usage/configuration.html#general-configuration
+
+extensions = [
+ 'sphinx_copybutton',
+ 'sphinx.ext.autodoc',
+ ]
+
+templates_path = ['_templates']
+exclude_patterns = []
+
+
+
+# -- Options for HTML output -------------------------------------------------
+# https://www.sphinx-doc.org/en/master/usage/configuration.html#options-for-html-output
+
+html_theme = 'furo'
+html_static_path = ['_static']
+
+html_title = 'bsie'
+html_theme_options = {
+ 'announcement': '<em>This project is under heavy development and subject to rapid changes. Use at your own discretion.</em>',
+ }
+
diff --git a/doc/source/index.rst b/doc/source/index.rst
new file mode 100644
index 0000000..9cf06fe
--- /dev/null
+++ b/doc/source/index.rst
@@ -0,0 +1,26 @@
+
+Black Star Information Extraction
+=================================
+
+A major advantage of the `Black Star File System (BSFS) <https://www.bsfs.io/bsfs/>`_
+is its ability to store various kinds of (meta)data associated with a file.
+However, BSFS itself is only a storage solution;
+it does not inspect files or collect information about them.
+
+The Black Star Information Extraction (BSIE) package fills this gap by
+extracting various kinds of information from a file and pushing that data to a BSFS instance.
+
+BSIE has the ability to process numerous file formats,
+and it can turn various aspects of a file into usable information.
+This includes metadata from a source file system,
+metadata stored within the file,
+and even excerpts or feature representations of the file's content itself.
+
+.. toctree::
+ :maxdepth: 1
+
+ installation
+ architecture
+ api/modules
+
+
diff --git a/doc/source/installation.rst b/doc/source/installation.rst
new file mode 100644
index 0000000..ee6fadb
--- /dev/null
+++ b/doc/source/installation.rst
@@ -0,0 +1,75 @@
+
+Installation
+============
+
+You can install *bsie* via pip. BSIE comes with support for various file formats.
+For this, it relies on a number of external packages. BSIE lets you control
+which of these you want to install. Note that if you choose not to install
+support for some file types, BSIE will show a warning and skip them.
+All other formats will be processed normally.
+It is recommended to install *bsie* in a virtual environment (via ``virtualenv``).
+
+To install only the minimally required software, use::
+
+ pip install --extra-index-url https://pip.bsfs.io bsie
+
+To install all dependencies, use the following shortcut::
+
+ pip install --extra-index-url https://pip.bsfs.io bsie[all]
+
+To install a subset of all dependencies, modify the extras part (``[image, preview]``)
+of the following command to your liking::
+
+ pip install --extra-index-url https://pip.bsfs.io bsie[image,preview]
+
+Currently, BSIE provides the following extra flags:
+
+* image: Read data from image files.
+ Note that you may also have to install ``exiftool`` through your system's
+ package manager (e.g. ``sudo apt install exiftool``).
+* preview: Create previews from a variety of files.
+ Note that support for various file formats also depends on what
+ system packages you've installed. You should at least install ``imagemagick``
+ through your system's package manager (e.g. ``sudo apt install imagemagick``).
+ See `Preview Generator <https://github.com/algoo/preview-generator>`_ for
+ more detailed instructions.
+* features: Extract feature vectors from images.
+
+
+
+License
+-------
+
+This project is released under the terms of the 3-clause BSD License.
+By downloading or using the application you agree to the license's terms and conditions.
+
+.. literalinclude:: ../../LICENSE
+
+
+Source
+------
+
+Check out our git repository::
+
+ git clone https://git.bsfs.io/bsie.git
+
+You can further install *bsie* via the usual `setuptools <https://setuptools.pypa.io/en/latest/index.html>`_ commands from your bsie source directory::
+
+ python setup.py develop
+
+For development, you also need to install some additional dependencies::
+
+ # extra packages for tests
+ pip install rdflib requests
+
+ # code style discipline
+ pip install mypy coverage pylint
+ # external type annotations for pyyaml
+ pip install types-PyYAML
+
+ # documentation
+ pip install sphinx sphinx-copybutton furo
+
+ # packaging
+ pip install build
+
diff --git a/setup.py b/setup.py
index ee9e0fd..b1f5b2c 100644
--- a/setup.py
+++ b/setup.py
@@ -1,20 +1,73 @@
-from setuptools import setup
+from setuptools import setup, find_packages
import os
+extras = {
+ # NOTE: an 'all' extra is added automatically
+ 'features': [
+ # image feature extractors
+ 'numpy',
+ ],
+ 'preview': [
+ # preview readers
+ 'preview_generator', # also depends on some system packages
+ 'pillow',
+ 'rawpy',
+ ],
+ 'image': [
+ # image readers
+ 'pillow',
+ 'rawpy',
+ # exif reader
+ 'pyexiv2',
+ ],
+ }
+
+
setup(
+ # package metadata
name='bsie',
- version='0.0.1',
+ version='0.23.03',
author='Matthias Baumgartner',
- author_email='dev@igsor.net',
+ author_email='dev@bsfs.io',
description='Extract information from files and store them in a BSFS.',
- long_description=open(os.path.join(os.path.dirname(__file__), 'README')).read(),
+ long_description=open(os.path.join(os.path.dirname(__file__), 'README.md')).read(),
license='BSD',
license_files=('LICENSE', ),
- url='https://www.igsor.net/projects/blackstar/bsie/',
- download_url='https://pip.igsor.net',
- packages=('bsie', ),
- install_requires=('rdflib', 'bsfs'),
+ url='https://www.bsfs.io/bsie/',
+ download_url='https://pip.bsfs.io',
+
+ # packages
+    packages=find_packages(include=['bsie', 'bsie.*']),
+ package_dir={'bsie': 'bsie'},
+ # data files are included if mentioned in MANIFEST.in
+ include_package_data=True,
+
+ # entrypoints
+ entry_points={
+ 'console_scripts': [
+ 'bsie = bsie.apps:main',
+ ],
+ },
+
+ # dependencies
python_requires=">=3.7",
-)
+ install_requires=(
+ 'bsfs',
+ 'pyparsing',
+ 'python-magic',
+ 'pyyaml',
+ ),
+ extras_require=dict(
+ # development targets
+ build=['build'],
+ dev=['coverage', 'mypy', 'pylint'],
+ doc=['sphinx', 'sphinx-copybutton', 'furo'],
+ test=['rdflib', 'requests', 'types-PyYAML'],
+ # add 'all'
+ all=list({pkg for ext in extras.values() for pkg in ext}),
+ # add extras
+ **extras
+ ),
+ )
diff --git a/test/apps/test_index.py b/test/apps/test_index.py
index 9cdc656..6927044 100644
--- a/test/apps/test_index.py
+++ b/test/apps/test_index.py
@@ -1,16 +1,15 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import contextlib
import io
import os
-import rdflib
+import tempfile
import unittest
+# external imports
+import rdflib
+import yaml
+
# bsie imports
from bsie.utils import ns
@@ -21,134 +20,283 @@ from bsie.apps.index import main
## code ##
class TestIndex(unittest.TestCase):
+ def setUp(self):
+ config = {
+ 'ReaderBuilder': {},
+ 'ExtractorBuilder': [
+ {'bsie.extractor.preview.Preview': {
+ 'max_sides': [50],
+ }},
+ {'bsie.extractor.generic.path.Path': {}},
+ {'bsie.extractor.generic.constant.Constant': {
+ 'schema': '''
+ bse:author rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ ''',
+ 'tuples': [['https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I']],
+ }},
+ {'bsie.extractor.image.colors_spatial.ColorsSpatial': {
+ 'width': 2,
+ 'height': 2,
+ 'exp': 2,
+ }},
+ ]
+ }
+ # create config file
+ _, self.config_path = tempfile.mkstemp(prefix='bsie-test-', suffix='.yaml')
+ with open(self.config_path, 'wt') as cfile:
+ yaml.dump(config, cfile)
+
+ def tearDown(self):
+ if os.path.exists(self.config_path):
+ os.unlink(self.config_path)
+
+ def test_disclaimer(self):
+ print('Please wait, this test will take about 25 seconds')
+
+ def test_main_invalid(self):
+ outbuf = io.StringIO()
+ with contextlib.redirect_stdout(outbuf):
+ bsfs = main(['--config', self.config_path, os.path.join(os.path.dirname(__file__), 'inexistent-file.t')])
+ self.assertEqual(outbuf.getvalue().strip(), '')
+
def test_main(self):
bsfs = main([
+ '--config',
+ self.config_path,
'-r',
- '--user', 'http://example.com/me',
+ '--host', 'http://example.com',
+ '--user', 'me',
os.path.join(os.path.dirname(__file__), 'testdir'),
os.path.join(os.path.dirname(__file__), 'testfile'),
])
- prefix = 'http://example.com/me/file#'
+ pre_file = 'http://example.com/me/file#'
+ pre_preview = 'http://example.com/me/preview#'
self.assertTrue(set(bsfs._backend._graph).issuperset({
- (rdflib.URIRef(prefix + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('alpha_second', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('696', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('omega_second', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('503', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('td_first', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('911', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('testfile', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('885', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('bar_first', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('956', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('omega_first', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('648', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('alpha_first', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('754', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('foo_second', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('585', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('bar_second', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('636', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('foo_first', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('546', datatype=rdflib.XSD.integer)),
- (rdflib.URIRef(prefix + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.RDF.type, rdflib.URIRef(ns.bsfs.File)),
- (rdflib.URIRef(prefix + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('td_second', datatype=rdflib.XSD.string)),
- (rdflib.URIRef(prefix + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bse.filesize), rdflib.Literal('703', datatype=rdflib.XSD.integer)),
+ # files and properties
+ (rdflib.URIRef(pre_file + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('alpha_second', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('omega_second', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('td_first', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('testfile', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('bar_first', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('omega_first', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('alpha_first', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('foo_second', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('bar_second', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('foo_first', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('td_second', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Entity)),
+ (rdflib.URIRef(pre_file + 'accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089'), rdflib.URIRef(ns.bse.author), rdflib.Literal('Me, myself, and I', datatype=rdflib.XSD.string)),
+ (rdflib.URIRef(pre_file + 'accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089'), rdflib.URIRef(ns.bse.filename), rdflib.Literal('testimage.jpg', datatype=rdflib.XSD.string)),
+ # features
+ (rdflib.URIRef(pre_file + 'accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089'), rdflib.URIRef('https://schema.bsfs.io/ie/Node/Entity#colors_spatial_0658f2234a054e1dd59a14462c89f7733e019160419c796356aa831498bd0a04'),
+ rdflib.Literal(
+ '(91, 127, 121, 94, 138, 167, 163, 134, 190, 138, 170, 156, 121, 142, 159)',
+ datatype=rdflib.URIRef('https://schema.bsfs.io/ie/Literal/Array/Feature/ColorsSpatial#0658f2234a054e1dd59a14462c89f7733e019160419c796356aa831498bd0a04'))),
+ # links to previews
+ (rdflib.URIRef(pre_file + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50')),
+ (rdflib.URIRef(pre_file + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + 'a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50')),
+ (rdflib.URIRef(pre_file + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50')),
+ (rdflib.URIRef(pre_file + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + 'dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50')),
+ (rdflib.URIRef(pre_file + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50')),
+ (rdflib.URIRef(pre_file + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + 'df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50')),
+ (rdflib.URIRef(pre_file + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50')),
+ (rdflib.URIRef(pre_file + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50')),
+ (rdflib.URIRef(pre_file + 'accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50')),
+ (rdflib.URIRef(pre_file + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + 'a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50')),
+ (rdflib.URIRef(pre_file + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50')),
+ (rdflib.URIRef(pre_file + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bse.preview), rdflib.URIRef(pre_preview + '5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50')),
+ # preview dimensions
+ (rdflib.URIRef(pre_preview + '2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('33', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + '9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + '9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + 'a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + 'a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + 'dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ (rdflib.URIRef(pre_preview + 'df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50'), rdflib.URIRef(ns.bsp.height), rdflib.Literal('50', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50'), rdflib.URIRef(ns.bsp.width), rdflib.Literal('36', datatype=rdflib.URIRef('http://www.w3.org/2001/XMLSchema#integer'))),
+ (rdflib.URIRef(pre_preview + 'df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50'), rdflib.RDF.type, rdflib.URIRef(ns.bsn.Preview)),
+ # assets
+ (rdflib.URIRef(pre_preview + '2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/2wBDAQkJCQwLDBgNDRgyIRwhMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjL/wAARCAAhADIDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwDi9Ktb+O3khZTg/wAB7VSGnKkkhkAR85weteo3Vl9mvLtWjVWEJaN/9rsD7HkVwNza3kmsn7RYsDuzsdSVb/GvLo15W9rKNkVDmvzKN0dx4L1Sb+x2S7jZ7aM7BKF+XPoffkVdOpWDSSI9sGizk5HWuE8S69q0NnHptnB9ms7Nh8lr+6SSXALhsHJUcY7kitPTLi51nR0nMKpO6ZkCHABxngdq1xGKnSs1Kyvrc7qEMW2/Zrz/AKudnbXXhuaEiO3jjY9TtxVG8ht3mQwgOnaubuVmtbFV2gSjjn1q1prajJF+9dEQdMVjPHKtFxaXqc9fE1JXpzjr+J0Is7fA+VaKwmludx/0xevrRXLaH8xyfVZdjpNFsgsUlpryPkjyVuVJ6AnH8z/I1flS30m2ezvdt3bbd1teRL8yntu/xGfeua1zXtbs7dh5I8mRdhkD7mYEY5GOf51Jp+vW8Vnu1KT7FJKMmO5b5JcdwDxn1HFfR1KUZRd1v93zPoaFfD1J+5Kz+79DjfEV9Dc3E0hk5Zi5ZR1btx+NYNlrn9nllhkKgnPpnjr9Of1H0rrdc0bQtTvgsWbSRiwJjk2K+ADwrZ9RyOOa4/U/AWs21y0mmhL60dyI5IpVLduGGeCM/jXmPL201N3NK9SpfngrryOr0y+i1fT4lvZ9gR9pYfM5I9v8/wBK2/7FneFmCXEMLcIbhwpb3A6gVwGiaR4o03UYhbaZOZ88RqA27HXoeB9K9PgiYRRyal4Y1KKVhlyHbr3966MPgIRpuMtNROjTr+/JWn+P4mB/wix/5/o/+/lFdoLXT8DPhfUfx8yiuj6lT7v8P8hex85ffEZef8gu0+oriPiZ/rNI+j/zFFFbYn+Ez5uh8ZP4l/5Cq/8AYN/9nFU/CH/Hvd/9dv8A2Wiih/Ee7k/wv1/Q63Qv9fb/APXT+ldFrP8Ax/xfRP8A0IUUVX2T0K38RD5v9dJ/vH+dFFFUC2P/2Q==', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLq6eAgKoI25Oc+oHYe9Qfb5sr+6UZHUhuv5VYjnZyQSo9gpNTgP3YH6Lj+tPqCa2jmILrkgEVH/Z8HHy8DjGB/hUq26IMAkj0IH+FTUUUUUUUUUUUUUUUUUUUUUUUUUUV/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLm5eBwFVTkd89c+1Qf2hMSAI15/3v8KtR3DSFgIiceh7/AMv1qcHIzjFLVO7tnncFduAMc+v5GoDZXJAw65A/vdT/AN81PDasjMXPBGAMgj8toqxHGIxgfyA/lUlFFFFFFFFFFFFFFFFFFFFFFFFFf//Z', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLq5eFgEQEYySQf6CoDqEo2/u1yevDf4VJHfZ3eZhNvojHvj0q4h3IDnOR1AxmnVXntEuDlyR8u3gA8fiDUR02I4G9uOei9eeenvTvsS5J82Tn6cfpUsMPkrgSOw9Gxx+lTUUUUUUUUUUUUUUUUUUUUUUUUUV//9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLq6eBgFUHjPIJ7+wqudQmGMxpyMk4bA6+30/OrSyzFvm8kDrjecgflVhSGGVIIPcUtV57RLjBZmBAxwB/UGov7NiGMMw25wQF45z6VKtrGuc7mz6n3zVjpRRRRRRRRRRRRRRRRRRRRRRRRRRX/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLm5eFsIgPy55zyc47CoRfTkf6uPOOckjnn29qmS5dmbK52noq5OPzqwj7/wCFl/3hin1n36MzgqhbCkcKTz+ANUzA5Cjynx7IRjn/AHfT271d8gAEFGxzwCfX2WrMR2gIQfbg/wCAqaiiiiiiiiiiiiiiiiiiiiiiiiiv/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLq5eAgKoPyknIPb6ColvZywHlp79R+WRU3nT7iojHXqQR6VYUk9QR9adVK8t3mdSqggLjJxkHPuD/AJFVlsZ/l+RFIXGTtI6k9Npq3FbFdwYKAemAp/8AZR/WrKLtGMn9KdRRRRRRRRRRRRRRRRRRRRRRRRRX/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + '567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLm5eFwFVSCpPIPYj0HvVf+0ZiFxGnIychhj9KtQyTSNkiLZ7E5B/EVZorN1EZkXAJIQ4wM9x7GqYAAXCAgDHKnGMn0X/Oa0LeMJudcKx64iJz+OATVoOOh3E+oUipKayK33lB+opPKj/55r+VHlR8/Iv5U+iiiiiiiiiiiiiiiiiiiiiv/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + 'df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLm5eGQBVU5XPIJ7j0FQfb5sgiNcH2b39vapkvGPDIucjoTjBOPSrKuHHGencEU+ql1am4IIYA7cDIB757g1CunFWByAAOCMAg5yD07VcCEN0P/fZNS0UUUUUUUUUUUUUUUUUUUUUUUUUV/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + 'a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdLq6eBgFUH5c5IPqPQVANQmJUeWvPXhvUj09qspcM2cAnHHCE/rmpUl3nGx1+oxUlVLq1M7KQQPlI5APXHqDUKacyMDvHTsBk9fb3q4kXl5IYkk5OQB/ICpaKKKKKKKKKKKKKKKKKKKKKKKKKK/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + 'a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdbm4eF1CqDkZOQT3HoKr/b5vl/dpkj/awDnHp06fzqZLiRmIPk47fMQf1FXKKikgilOZEDHGOaZ9itsg+SuQMA08QoowNwHoGP+NO8serf99Gn0UUUUUUUUUUUUUUUUUUUUUUUUV//2Q==', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
+ (rdflib.URIRef(pre_preview + 'dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50'), rdflib.URIRef(ns.bsp.asset), rdflib.Literal('/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/wAALCAAyACQBAREA/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/9oACAEBAAA/APdbm5eFwqqDkZ5B9QOw96rHUZgoPlLkgno3b8Kel7IzspVePYgdcck1YilaTA3JnqQFPT86sVQvoXkYFUDEKRz7np0NVWtZcAiLkd8Dg/8AfPqO3rWhEk6thsbc9mz+mP61Zoooooooooooooooooooooooooor/9k=', datatype=rdflib.URIRef(ns.bsl.BinaryBlob))),
}))
# NOTE: we don't check ns.bsm.t_created since it depends on the execution time. Triples would look like this:
- # (rdflib.URIRef(prefix + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
- # (rdflib.URIRef(prefix + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
+ # (rdflib.URIRef(pre_file + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
+ # (rdflib.URIRef(pre_file + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
+ # (rdflib.URIRef(pre_file + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'), rdflib.URIRef(ns.bsm.t_created), rdflib.Literal('1670..........', datatype=rdflib.XSD.integer)),
+ # ...
# instead, we simply check if there's such a predicate for each file
- self.assertSetEqual({sub for sub, _ in bsfs._backend._graph.subject_objects(rdflib.URIRef(ns.bsm.t_created))}, {
- rdflib.URIRef(prefix + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'),
- rdflib.URIRef(prefix + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'),
- rdflib.URIRef(prefix + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'),
- rdflib.URIRef(prefix + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'),
- rdflib.URIRef(prefix + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'),
- rdflib.URIRef(prefix + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'),
- rdflib.URIRef(prefix + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'),
- rdflib.URIRef(prefix + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'),
- rdflib.URIRef(prefix + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'),
- rdflib.URIRef(prefix + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'),
- rdflib.URIRef(prefix + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'),
+ self.assertSetEqual({sub for sub, _ in bsfs._backend._graph.subject_objects(rdflib.URIRef(ns.bsfs.Node().t_created))}, {
+ rdflib.URIRef(pre_file + '2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647'),
+ rdflib.URIRef(pre_file + '441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece'),
+ rdflib.URIRef(pre_file + '69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871'),
+ rdflib.URIRef(pre_file + '78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926'),
+ rdflib.URIRef(pre_file + '80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3'),
+ rdflib.URIRef(pre_file + '976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795'),
+ rdflib.URIRef(pre_file + '997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3'),
+ rdflib.URIRef(pre_file + 'a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d'),
+ rdflib.URIRef(pre_file + 'b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70'),
+ rdflib.URIRef(pre_file + 'd43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d'),
+ rdflib.URIRef(pre_file + 'd803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1'),
+ rdflib.URIRef(pre_file + 'accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089'),
+ rdflib.URIRef(pre_preview + '26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50'),
+ rdflib.URIRef(pre_preview + 'a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50'),
+ rdflib.URIRef(pre_preview + '9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50'),
+ rdflib.URIRef(pre_preview + '2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50'),
+ rdflib.URIRef(pre_preview + '79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50'),
+ rdflib.URIRef(pre_preview + 'dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50'),
+ rdflib.URIRef(pre_preview + '5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50'),
+ rdflib.URIRef(pre_preview + '567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50'),
+ rdflib.URIRef(pre_preview + 'df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50'),
+ rdflib.URIRef(pre_preview + 'a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50'),
+ rdflib.URIRef(pre_preview + '7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50'),
+ rdflib.URIRef(pre_preview + '968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50'),
})
def test_print(self):
outbuf = io.StringIO()
with contextlib.redirect_stdout(outbuf):
bsfs = main([
+ '--config',
+ self.config_path,
'--print',
'-r',
- '--user', 'http://example.com/me',
+ '--host', 'http://example.com',
+ '--user', 'me',
os.path.join(os.path.dirname(__file__), 'testdir'),
os.path.join(os.path.dirname(__file__), 'testfile'),
])
- self.assertSetEqual(set(outbuf.getvalue().split('\n')) - {''}, {
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647) Predicate({ns.bse.filename}) alpha_second',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647) Predicate({ns.bse.filesize}) 696',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece) Predicate({ns.bse.filename}) omega_second',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece) Predicate({ns.bse.filesize}) 503',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871) Predicate({ns.bse.filename}) td_first',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871) Predicate({ns.bse.filesize}) 911',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926) Predicate({ns.bse.filename}) testfile',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926) Predicate({ns.bse.filesize}) 885',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3) Predicate({ns.bse.filename}) bar_first',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3) Predicate({ns.bse.filesize}) 956',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795) Predicate({ns.bse.filename}) omega_first',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795) Predicate({ns.bse.filesize}) 648',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3) Predicate({ns.bse.filename}) alpha_first',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3) Predicate({ns.bse.filesize}) 754',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d) Predicate({ns.bse.filename}) foo_second',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d) Predicate({ns.bse.filesize}) 585',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70) Predicate({ns.bse.filename}) bar_second',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70) Predicate({ns.bse.filesize}) 636',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#d43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#d43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d) Predicate({ns.bse.filename}) foo_first',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#d43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d) Predicate({ns.bse.filesize}) 546',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#d803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1) Predicate({ns.bse.author}) Me, myself, and I',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#d803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1) Predicate({ns.bse.filename}) td_second',
- f'Node(http://bsfs.ai/schema/File, http://example.com/me/file#d803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1) Predicate({ns.bse.filesize}) 703',
- })
+ self.assertTrue((set(outbuf.getvalue().split('\n')) - {''}).issuperset({
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647) Predicate({ns.bse.filename}) alpha_second',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece) Predicate({ns.bse.filename}) omega_second',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871) Predicate({ns.bse.filename}) td_first',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926) Predicate({ns.bse.filename}) testfile',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3) Predicate({ns.bse.filename}) bar_first',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795) Predicate({ns.bse.filename}) omega_first',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3) Predicate({ns.bse.filename}) alpha_first',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d) Predicate({ns.bse.filename}) foo_second',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70) Predicate({ns.bse.filename}) bar_second',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#d43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#d43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d) Predicate({ns.bse.filename}) foo_first',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#d803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#d803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1) Predicate({ns.bse.filename}) td_second',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089) Predicate({ns.bse.author}) Me, myself, and I',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089) Predicate({ns.bse.filename}) testimage.jpg',
+ # features
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089) Predicate(https://schema.bsfs.io/ie/Node/Entity#colors_spatial_0658f2234a054e1dd59a14462c89f7733e019160419c796356aa831498bd0a04) (91, 127, 121, 94, 138, 167, 163, 134, 190, 138, 170, 156, 121, 142, 159)',
+ # links to previews
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#2f4109b40107cc50e0884755a1a961ed126887e49b8dbaf0e146b2e226aa6647) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#441f3d10c8ff489fe8e33e639606512f6c463151cc429de7e554b9af670c2ece) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#69b98ecf7aff3e95b09688ba93331678eb8397817111f674c9558e6dd8f5e871) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#78f7eb7f0d8221cdb2cb26c978fa42a11f75eb87becc768f4474134cb1e06926) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#80818b8ec2ee1919116dba9c8a7e0a4608313cf3b463cd88e9ed77a700dd92d3) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#976d2ea0e58488678cc7e435fbfadabfb6eb6cf50ad51862f38f73729ed11795) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#997e2fbb7494a3818ec782d2bc87bf1cffafba6b9c0f658e4a6c18a723e944d3) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#a8af899ecdab60dfaea8ec7f934053624c80a1054539e163f2c7eaa986c2777d) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#accb115d266ad60c53cd01a7f7130f245886ce8eaf69bc85319febc11d9fe089) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#b8fd7fba818254166a6043195004138ebda6923e012442f819a2c49671136c70) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#d43758ace82154a1cc10ca0dfef63cb20dd831f9c87edd6dc06539eefe67371d) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50)',
+ f'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/file#d803187cbf3676ae9d38126270a6152c60431589aa3bb3824baf8954e9c097f1) Predicate({ns.bse.preview}) Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50)',
+ # preview dimensions
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50) Predicate({ns.bsp.height}) 33',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#2656e303d7218300326df73b64f312d8b37eb980358be27a38b5f63dae259be3_s50) Predicate({ns.bsp.width}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#26f16643b2570ac5b2d1f8c373d492cb724aae2dd8d71a0b63647838ed651254_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#567049149769e1d02e6af6cfee3991f7cf0cbc935cbf6a566047f40155fb13a8_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#5d1235838c3d501204bb09c2de563d7e4a7fd17b7ec4ff302221c0e88c4741aa_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#79cb8a7e6369361a4f4cb7ff729c1ed3fcf87204769623d6fbd6ebfae601e5c7_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#7a975385a110c21fcd12e238fab9501550fa02f6328749068a3bffd65e291027_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#968b9aa178585bc8d1fca0e4e32b8cf30b3941eff72f34e320584aaae8fd23ac_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#9827509a74a60dfceed11936f7f624e9c932f66c8c0d20d355d56f8c3c9b56b1_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#a63c84e647138a2b68113474212f6aee542b3707171ff178551db3c296e59817_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#a8b3245636074d5370283b690281abda8ffdff12ce8b1af77c8bc0a4c85be860_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#dbfd8ebc0557c4925e9ff8411629a74a15eca934a4c2a6bd3134dd81d2f95a36_s50) Predicate({ns.bsp.width}) 36',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50) Predicate({ns.bsp.height}) 50',
+ f'Node(https://schema.bsfs.io/ie/Node/Preview, http://example.com/me/preview#df2185d8927ccef65c92fc90b94e800b02791354d8dede9dd9aa0e2c2cb1e91e_s50) Predicate({ns.bsp.width}) 36',
+ # assets
+ # ... (not checked)
+ }))
## main ##
diff --git a/test/apps/test_info.py b/test/apps/test_info.py
index 6f4d98f..ffcaecf 100644
--- a/test/apps/test_info.py
+++ b/test/apps/test_info.py
@@ -1,15 +1,18 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import argparse
import contextlib
import io
+import os
+import tempfile
import unittest
+# external imports
+import yaml
+
+# bsie imports
+from bsie.utils import bsfs
+
# objects to test
from bsie.apps.info import main
@@ -17,17 +20,72 @@ from bsie.apps.info import main
## code ##
class TestIndex(unittest.TestCase):
+ def setUp(self):
+ config = {
+ 'ReaderBuilder': {},
+ 'ExtractorBuilder': [
+ {'bsie.extractor.preview.Preview': {
+ 'max_sides': [50],
+ }},
+ {'bsie.extractor.generic.path.Path': {}},
+ {'bsie.extractor.generic.constant.Constant': {
+ 'schema': '''
+ bse:author rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ ''',
+ 'tuples': [['https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I']],
+ }},
+ {'bsie.extractor.image.colors_spatial.ColorsSpatial': {
+ 'width': 2,
+ 'height': 2,
+ 'exp': 2,
+ }},
+ ]
+ }
+ # create config file
+ _, self.config_path = tempfile.mkstemp(prefix='bsie-test-', suffix='.yaml')
+ with open(self.config_path, 'wt') as cfile:
+ yaml.dump(config, cfile)
+
+ def tearDown(self):
+ if os.path.exists(self.config_path):
+ os.unlink(self.config_path)
+
def test_predicates(self):
outbuf = io.StringIO()
with contextlib.redirect_stdout(outbuf):
# show predicates infos
- main(['predicates'])
+ main(['--config', self.config_path, 'predicates'])
# verify output
self.assertSetEqual({pred for pred in outbuf.getvalue().split('\n') if pred != ''}, {
- 'http://bsfs.ai/schema/Entity#author',
- 'http://bsfs.ai/schema/Predicate',
- 'http://bsfs.ai/schema/Entity#filename',
- 'http://bsfs.ai/schema/Entity#filesize',
+ 'https://schema.bsfs.io/ie/Node/Entity#author',
+ 'https://schema.bsfs.io/core/Predicate',
+ 'https://schema.bsfs.io/ie/Node/Entity#filename',
+ 'https://schema.bsfs.io/ie/Node/Entity#colors_spatial_0658f2234a054e1dd59a14462c89f7733e019160419c796356aa831498bd0a04',
+ 'https://schema.bsfs.io/ie/Node/Entity#preview',
+ 'https://schema.bsfs.io/ie/Node/Preview#width',
+ 'https://schema.bsfs.io/ie/Node/Preview#height',
+ 'https://schema.bsfs.io/ie/Node/Preview#asset',
+ })
+
+ def test_schema(self):
+ outbuf = io.StringIO()
+ with contextlib.redirect_stdout(outbuf):
+ # show schema infos
+ main(['--config', self.config_path, 'schema'])
+ # verify output
+ schema = bsfs.schema.from_string(outbuf.getvalue())
+ self.assertSetEqual({pred.uri for pred in schema.predicates()}, {
+ 'https://schema.bsfs.io/ie/Node/Entity#author',
+ 'https://schema.bsfs.io/core/Predicate',
+ 'https://schema.bsfs.io/ie/Node/Entity#filename',
+ 'https://schema.bsfs.io/ie/Node/Entity#colors_spatial_0658f2234a054e1dd59a14462c89f7733e019160419c796356aa831498bd0a04',
+ 'https://schema.bsfs.io/ie/Node/Entity#preview',
+ 'https://schema.bsfs.io/ie/Node/Preview#width',
+ 'https://schema.bsfs.io/ie/Node/Preview#height',
+ 'https://schema.bsfs.io/ie/Node/Preview#asset',
})
def test_invalid(self):
diff --git a/test/apps/test_loader.py b/test/apps/test_loader.py
new file mode 100644
index 0000000..4670266
--- /dev/null
+++ b/test/apps/test_loader.py
@@ -0,0 +1,83 @@
+
+# standard imports
+import os
+import tempfile
+import unittest
+
+# external imports
+import yaml
+
+# objects to test
+from bsie.apps._loader import load_pipeline
+
+
+## code ##
+
+class TestLoader(unittest.TestCase):
+ def test_load_pipeline(self):
+ # config file can be empty
+ config = {
+ 'ReaderBuilder': {},
+ 'ExtractorBuilder': []
+ }
+ # create config file
+ _, path = tempfile.mkstemp(prefix='bsie-test-', suffix='.yaml')
+ with open(path, 'wt') as cfile:
+ yaml.dump(config, cfile)
+ # pipeline contains only default predicates
+ pipeline = load_pipeline(path)
+ self.assertSetEqual({pred.uri for pred in pipeline.schema.predicates()}, {
+ 'https://schema.bsfs.io/core/Predicate',
+ })
+
+ # pipeline is built according to configured extractors
+ config = {
+ 'ReaderBuilder': {},
+ 'ExtractorBuilder': [
+ {'bsie.extractor.preview.Preview': {
+ 'max_sides': [50],
+ }},
+ {'bsie.extractor.generic.path.Path': {}},
+ {'bsie.extractor.generic.constant.Constant': {
+ 'schema': '''
+ bse:author rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ ''',
+ 'tuples': [['https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I']],
+ }},
+ {'bsie.extractor.image.colors_spatial.ColorsSpatial': {
+ 'width': 2,
+ 'height': 2,
+ 'exp': 2,
+ }},
+ ]
+ }
+ # create config file
+ _, path = tempfile.mkstemp(prefix='bsie-test-', suffix='.yaml')
+ with open(path, 'wt') as cfile:
+ yaml.dump(config, cfile)
+ # pipeline contains all defined predicates
+ pipeline = load_pipeline(path)
+ self.assertSetEqual({pred.uri for pred in pipeline.schema.predicates()}, {
+ 'https://schema.bsfs.io/ie/Node/Entity#author',
+ 'https://schema.bsfs.io/core/Predicate',
+ 'https://schema.bsfs.io/ie/Node/Entity#filename',
+ 'https://schema.bsfs.io/ie/Node/Entity#colors_spatial_0658f2234a054e1dd59a14462c89f7733e019160419c796356aa831498bd0a04',
+ 'https://schema.bsfs.io/ie/Node/Entity#preview',
+ 'https://schema.bsfs.io/ie/Node/Preview#width',
+ 'https://schema.bsfs.io/ie/Node/Preview#height',
+ 'https://schema.bsfs.io/ie/Node/Preview#asset',
+ })
+
+ # config file must exist
+ self.assertRaises(OSError, load_pipeline, 'invalid.yaml')
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
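
The loader test above doubles as a usage recipe: a YAML file with a ReaderBuilder mapping and an ExtractorBuilder list is written to disk and handed to load_pipeline. Below is a minimal sketch of the same flow outside the test suite; it assumes bsie and PyYAML are installed, and the single Path extractor and the 'bsie-example-' file prefix are illustrative choices, not requirements.

    # Minimal sketch mirroring the config handling exercised above.
    import tempfile
    import yaml

    from bsie.apps._loader import load_pipeline

    config = {
        'ReaderBuilder': {},
        'ExtractorBuilder': [
            {'bsie.extractor.generic.path.Path': {}},
        ],
    }
    # write the config to a temporary YAML file
    _, path = tempfile.mkstemp(prefix='bsie-example-', suffix='.yaml')
    with open(path, 'wt') as cfile:
        yaml.dump(config, cfile)

    pipeline = load_pipeline(path)
    # the schema now contains the predicates contributed by the configured extractors
    print({pred.uri for pred in pipeline.schema.predicates()})
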
diff --git a/test/apps/test_main.py b/test/apps/test_main.py
new file mode 100644
index 0000000..4fa094b
--- /dev/null
+++ b/test/apps/test_main.py
@@ -0,0 +1,57 @@
+
+# standard imports
+import contextlib
+import io
+import json
+import os
+import tempfile
+import unittest
+import yaml
+
+# objects to test
+from bsie.apps import main
+
+
+## code ##
+
+class TestMain(unittest.TestCase):
+ def setUp(self):
+ config = {
+ 'ReaderBuilder': {},
+ 'ExtractorBuilder': [
+ {'bsie.extractor.generic.stat.Stat': {}},
+ {'bsie.extractor.generic.path.Path': {}},
+ ]
+ }
+ # create config file
+ _, self.config_path = tempfile.mkstemp(prefix='bsie-test-', suffix='.yaml')
+ with open(self.config_path, 'wt') as cfile:
+ yaml.dump(config, cfile)
+
+ def tearDown(self):
+ if os.path.exists(self.config_path):
+ os.unlink(self.config_path)
+
+ def test_main(self):
+ # must at least pass an app
+ with contextlib.redirect_stderr(io.StringIO()):
+ self.assertRaises(SystemExit, main, [])
+ # app takes over
+ with contextlib.redirect_stderr(io.StringIO()):
+ self.assertRaises(SystemExit, main, ['info'])
+ outbuf = io.StringIO()
+ with contextlib.redirect_stdout(outbuf):
+ main(['info', '--config', self.config_path, 'predicates'])
+ self.assertEqual(set(outbuf.getvalue().strip().split('\n')), {
+ 'https://schema.bsfs.io/ie/Node/Entity#filename',
+ 'https://schema.bsfs.io/ie/Node/Entity#filesize',
+ 'https://schema.bsfs.io/core/Predicate',
+ })
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
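
For reference, the entry point can also be driven programmatically exactly as test_main does. A short sketch, with the config file created the same way as in setUp and the extractor list purely illustrative:

    # Sketch of invoking the bsie entry point directly, as the test above does.
    import tempfile
    import yaml

    from bsie.apps import main

    _, path = tempfile.mkstemp(prefix='bsie-example-', suffix='.yaml')
    with open(path, 'wt') as cfile:
        yaml.dump({
            'ReaderBuilder': {},
            'ExtractorBuilder': [{'bsie.extractor.generic.path.Path': {}}],
        }, cfile)

    # prints one predicate URI per line, matching the assertions above
    main(['info', '--config', path, 'predicates'])
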
diff --git a/test/apps/testdir/testimage.jpg b/test/apps/testdir/testimage.jpg
new file mode 100644
index 0000000..c80bb48
--- /dev/null
+++ b/test/apps/testdir/testimage.jpg
Binary files differ
diff --git a/test/extractor/generic/test_constant.py b/test/extractor/generic/test_constant.py
index 9dbaced..77ee02b 100644
--- a/test/extractor/generic/test_constant.py
+++ b/test/extractor/generic/test_constant.py
@@ -1,10 +1,5 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import unittest
# bsie imports
@@ -20,11 +15,11 @@ class TestConstant(unittest.TestCase):
def test_extract(self):
schema = '''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
bse:comment rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "false"^^xsd:boolean .
'''
@@ -33,33 +28,33 @@ class TestConstant(unittest.TestCase):
(ns.bse.comment, 'the quick brown fox jumps over the lazy dog.'),
]
ext = Constant(schema, tuples)
- node = _node.Node(ns.bsfs.Entity, '') # Blank node
+ node = _node.Node(ns.bsn.Entity, '') # Blank node
p_author = ext.schema.predicate(ns.bse.author)
p_comment = ext.schema.predicate(ns.bse.comment)
- entity = ext.schema.node(ns.bsfs.Node).get_child(ns.bsfs.Entity)
- string = ext.schema.literal(ns.bsfs.Literal).get_child(ns.xsd.string)
+ entity = ext.schema.node(ns.bsfs.Node).child(ns.bsn.Entity)
+ string = ext.schema.literal(ns.bsfs.Literal).child(ns.xsd.string)
# baseline
self.assertSetEqual(set(ext.extract(node, None, (p_author, p_comment))),
{(node, p_author, 'Me, myself, and I'),
(node, p_comment, 'the quick brown fox jumps over the lazy dog.')})
# predicates is respected
- p_foobar = ext.schema.predicate(ns.bsfs.Predicate).get_child(ns.bse.foobar, domain=entity, range=entity)
+ p_foobar = ext.schema.predicate(ns.bsfs.Predicate).child(ns.bse.foobar, domain=entity, range=entity)
self.assertSetEqual(set(ext.extract(node, None, (p_author, p_foobar))),
{(node, p_author, 'Me, myself, and I')})
self.assertSetEqual(set(ext.extract(node, None, (p_comment, p_foobar))),
{(node, p_comment, 'the quick brown fox jumps over the lazy dog.')})
- p_barfoo = ext.schema.predicate(ns.bse.author).get_child(ns.bse.comment, domain=entity, range=string)
+ p_barfoo = ext.schema.predicate(ns.bse.author).child(ns.bse.comment, domain=entity, range=string)
self.assertSetEqual(set(ext.extract(node, None, (p_foobar, p_barfoo))), set())
def test_construct(self):
# schema compliance
schema = '''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
bse:comment rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "false"^^xsd:boolean .
'''
@@ -80,13 +75,13 @@ class TestConstant(unittest.TestCase):
def test_eq(self):
schema_a = '''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
'''
schema_b = '''
bse:comment rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "false"^^xsd:boolean .
'''
diff --git a/test/extractor/generic/test_path.py b/test/extractor/generic/test_path.py
index 820f402..0beb37e 100644
--- a/test/extractor/generic/test_path.py
+++ b/test/extractor/generic/test_path.py
@@ -1,14 +1,9 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import unittest
# bsie imports
-from bsie.base import extractor
+from bsie.extractor import base
from bsie.utils import bsfs, node as _node, ns
# objects to test
@@ -29,31 +24,31 @@ class TestPath(unittest.TestCase):
def test_schema(self):
self.assertEqual(Path().schema,
- bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
bse:filename rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
'''))
def test_extract(self):
ext = Path()
- node = _node.Node(ns.bsfs.File, '') # Blank node
+ node = _node.Node(ns.bsn.Entity, '') # Blank node
content = '/tmp/foo/bar'
p_filename = ext.schema.predicate(ns.bse.filename)
- entity = ext.schema.node(ns.bsfs.Node).get_child(ns.bsfs.Entity)
- string = ext.schema.literal(ns.bsfs.Literal).get_child(ns.xsd.string)
+ entity = ext.schema.node(ns.bsfs.Node).child(ns.bsn.Entity)
+ string = ext.schema.literal(ns.bsfs.Literal).child(ns.xsd.string)
# baseline
self.assertSetEqual(set(ext.extract(node, content, (p_filename, ))),
{(node, p_filename, 'bar')})
# predicates parameter is respected
- p_foo = ext.schema.predicate(ns.bsfs.Predicate).get_child(ns.bse.foo, domain=entity, range=string) # unsupported predicate
+ p_foo = ext.schema.predicate(ns.bsfs.Predicate).child(ns.bse.foo, domain=entity, range=string) # unsupported predicate
self.assertSetEqual(set(ext.extract(node, content, (p_filename, p_foo))),
{(node, p_filename, 'bar')})
self.assertSetEqual(set(ext.extract(node, content, (p_foo, ))), set())
# predicates are validated
- p_bar = p_foo.get_child(ns.bse.filename) # same URI but different hierarchy
+ p_bar = p_foo.child(ns.bse.filename) # same URI but different hierarchy
self.assertSetEqual(set(ext.extract(node, content, (p_filename, p_bar))),
{(node, p_filename, 'bar')})
self.assertSetEqual(set(ext.extract(node, content, (p_bar, ))), set())
diff --git a/test/extractor/generic/test_stat.py b/test/extractor/generic/test_stat.py
index 3441438..0e83e24 100644
--- a/test/extractor/generic/test_stat.py
+++ b/test/extractor/generic/test_stat.py
@@ -1,15 +1,10 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import os
import unittest
# bsie imports
-from bsie.base import extractor
+from bsie.extractor import base
from bsie.utils import bsfs, node as _node, ns
# objects to test
@@ -30,31 +25,31 @@ class TestStat(unittest.TestCase):
def test_schema(self):
self.assertEqual(Stat().schema,
- bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
bse:filesize rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer ;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
'''))
def test_extract(self):
ext = Stat()
- node = _node.Node(ns.bsfs.File, '') # Blank node
+ node = _node.Node(ns.bsn.Entity, '') # Blank node
content = os.stat(__file__)
p_filesize = ext.schema.predicate(ns.bse.filesize)
- entity = ext.schema.node(ns.bsfs.Node).get_child(ns.bsfs.Entity)
- string = ext.schema.literal(ns.bsfs.Literal).get_child(ns.xsd.string)
+ entity = ext.schema.node(ns.bsfs.Node).child(ns.bsn.Entity)
+ string = ext.schema.literal(ns.bsfs.Literal).child(ns.xsd.string)
# baseline
self.assertSetEqual(set(ext.extract(node, content, (p_filesize, ))),
{(node, p_filesize, content.st_size)})
# predicates parameter is respected
- p_foo = ext.schema.predicate(ns.bsfs.Predicate).get_child(ns.bse.foo, domain=entity, range=string) # unsupported predicate
+ p_foo = ext.schema.predicate(ns.bsfs.Predicate).child(ns.bse.foo, domain=entity, range=string) # unsupported predicate
self.assertSetEqual(set(ext.extract(node, content, (p_filesize, p_foo))),
{(node, p_filesize, content.st_size)})
self.assertSetEqual(set(ext.extract(node, content, (p_foo, ))), set())
# predicates are validated
- p_bar = p_foo.get_child(ns.bse.filesizse) # same URI but different hierarchy
+        p_bar = p_foo.child(ns.bse.filesize) # same URI but different hierarchy
self.assertSetEqual(set(ext.extract(node, content, (p_filesize, p_bar))),
{(node, p_filesize, content.st_size)})
self.assertSetEqual(set(ext.extract(node, content, (p_bar, ))), set())
diff --git a/test/base/__init__.py b/test/extractor/image/__init__.py
index e69de29..e69de29 100644
--- a/test/base/__init__.py
+++ b/test/extractor/image/__init__.py
diff --git a/test/extractor/image/test_colors_spatial.py b/test/extractor/image/test_colors_spatial.py
new file mode 100644
index 0000000..902ab6d
--- /dev/null
+++ b/test/extractor/image/test_colors_spatial.py
@@ -0,0 +1,95 @@
+
+# standard imports
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.extractor import base
+from bsie.utils import bsfs, ns, node as _node
+
+# objects to test
+from bsie.extractor.image.colors_spatial import ColorsSpatial
+
+
+## code ##
+
+class TestColorsSpatial(unittest.TestCase):
+ def setUp(self):
+        # content id for the default constructor arguments (width=32, height=32, exp=4)
+ self.instance_prefix = 'https://schema.bsfs.io/ie/Literal/Array/Feature/ColorsSpatial'
+ self.predicate_prefix = 'https://schema.bsfs.io/ie/Node/Entity#colors_spatial_'
+ self.uuid = 'adee8d6c43687021e1c5bffe56bcfe727f1638d792744137181304ef889dac2a'
+
+ def test_essentials(self):
+ # clones are equal
+ self.assertEqual(ColorsSpatial(32, 32, 4), ColorsSpatial(32, 32, 4))
+ self.assertEqual(hash(ColorsSpatial(32, 32, 4)), hash(ColorsSpatial(32, 32, 4)))
+        # equality respects type
+ self.assertNotEqual(ColorsSpatial(32, 32, 4), 'hello world')
+ self.assertNotEqual(hash(ColorsSpatial(32, 32, 4)), hash('hello world'))
+        # equality respects width
+ self.assertNotEqual(ColorsSpatial(32, 32, 4), ColorsSpatial(16, 32, 4))
+ self.assertNotEqual(hash(ColorsSpatial(32, 32, 4)), hash(ColorsSpatial(16, 32, 4)))
+        # equality respects height
+ self.assertNotEqual(ColorsSpatial(32, 32, 4), ColorsSpatial(32, 16, 4))
+ self.assertNotEqual(hash(ColorsSpatial(32, 32, 4)), hash(ColorsSpatial(32, 16, 4)))
+        # equality respects exp
+ self.assertNotEqual(ColorsSpatial(32, 32, 4), ColorsSpatial(32, 32, 8))
+ self.assertNotEqual(hash(ColorsSpatial(32, 32, 4)), hash(ColorsSpatial(32, 32, 8)))
+ # string representation
+ self.assertEqual(str(ColorsSpatial()), 'ColorsSpatial')
+ self.assertEqual(repr(ColorsSpatial(64, 16, 2)), 'ColorsSpatial(64, 16, 2)')
+
+ def test_dimension(self):
+ self.assertEqual(ColorsSpatial.dimension(32, 32, 4), 3 * (32*32 + 8*8 + 2*2))
+ self.assertEqual(ColorsSpatial.dimension(16, 16, 8), 3 * (16*16 + 2*2))
+ self.assertEqual(ColorsSpatial.dimension(64, 64, 16), 3 * (64*64 + 4*4))
+
+ def test_schema(self):
+ schema = bsfs.schema.from_string(base.SCHEMA_PREAMBLE + f'''
+ <{self.instance_prefix}> rdfs:subClassOf bsa:Feature ;
+ # annotations
+ rdfs:label "Spatially dominant colors"^^xsd:string ;
+            schema:description "Dominant colors of subregions in an image."^^xsd:string ;
+ bsfs:dtype xsd:integer .
+
+ <{self.instance_prefix}#{self.uuid}> rdfs:subClassOf <{self.instance_prefix}> ;
+ bsfs:dimension "3276"^^xsd:integer ;
+ # annotations
+ <{self.instance_prefix}/args#width> "32"^^xsd:integer ;
+ <{self.instance_prefix}/args#height> "32"^^xsd:integer ;
+ <{self.instance_prefix}/args#exp> "4"^^xsd:float .
+
+ <{self.predicate_prefix}{self.uuid}> rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range <{self.instance_prefix}#{self.uuid}> ;
+ bsfs:unique "true"^^xsd:boolean .
+ ''')
+ self.assertEqual(schema, ColorsSpatial().schema)
+
+ def test_extract(self):
+ ext = ColorsSpatial(2,2,2)
+ img = PIL.Image.open(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ node = _node.Node(ns.bsn.Entity, bsfs.URI('http://example.com/entity#1234'))
+ principals = set(ext.principals)
+ self.assertEqual(len(principals), 1)
+ # valid invocation yields feature
+ ret = list(ext.extract(node, img, principals))
+ self.assertEqual(ret[0], (
+ node,
+ list(principals)[0],
+ (91, 127, 121, 94, 138, 167, 163, 134, 190, 138, 170, 156, 121, 142, 159)))
+ # principals is respected
+ self.assertListEqual(list(ext.extract(node, img, {})), [])
+
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
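
The dimension assertions above imply a simple pyramid: each level contributes width*height cells, both sides shrink by a factor of exp per level until a side falls below one, and every cell carries three color channels. The following is a small reconstruction that is consistent with those three assertions; the function name dimension here is a local sketch, not necessarily the library's actual implementation.

    # Reconstruction of the dimension bookkeeping implied by the assertions above.
    def dimension(width: int, height: int, exp: int) -> int:
        total = 0
        while width >= 1 and height >= 1:
            total += width * height     # cells on this pyramid level
            width //= exp               # next, coarser level
            height //= exp
        return 3 * total                # three color channels per cell

    assert dimension(32, 32, 4) == 3 * (32*32 + 8*8 + 2*2)    # 3276, as in the schema above
    assert dimension(16, 16, 8) == 3 * (16*16 + 2*2)
    assert dimension(64, 64, 16) == 3 * (64*64 + 4*4)
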
diff --git a/test/extractor/image/test_photometrics.py b/test/extractor/image/test_photometrics.py
new file mode 100644
index 0000000..fb219e2
--- /dev/null
+++ b/test/extractor/image/test_photometrics.py
@@ -0,0 +1,143 @@
+
+# standard imports
+import unittest
+
+# bsie imports
+from bsie.extractor import base
+from bsie.utils import bsfs, node as _node, ns
+
+# objects to test
+from bsie.extractor.image.photometrics import Exif, _gps_to_dec
+
+
+## code ##
+
+class TestExif(unittest.TestCase):
+
+ def test_gps_to_dec(self):
+ # deg+min+sec format
+ self.assertAlmostEqual(_gps_to_dec('29/1 58/1 45/1'.split()), 29.979167, 6)
+ self.assertAlmostEqual(_gps_to_dec('31 08 03'.split()), 31.134167, 6)
+ self.assertAlmostEqual(_gps_to_dec('20 40 586/10'.split()), 20.682944, 6)
+ self.assertAlmostEqual(_gps_to_dec('88/1 34 68/10'.split()), 88.568556, 6)
+ # deg+min format
+ self.assertAlmostEqual(_gps_to_dec('13 472167/10000 0/1 '.split()), 13.786945, 6)
+ self.assertAlmostEqual(_gps_to_dec('104/1 3215/100 0/1'.split()), 104.535833, 6)
+
+ def test_eq(self):
+ # identical instances are equal
+ self.assertEqual(Exif(), Exif())
+ self.assertEqual(hash(Exif()), hash(Exif()))
+ # comparison respects type
+ class Foo(): pass
+ self.assertNotEqual(Exif(), Foo())
+ self.assertNotEqual(hash(Exif()), hash(Foo()))
+ self.assertNotEqual(Exif(), 1234)
+ self.assertNotEqual(hash(Exif()), hash(1234))
+ self.assertNotEqual(Exif(), None)
+ self.assertNotEqual(hash(Exif()), hash(None))
+
+ def test_schema(self):
+ self.assertSetEqual({pred.uri for pred in Exif().schema.predicates()}, {
+ ns.bsfs.Predicate,
+ ns.bse.exposure,
+ ns.bse.aperture,
+ ns.bse.iso,
+ ns.bse.focal_length,
+ ns.bse.width,
+ ns.bse.height,
+ ns.bse.orientation,
+ ns.bse.orientation_label,
+ ns.bse.altitude,
+ ns.bse.latitude,
+ ns.bse.longitude,
+ })
+
+ def test_extract(self):
+ ext = Exif()
+ node = _node.Node(ns.bsfs.File, '') # Blank node
+ content = {
+ 'Exif.Photo.ExposureTime': '10/600',
+ 'Exif.Photo.FNumber': '48/10',
+ 'Exif.Photo.ISOSpeedRatings': '400',
+ 'Exif.Photo.FocalLength': '460/10',
+ 'Exif.Photo.PixelXDimension': '4288',
+ 'Exif.Photo.PixelYDimension': '2848',
+ 'Exif.Image.Orientation': '1',
+ 'Exif.GPSInfo.GPSAltitude': '431/1',
+ 'Exif.GPSInfo.GPSLatitude': '46/1 11397/625 0/1',
+ 'Exif.GPSInfo.GPSLongitude': '7/1 131250/2500 0/1',
+ }
+
+ # invalid principals are ignored
+ self.assertSetEqual(set(ext.extract(node, content, {ns.bse.filename})), set())
+ # extract finds all relevant information
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.exposure)})),
+ {(node, ext.schema.predicate(ns.bse.exposure), 60.0)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.aperture)})),
+ {(node, ext.schema.predicate(ns.bse.aperture), 4.8)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.iso)})),
+ {(node, ext.schema.predicate(ns.bse.iso), 400)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.focal_length)})),
+ {(node, ext.schema.predicate(ns.bse.focal_length), 46.0)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.width)})),
+ {(node, ext.schema.predicate(ns.bse.width), 4288)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.height)})),
+ {(node, ext.schema.predicate(ns.bse.height), 2848)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.orientation)})),
+ {(node, ext.schema.predicate(ns.bse.orientation), 1)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.orientation_label)})),
+ {(node, ext.schema.predicate(ns.bse.orientation_label), 'landscape')})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.altitude)})),
+ {(node, ext.schema.predicate(ns.bse.altitude), 431.0)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.latitude)})),
+ {(node, ext.schema.predicate(ns.bse.latitude), 46.30392)})
+ self.assertSetEqual(set(ext.extract(node, content, {ext.schema.predicate(ns.bse.longitude)})),
+ {(node, ext.schema.predicate(ns.bse.longitude), 7.875)})
+
+ # check orientation label
+ self.assertSetEqual(set(ext.extract(
+ node, {
+ 'Exif.Photo.PixelXDimension': '4288',
+ 'Exif.Photo.PixelYDimension': '2848',
+ 'Exif.Image.Orientation': '5',
+ },
+ {ext.schema.predicate(ns.bse.orientation_label)})),
+ {(node, ext.schema.predicate(ns.bse.orientation_label), 'portrait')})
+
+ # can pass multiple principals
+ self.assertSetEqual(set(ext.extract(node, content, {
+ ext.schema.predicate(ns.bse.exposure),
+ ext.schema.predicate(ns.bse.iso),
+ ext.schema.predicate(ns.bse.focal_length),
+ })), {
+ (node, ext.schema.predicate(ns.bse.exposure), 60.0),
+ (node, ext.schema.predicate(ns.bse.iso), 400),
+ (node, ext.schema.predicate(ns.bse.focal_length), 46.0),
+ })
+
+ # principals w/o content are ignored
+ self.assertSetEqual(set(ext.extract(
+ node,
+ content={'Exif.Photo.ExposureTime': '10/600'},
+ principals={
+ ext.schema.predicate(ns.bse.exposure),
+ ext.schema.predicate(ns.bse.iso),
+ ext.schema.predicate(ns.bse.focal_length),
+ })
+ ), {
+ (node, ext.schema.predicate(ns.bse.exposure), 60.0),
+ })
+
+ # empty content is acceptable
+ self.assertSetEqual(set(ext.extract(node, {}, set(ext.principals))), set())
+ # no principals is acceptable
+ self.assertSetEqual(set(ext.extract(node, content, set())), set())
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
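
The GPS cases above follow the usual EXIF encoding: three rational tokens for degrees, minutes, and seconds, combined as deg + min/60 + sec/3600. Below is a hedged reconstruction that reproduces the tested values; gps_to_dec is a local sketch mirroring the test cases, not necessarily the exact implementation of _gps_to_dec.

    # Reconstruction of the degree/minute/second conversion exercised above.
    from fractions import Fraction

    def gps_to_dec(parts):
        # each part is a plain or rational number string, e.g. '58/1'
        deg, minutes, seconds = (Fraction(p) for p in parts)
        return float(deg + minutes / 60 + seconds / 3600)

    assert round(gps_to_dec('29/1 58/1 45/1'.split()), 6) == 29.979167
    assert round(gps_to_dec('13 472167/10000 0/1'.split()), 6) == 13.786945
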
diff --git a/test/extractor/image/testimage.jpg b/test/extractor/image/testimage.jpg
new file mode 100644
index 0000000..c80bb48
--- /dev/null
+++ b/test/extractor/image/testimage.jpg
Binary files differ
diff --git a/test/base/test_extractor.py b/test/extractor/test_base.py
index 30974ef..81865e1 100644
--- a/test/base/test_extractor.py
+++ b/test/extractor/test_base.py
@@ -1,30 +1,25 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import unittest
# bsie imports
from bsie.utils import bsfs, ns
# objects to test
-from bsie.base import extractor
+from bsie.extractor import base
## code ##
-class StubExtractor(extractor.Extractor):
+class StubExtractor(base.Extractor):
def __init__(self):
- super().__init__(bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ super().__init__(bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "false"^^xsd:boolean .
bse:comment rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "false"^^xsd:boolean .
'''))
@@ -52,11 +47,11 @@ class TestExtractor(unittest.TestCase):
self.assertNotEqual(hash(ext), hash(sub))
def test_principals(self):
- schema = bsfs.schema.Schema.Empty()
- entity = schema.node(ns.bsfs.Node).get_child(ns.bsfs.Entity)
- string = schema.literal(ns.bsfs.Literal).get_child(bsfs.URI('http://www.w3.org/2001/XMLSchema#string'))
- p_author = schema.predicate(ns.bsfs.Predicate).get_child(ns.bse.author, domain=entity, range=string)
- p_comment = schema.predicate(ns.bsfs.Predicate).get_child(ns.bse.comment, domain=entity, range=string)
+ schema = bsfs.schema.Schema()
+ entity = schema.node(ns.bsfs.Node).child(ns.bsn.Entity)
+ string = schema.literal(ns.bsfs.Literal).child(ns.xsd.string)
+ p_author = schema.predicate(ns.bsfs.Predicate).child(ns.bse.author, domain=entity, range=string)
+ p_comment = schema.predicate(ns.bsfs.Predicate).child(ns.bse.comment, domain=entity, range=string)
ext = StubExtractor()
self.assertSetEqual(set(ext.principals),
{p_author, p_comment} | set(schema.predicates()) - {schema.predicate(ns.bsfs.Predicate)})
diff --git a/test/extractor/test_builder.py b/test/extractor/test_builder.py
new file mode 100644
index 0000000..fbb0895
--- /dev/null
+++ b/test/extractor/test_builder.py
@@ -0,0 +1,98 @@
+
+# standard imports
+import unittest
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.extractor import ExtractorBuilder
+
+
+## code ##
+
+class TestExtractorBuilder(unittest.TestCase):
+ def test_iter(self):
+ # no specifications
+ self.assertListEqual(list(ExtractorBuilder([])), [])
+ # some specifications
+ builder = ExtractorBuilder([
+ {'bsie.extractor.generic.path.Path': {}},
+ {'bsie.extractor.generic.stat.Stat': {}},
+ {'bsie.extractor.generic.path.Path': {}},
+ ])
+ self.assertListEqual(list(builder), [0, 1, 2])
+
+ def test_build(self):
+ # simple and repeated extractors
+ builder = ExtractorBuilder([
+ {'bsie.extractor.generic.path.Path': {}},
+ {'bsie.extractor.generic.stat.Stat': {}},
+ {'bsie.extractor.generic.path.Path': {}},
+ ])
+ ext = [builder.build(0), builder.build(1), builder.build(2)]
+ import bsie.extractor.generic.path
+ import bsie.extractor.generic.stat
+ self.assertListEqual(ext, [
+ bsie.extractor.generic.path.Path(),
+ bsie.extractor.generic.stat.Stat(),
+ bsie.extractor.generic.path.Path(),
+ ])
+        # out-of-bounds raises IndexError
+ self.assertRaises(IndexError, builder.build, 3)
+
+ # building with args
+ builder = ExtractorBuilder([
+ {'bsie.extractor.generic.constant.Constant': {
+ 'schema': '''
+ bse:author rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:rating rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+ ''',
+ 'tuples': [
+ ('https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I'),
+ ('https://schema.bsfs.io/ie/Node/Entity#rating', 123),
+ ],
+ }}])
+ obj = builder.build(0)
+ import bsie.extractor.generic.constant
+ self.assertEqual(obj, bsie.extractor.generic.constant.Constant('''
+ bse:author rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ bse:rating rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+ ''', [
+ ('https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I'),
+ ('https://schema.bsfs.io/ie/Node/Entity#rating', 123),
+ ]))
+
+ # building with invalid args
+ self.assertRaises(errors.BuilderError, ExtractorBuilder(
+ [{'bsie.extractor.generic.path.Path': {'foo': 123}}]).build, 0)
+ # non-dict build specification
+ self.assertRaises(TypeError, ExtractorBuilder(
+ [('bsie.extractor.generic.path.Path', {})]).build, 0)
+ # multiple keys per build specification
+ self.assertRaises(TypeError, ExtractorBuilder(
+ [{'bsie.extractor.generic.path.Path': {},
+ 'bsie.extractor.generic.stat.Stat': {}}]).build, 0)
+ # non-dict value for kwargs
+ self.assertRaises(TypeError, ExtractorBuilder(
+ [{'bsie.extractor.generic.path.Path': 123}]).build, 0)
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
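
As the build specifications above show, ExtractorBuilder takes a list of single-key dicts, each mapping a dotted class path to that extractor's keyword arguments, and iterating the builder yields the spec indices. A short sketch of that contract, assuming bsie is installed; the two generic extractors are just illustrative:

    # Sketch of the specification format exercised above.
    from bsie.extractor import ExtractorBuilder

    specs = [
        {'bsie.extractor.generic.path.Path': {}},   # one dotted class path -> kwargs dict
        {'bsie.extractor.generic.stat.Stat': {}},
    ]
    builder = ExtractorBuilder(specs)
    extractors = [builder.build(idx) for idx in builder]   # iteration yields indices 0, 1, ...
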
diff --git a/test/extractor/test_preview.py b/test/extractor/test_preview.py
new file mode 100644
index 0000000..6526783
--- /dev/null
+++ b/test/extractor/test_preview.py
@@ -0,0 +1,123 @@
+
+# standard imports
+import io
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.extractor import base
+from bsie.utils import bsfs, node as _node, ns
+from bsie.reader.preview import Preview as Reader
+
+# objects to test
+from bsie.extractor.preview import Preview
+
+
+## code ##
+
+class TestPreview(unittest.TestCase):
+ def test_eq(self):
+ # identical instances are equal
+ self.assertEqual(Preview([1,2,3]), Preview([1,2,3]))
+ self.assertEqual(hash(Preview([1,2,3])), hash(Preview([1,2,3])))
+ # comparison respects max_sides
+ self.assertNotEqual(Preview([1,2,3]), Preview([1,2]))
+ self.assertNotEqual(hash(Preview([1,2,3])), hash(Preview([1,2])))
+ self.assertNotEqual(Preview([1,2]), Preview([1,2,3]))
+ self.assertNotEqual(hash(Preview([1,2])), hash(Preview([1,2,3])))
+ # comparison respects type
+ class Foo(): pass
+ self.assertNotEqual(Preview([1,2,3]), Foo())
+ self.assertNotEqual(hash(Preview([1,2,3])), hash(Foo()))
+ self.assertNotEqual(Preview([1,2,3]), 123)
+ self.assertNotEqual(hash(Preview([1,2,3])), hash(123))
+ self.assertNotEqual(Preview([1,2,3]), None)
+ self.assertNotEqual(hash(Preview([1,2,3])), hash(None))
+
+ def test_schema(self):
+ self.assertEqual(Preview([1,2,3]).schema,
+ bsfs.schema.from_string(base.SCHEMA_PREAMBLE + '''
+ bsn:Preview rdfs:subClassOf bsfs:Node .
+ bsl:BinaryBlob rdfs:subClassOf bsfs:Literal .
+ <https://schema.bsfs.io/ie/Literal/BinaryBlob/JPEG> rdfs:subClassOf bsl:BinaryBlob .
+
+ bse:preview rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range bsn:Preview ;
+ bsfs:unique "false"^^xsd:boolean .
+
+ bsp:width rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Preview ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ bsp:height rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Preview ;
+ rdfs:range xsd:integer ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ bsp:asset rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Preview ;
+ rdfs:range <https://schema.bsfs.io/ie/Literal/BinaryBlob/JPEG> ;
+ bsfs:unique "true"^^xsd:boolean .
+
+ '''))
+
+ def test_extract(self):
+ # setup dependents
+ rdr = Reader()
+ subject = _node.Node(ns.bsn.Entity)
+ path = os.path.join(os.path.dirname(__file__), 'testimage.jpg')
+
+ # setup extractor
+ ext = Preview(max_sides=[10])
+ principals = set(ext.principals)
+ self.assertEqual(principals, {ext.schema.predicate(ns.bse.preview)})
+ # skip unknown predicates
+ gen = rdr(path)
+ self.assertSetEqual(set(), set(ext.extract(subject, gen,
+ {ext.schema.predicate(ns.bsfs.Predicate).child(ns.bse.unknown)})))
+        gen(10) # NOTE: consume part of the image to avoid a resource warning
+ # extract a preview
+ triples = set(ext.extract(subject, rdr(path), principals))
+ thumbs = {node for node, _, _ in triples if node.node_type == ns.bsn.Preview}
+ self.assertEqual(len(thumbs), 1)
+ thumb = list(thumbs)[0]
+ # test properties
+ self.assertTrue(triples.issuperset({
+ (subject, ext.schema.predicate(ns.bse.preview), thumb),
+ (thumb, ext.schema.predicate(ns.bsp.width), 10),
+ (thumb, ext.schema.predicate(ns.bsp.height), 10),
+ }))
+ # test image data
+ rawdata = {val for _, pred, val in triples if pred == ext.schema.predicate(ns.bsp.asset)}
+ self.assertEqual(len(rawdata), 1)
+ data = io.BytesIO(list(rawdata)[0])
+ data.seek(0)
+ img = PIL.Image.open(data)
+ self.assertEqual(img.size, (10, 10))
+ self.assertEqual(sum(band for pix in img.getdata() for band in pix), 0)
+
+ # setup extractor
+ ext = Preview(max_sides=[10, 20])
+ principals = set(ext.principals)
+ self.assertEqual(principals, {ext.schema.predicate(ns.bse.preview)})
+ # extract a preview
+ triples = set(ext.extract(subject, rdr(path), principals))
+ thumbs = {node for node, _, _ in triples if node.node_type == ns.bsn.Preview}
+ self.assertEqual(len(thumbs), 2)
+ self.assertSetEqual({10, 20}, {
+ value for _, pred, value in triples if pred == ext.schema.predicate(ns.bsp.width)})
+ self.assertSetEqual({10, 20}, {
+ value for _, pred, value in triples if pred == ext.schema.predicate(ns.bsp.height)})
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/extractor/testimage.jpg b/test/extractor/testimage.jpg
new file mode 100644
index 0000000..4c2aca5
--- /dev/null
+++ b/test/extractor/testimage.jpg
Binary files differ
diff --git a/test/lib/test_bsie.py b/test/lib/test_bsie.py
index 771a0c2..0c393cc 100644
--- a/test/lib/test_bsie.py
+++ b/test/lib/test_bsie.py
@@ -1,16 +1,13 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import os
import unittest
# bsie imports
-from bsie.base import extractor
-from bsie.tools import builder
+from bsie.extractor import ExtractorBuilder
+from bsie.extractor.base import SCHEMA_PREAMBLE
+from bsie.lib import PipelineBuilder, DefaultNamingPolicy
+from bsie.reader import ReaderBuilder
from bsie.utils import bsfs, node, ns
# objects to test
@@ -22,53 +19,53 @@ from bsie.lib.bsie import BSIE
class TestBSIE(unittest.TestCase):
def setUp(self):
# reader builder
- rbuild = builder.ReaderBuilder({})
+ rbuild = ReaderBuilder({})
# extractor builder
- ebuild = builder.ExtractorBuilder([
+ ebuild = ExtractorBuilder([
{'bsie.extractor.generic.path.Path': {}},
{'bsie.extractor.generic.stat.Stat': {}},
{'bsie.extractor.generic.constant.Constant': dict(
- tuples=[('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I')],
+ tuples=[('https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I')],
schema='''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
''',
)},
])
# build pipeline
- self.prefix = bsfs.Namespace('http://example.com/local/')
- pbuild = builder.PipelineBuilder(self.prefix, rbuild, ebuild)
+ self.naming_policy = DefaultNamingPolicy(host='http://example.com/local', user='me')
+ pbuild = PipelineBuilder(rbuild, ebuild)
self.pipeline = pbuild.build()
def test_construction(self):
- # pipeline only
- lib = BSIE(self.pipeline)
+ # only pipeline and naming policy
+ lib = BSIE(self.pipeline, self.naming_policy)
self.assertSetEqual(set(lib.principals), {
ns.bse.filename,
ns.bse.filesize,
ns.bse.author,
})
- self.assertEqual(lib.schema, bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ self.assertEqual(lib.schema, bsfs.schema.from_string(SCHEMA_PREAMBLE + '''
bse:filename rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
bse:filesize rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
'''))
# specify collect
- lib = BSIE(self.pipeline, collect={
+ lib = BSIE(self.pipeline, self.naming_policy, collect={
ns.bse.filesize,
ns.bse.author,
ns.bse.inexistent,
@@ -77,44 +74,44 @@ class TestBSIE(unittest.TestCase):
ns.bse.filesize,
ns.bse.author,
})
- self.assertEqual(lib.schema, bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ self.assertEqual(lib.schema, bsfs.schema.from_string(SCHEMA_PREAMBLE + '''
bse:filesize rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
'''))
# empty collect is disregarded
- lib = BSIE(self.pipeline, collect={})
+ lib = BSIE(self.pipeline, self.naming_policy, collect={})
self.assertSetEqual(set(lib.principals), {
ns.bse.filename,
ns.bse.filesize,
ns.bse.author,
})
- self.assertEqual(lib.schema, bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ self.assertEqual(lib.schema, bsfs.schema.from_string(SCHEMA_PREAMBLE + '''
bse:filename rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
bse:filesize rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
'''))
# specify discard
- lib = BSIE(self.pipeline, discard={
+ lib = BSIE(self.pipeline, self.naming_policy, discard={
ns.bse.filesize,
ns.bse.filename,
ns.bse.inexistent,
@@ -122,40 +119,40 @@ class TestBSIE(unittest.TestCase):
self.assertSetEqual(set(lib.principals), {
ns.bse.author,
})
- self.assertEqual(lib.schema, bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ self.assertEqual(lib.schema, bsfs.schema.from_string(SCHEMA_PREAMBLE + '''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
'''))
# specify collect and discard
- lib = BSIE(self.pipeline,
+ lib = BSIE(self.pipeline, self.naming_policy,
collect={ns.bse.filesize, ns.bse.author, ns.bse.foo, ns.bse.bar},
discard={ns.bse.author, ns.bse.foo, ns.bse.foobar},
)
self.assertSetEqual(set(lib.principals), {
ns.bse.filesize,
})
- self.assertEqual(lib.schema, bsfs.schema.Schema.from_string(extractor.SCHEMA_PREAMBLE + '''
+ self.assertEqual(lib.schema, bsfs.schema.from_string(SCHEMA_PREAMBLE + '''
bse:filesize rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer;
- bsfs:unique "false"^^xsd:boolean .
+ bsfs:unique "true"^^xsd:boolean .
'''))
def test_from_file(self):
# setup
- lib = BSIE(self.pipeline)
+ lib = BSIE(self.pipeline, self.naming_policy)
self.assertSetEqual(set(lib.principals), {
ns.bse.filesize,
ns.bse.filename,
ns.bse.author,
})
content_hash = 'a948904f2f0f479b8f8197694b30184b0d2ed1c1cd2a1ec0fb85d299a192a447'
- subject = node.Node(ns.bsfs.File, (self.prefix + 'file#')[content_hash])
+ subject = node.Node(ns.bsn.Entity, uri=f'http://example.com/local/me/file#{content_hash}')
testfile = os.path.join(os.path.dirname(__file__), 'testfile.t')
# from_file extracts all available triples
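
The constructor calls above outline the BSIE facade: it wraps a pipeline and a naming policy, and its principals can be narrowed with collect and discard, where unknown predicates are ignored and predicates named in both sets end up discarded. A condensed sketch of the same setup, assuming bsie is installed; the host/user values and the extractor list are illustrative:

    # Condensed sketch of the BSIE construction exercised above.
    from bsie.extractor import ExtractorBuilder
    from bsie.lib import PipelineBuilder, DefaultNamingPolicy
    from bsie.lib.bsie import BSIE
    from bsie.reader import ReaderBuilder
    from bsie.utils import ns

    pipeline = PipelineBuilder(ReaderBuilder({}), ExtractorBuilder([
        {'bsie.extractor.generic.path.Path': {}},
        {'bsie.extractor.generic.stat.Stat': {}},
    ])).build()
    naming_policy = DefaultNamingPolicy(host='http://example.com/local', user='me')

    # collect narrows the principals, discard removes from them
    lib = BSIE(pipeline, naming_policy,
               collect={ns.bse.filename, ns.bse.filesize},
               discard={ns.bse.filename})
    print(set(lib.principals))   # {ns.bse.filesize}
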
diff --git a/test/lib/test_builder.py b/test/lib/test_builder.py
new file mode 100644
index 0000000..3ecb3d3
--- /dev/null
+++ b/test/lib/test_builder.py
@@ -0,0 +1,101 @@
+
+# standard imports
+import logging
+import unittest
+
+# bsie imports
+from bsie.extractor import ExtractorBuilder
+from bsie.reader import ReaderBuilder
+from bsie.utils import bsfs
+
+# objects to test
+from bsie.lib import PipelineBuilder
+
+
+## code ##
+
+class TestPipelineBuilder(unittest.TestCase):
+ def test_build(self):
+ c_schema = '''
+ bse:author rdfs:subClassOf bsfs:Predicate ;
+ rdfs:domain bsn:Entity ;
+ rdfs:range xsd:string ;
+ bsfs:unique "true"^^xsd:boolean .
+ '''
+ c_tuples = [('https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I')]
+ # prepare builders
+ rbuild = ReaderBuilder({})
+ ebuild = ExtractorBuilder([
+ {'bsie.extractor.generic.path.Path': {}},
+ {'bsie.extractor.generic.stat.Stat': {}},
+ {'bsie.extractor.generic.constant.Constant': dict(
+ schema=c_schema,
+ tuples=c_tuples,
+ )},
+ ])
+ # build pipeline
+ builder = PipelineBuilder(rbuild, ebuild)
+ pipeline = builder.build()
+ # delayed import
+ import bsie.reader.path
+ import bsie.reader.stat
+ import bsie.extractor.generic.path
+ import bsie.extractor.generic.stat
+ import bsie.extractor.generic.constant
+ # check pipeline
+ self.assertDictEqual(pipeline._ext2rdr, {
+ bsie.extractor.generic.path.Path(): bsie.reader.path.Path(),
+ bsie.extractor.generic.stat.Stat(): bsie.reader.stat.Stat(),
+ bsie.extractor.generic.constant.Constant(c_schema, c_tuples): None,
+ })
+
+ # fail to load extractor
+ ebuild_err = ExtractorBuilder([
+ {'bsie.extractor.generic.foo.Foo': {}},
+ {'bsie.extractor.generic.path.Path': {}},
+ ])
+ with self.assertLogs(logging.getLogger('bsie.lib.builder'), logging.ERROR):
+ pipeline = PipelineBuilder(rbuild, ebuild_err).build()
+ self.assertDictEqual(pipeline._ext2rdr, {
+ bsie.extractor.generic.path.Path(): bsie.reader.path.Path()})
+
+ # fail to build extractor
+ ebuild_err = ExtractorBuilder([
+ {'bsie.extractor.generic.path.Path': {'foo': 123}},
+ {'bsie.extractor.generic.path.Path': {}},
+ ])
+ with self.assertLogs(logging.getLogger('bsie.lib.builder'), logging.ERROR):
+ pipeline = PipelineBuilder(rbuild, ebuild_err).build()
+ self.assertDictEqual(pipeline._ext2rdr, {
+ bsie.extractor.generic.path.Path(): bsie.reader.path.Path()})
+
+ # fail to load reader
+ with self.assertLogs(logging.getLogger('bsie.lib.builder'), logging.ERROR):
+ # switch reader of an extractor
+ old_reader = bsie.extractor.generic.path.Path.CONTENT_READER
+ bsie.extractor.generic.path.Path.CONTENT_READER = 'bsie.reader.foo.Foo'
+ # build pipeline with invalid reader reference
+ pipeline = PipelineBuilder(rbuild, ebuild).build()
+ self.assertDictEqual(pipeline._ext2rdr, {
+ bsie.extractor.generic.stat.Stat(): bsie.reader.stat.Stat(),
+ bsie.extractor.generic.constant.Constant(c_schema, c_tuples): None,
+ })
+ # switch back
+ bsie.extractor.generic.path.Path.CONTENT_READER = old_reader
+
+ # fail to build reader
+ rbuild_err = ReaderBuilder({'bsie.reader.stat.Stat': dict(foo=123)})
+ with self.assertLogs(logging.getLogger('bsie.lib.builder'), logging.ERROR):
+ pipeline = PipelineBuilder(rbuild_err, ebuild).build()
+ self.assertDictEqual(pipeline._ext2rdr, {
+ bsie.extractor.generic.path.Path(): bsie.reader.path.Path(),
+ bsie.extractor.generic.constant.Constant(c_schema, c_tuples): None,
+ })
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/lib/test_naming_policy.py b/test/lib/test_naming_policy.py
new file mode 100644
index 0000000..c9b0cd2
--- /dev/null
+++ b/test/lib/test_naming_policy.py
@@ -0,0 +1,115 @@
+
+# standard imports
+import unittest
+
+# bsie imports
+from bsie.utils import ns, errors
+from bsie.utils.bsfs import URI
+from bsie.utils.node import Node
+
+# objects to test
+from bsie.lib.naming_policy import NamingPolicy, NamingPolicyIterator, DefaultNamingPolicy
+
+
+
+## code ##
+
+class TestDefaultNamingPolicy(unittest.TestCase):
+
+ def test_handle_node(self):
+ # setup
+ policy = DefaultNamingPolicy('http://example.com', 'me')
+ # handle_node doesn't modify existing uris
+ self.assertEqual(policy.handle_node(
+ Node(ns.bsn.Invalid, uri='http://example.com/you/foo#bar')).uri,
+ URI('http://example.com/you/foo#bar'))
+ # processes bsfs:File
+ self.assertEqual(policy.handle_node(
+ Node(ns.bsn.Entity, ucid='abc123cba')).uri,
+ URI('http://example.com/me/file#abc123cba'))
+ # processes bsfs:Preview
+ self.assertEqual(policy.handle_node(
+ Node(ns.bsn.Preview, ucid='abc123cba', size=123)).uri,
+ URI('http://example.com/me/preview#abc123cba_s123'))
+ # raises an exception on unknown types
+ self.assertRaises(errors.ProgrammingError, policy.handle_node,
+ Node(ns.bsn.Invalid, ucid='abc123cba', size=123))
+
+ def test_name_file(self):
+ # setup
+ policy = DefaultNamingPolicy('http://example.com', 'me')
+ # name_file uses ucid
+ self.assertEqual(policy.name_file(
+ Node(ns.bsn.Entity, ucid='123abc321')).uri,
+ URI('http://example.com/me/file#123abc321'))
+ # name_file falls back to a random guid
+ self.assertTrue(policy.name_file(
+ Node(ns.bsn.Entity)).uri.startswith('http://example.com/me/file#'))
+
+ def test_name_preview(self):
+ # setup
+ policy = DefaultNamingPolicy('http://example.com', 'me')
+ # name_preview uses ucid
+ self.assertEqual(policy.name_preview(
+ Node(ns.bsn.Preview, ucid='123abc321')).uri,
+ URI('http://example.com/me/preview#123abc321'))
+ self.assertEqual(policy.name_preview(
+ Node(ns.bsn.Preview, ucid='123abc321', size=400)).uri,
+ URI('http://example.com/me/preview#123abc321_s400'))
+ # name_preview uses source
+ self.assertEqual(policy.name_preview(
+ Node(ns.bsn.Preview, source=Node(ns.bsn.Entity, ucid='123file321'))).uri,
+ URI('http://example.com/me/preview#123file321'))
+ self.assertEqual(policy.name_preview(
+ Node(ns.bsn.Preview, source=Node(ns.bsn.Entity, ucid='123file321'), size=300)).uri,
+ URI('http://example.com/me/preview#123file321_s300'))
+ # name_preview falls back to a random guid
+ self.assertTrue(policy.name_preview(
+ Node(ns.bsn.Preview)).uri.startswith('http://example.com/me/preview#'))
+ self.assertTrue(policy.name_preview(
+ Node(ns.bsn.Preview, size=200)).uri.startswith('http://example.com/me/preview#'))
+ self.assertTrue(policy.name_preview(
+ Node(ns.bsn.Preview, size=200)).uri.endswith('_s200'))
+
+
+class TestNamingPolicyIterator(unittest.TestCase):
+
+ def test_call(self): # NOTE: We test NamingPolicy.__call__ here
+ # setup
+ policy = DefaultNamingPolicy('http://example.com', 'me')
+ # call accepts list
+ triples = [('node', 'pred', 'value'), ('node', 'pred', 'value')]
+ it = policy(triples)
+ self.assertIsInstance(it, NamingPolicyIterator)
+ self.assertEqual(it._iterable, triples)
+ self.assertEqual(it._policy, policy)
+ # call accepts iterator
+ triples = iter([('node', 'pred', 'value'), ('node', 'pred', 'value')])
+ it = policy(triples)
+ self.assertIsInstance(it, NamingPolicyIterator)
+ self.assertEqual(it._iterable, triples)
+ self.assertEqual(it._policy, policy)
+
+ def test_iter(self):
+ # setup
+ policy = DefaultNamingPolicy('http://example.com', 'me')
+ triples = [
+ (Node(ns.bsn.Entity, ucid='foo'), 'predA', 'hello'),
+ (Node(ns.bsn.Preview, ucid='bar'), 'predB', 1234),
+ (Node(ns.bsn.Preview, ucid='hello'), 'predC', Node(ns.bsn.Entity, ucid='world'))
+ ]
+ # handles nodes, handles values, ignores predicate
+ self.assertListEqual(list(policy(triples)), [
+ (Node(ns.bsn.Entity, uri='http://example.com/me/file#foo'), 'predA', 'hello'),
+ (Node(ns.bsn.Preview, uri='http://example.com/me/preview#bar'), 'predB', 1234),
+ (Node(ns.bsn.Preview, uri='http://example.com/me/preview#hello'), 'predC',
+ Node(ns.bsn.Entity, uri='http://example.com/me/file#world')),
+ ])
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
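
The naming policy tests above pin down the URI scheme: nodes that already carry a URI pass through untouched, Entity nodes become <host>/<user>/file#<ucid>, and Preview nodes become <host>/<user>/preview#<ucid> with an optional _s<size> suffix. A small usage sketch, assuming bsie is installed; the host and user values are the same illustrative ones used in the tests:

    # Usage sketch of DefaultNamingPolicy, matching the expectations above.
    from bsie.lib.naming_policy import DefaultNamingPolicy
    from bsie.utils import ns
    from bsie.utils.node import Node

    policy = DefaultNamingPolicy('http://example.com', 'me')
    entity = policy.handle_node(Node(ns.bsn.Entity, ucid='abc123cba'))
    preview = policy.handle_node(Node(ns.bsn.Preview, ucid='abc123cba', size=123))
    print(entity.uri)    # http://example.com/me/file#abc123cba
    print(preview.uri)   # http://example.com/me/preview#abc123cba_s123
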
diff --git a/test/tools/test_pipeline.py b/test/lib/test_pipeline.py
index a116a30..eb088a9 100644
--- a/test/tools/test_pipeline.py
+++ b/test/lib/test_pipeline.py
@@ -1,17 +1,11 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import logging
import os
import unittest
# bsie imports
-from bsie.base import errors
-from bsie.utils import bsfs, node, ns
+from bsie.utils import bsfs, errors, node, ns
import bsie.extractor.generic.constant
import bsie.extractor.generic.path
import bsie.extractor.generic.stat
@@ -19,7 +13,7 @@ import bsie.reader.path
import bsie.reader.stat
# objects to test
-from bsie.tools.pipeline import Pipeline
+from bsie.lib.pipeline import Pipeline
## code ##
@@ -29,19 +23,19 @@ class TestPipeline(unittest.TestCase):
# constant A
csA = '''
bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:string ;
bsfs:unique "true"^^xsd:boolean .
'''
- tupA = [('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I')]
+ tupA = [('https://schema.bsfs.io/ie/Node/Entity#author', 'Me, myself, and I')]
# constant B
csB = '''
bse:rating rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:File ;
+ rdfs:domain bsn:Entity ;
rdfs:range xsd:integer ;
bsfs:unique "true"^^xsd:boolean .
'''
- tupB = [('http://bsfs.ai/schema/Entity#rating', 123)]
+ tupB = [('https://schema.bsfs.io/ie/Node/Entity#rating', 123)]
# extractors/readers
self.ext2rdr = {
bsie.extractor.generic.path.Path(): bsie.reader.path.Path(),
@@ -49,33 +43,29 @@ class TestPipeline(unittest.TestCase):
bsie.extractor.generic.constant.Constant(csA, tupA): None,
bsie.extractor.generic.constant.Constant(csB, tupB): None,
}
- self.prefix = bsfs.Namespace('http://example.com/local/')
def test_essentials(self):
- pipeline = Pipeline(self.prefix, self.ext2rdr)
+ pipeline = Pipeline(self.ext2rdr)
self.assertEqual(str(pipeline), 'Pipeline')
self.assertEqual(repr(pipeline), 'Pipeline(...)')
def test_equality(self):
- pipeline = Pipeline(self.prefix, self.ext2rdr)
+ pipeline = Pipeline(self.ext2rdr)
# a pipeline is equivalent to itself
self.assertEqual(pipeline, pipeline)
self.assertEqual(hash(pipeline), hash(pipeline))
# identical builds are equivalent
- self.assertEqual(pipeline, Pipeline(self.prefix, self.ext2rdr))
- self.assertEqual(hash(pipeline), hash(Pipeline(self.prefix, self.ext2rdr)))
+ self.assertEqual(pipeline, Pipeline(self.ext2rdr))
+ self.assertEqual(hash(pipeline), hash(Pipeline(self.ext2rdr)))
- # equivalence respects prefix
- self.assertNotEqual(pipeline, Pipeline(bsfs.URI('http://example.com/global/ent#'), self.ext2rdr))
- self.assertNotEqual(hash(pipeline), hash(Pipeline(bsfs.URI('http://example.com/global/ent#'), self.ext2rdr)))
# equivalence respects extractors/readers
ext2rdr = {ext: rdr for idx, (ext, rdr) in enumerate(self.ext2rdr.items()) if idx % 2 == 0}
- self.assertNotEqual(pipeline, Pipeline(self.prefix, ext2rdr))
- self.assertNotEqual(hash(pipeline), hash(Pipeline(self.prefix, ext2rdr)))
+ self.assertNotEqual(pipeline, Pipeline(ext2rdr))
+ self.assertNotEqual(hash(pipeline), hash(Pipeline(ext2rdr)))
# equivalence respects schema
- p2 = Pipeline(self.prefix, self.ext2rdr)
- p2._schema = pipeline.schema.Empty()
+ p2 = Pipeline(self.ext2rdr)
+ p2._schema = bsfs.schema.Schema()
self.assertNotEqual(pipeline, p2)
self.assertNotEqual(hash(pipeline), hash(p2))
@@ -91,17 +81,17 @@ class TestPipeline(unittest.TestCase):
def test_call(self):
# build pipeline
- pipeline = Pipeline(self.prefix, self.ext2rdr)
+ pipeline = Pipeline(self.ext2rdr)
# build objects for tests
content_hash = 'a948904f2f0f479b8f8197694b30184b0d2ed1c1cd2a1ec0fb85d299a192a447'
- subject = node.Node(ns.bsfs.File, (self.prefix + 'file#')[content_hash])
+ subject = node.Node(ns.bsn.Entity, ucid=content_hash)
testfile = os.path.join(os.path.dirname(__file__), 'testfile.t')
p_filename = pipeline.schema.predicate(ns.bse.filename)
p_filesize = pipeline.schema.predicate(ns.bse.filesize)
p_author = pipeline.schema.predicate(ns.bse.author)
p_rating = pipeline.schema.predicate(ns.bse.rating)
- entity = pipeline.schema.node(ns.bsfs.File)
- p_invalid = pipeline.schema.predicate(ns.bsfs.Predicate).get_child(ns.bse.foo, range=entity)
+ entity = pipeline.schema.node(ns.bsn.Entity)
+ p_invalid = pipeline.schema.predicate(ns.bsfs.Predicate).child(ns.bse.foo, range=entity)
# extract given predicates
self.assertSetEqual(set(pipeline(testfile, {p_filename, p_filesize})), {
@@ -139,8 +129,8 @@ class TestPipeline(unittest.TestCase):
def __call__(self, path):
raise errors.ReaderError('reader error')
- pipeline = Pipeline(self.prefix, {bsie.extractor.generic.path.Path(): FaultyReader()})
- with self.assertLogs(logging.getLogger('bsie.tools.pipeline'), logging.ERROR):
+ pipeline = Pipeline({bsie.extractor.generic.path.Path(): FaultyReader()})
+ with self.assertLogs(logging.getLogger('bsie.lib.pipeline'), logging.ERROR):
testfile = os.path.join(os.path.dirname(__file__), 'testfile.t')
p_filename = pipeline.schema.predicate(ns.bse.filename)
self.assertSetEqual(set(pipeline(testfile, {p_filename})), set())
@@ -150,15 +140,15 @@ class TestPipeline(unittest.TestCase):
def extract(self, subject, content, predicates):
raise errors.ExtractorError('extractor error')
- pipeline = Pipeline(self.prefix, {FaultyExtractor(): bsie.reader.path.Path()})
- with self.assertLogs(logging.getLogger('bsie.tools.pipeline'), logging.ERROR):
+ pipeline = Pipeline({FaultyExtractor(): bsie.reader.path.Path()})
+ with self.assertLogs(logging.getLogger('bsie.lib.pipeline'), logging.ERROR):
testfile = os.path.join(os.path.dirname(__file__), 'testfile.t')
p_filename = pipeline.schema.predicate(ns.bse.filename)
self.assertSetEqual(set(pipeline(testfile, {p_filename})), set())
def test_predicates(self):
# build pipeline
- pipeline = Pipeline(self.prefix, self.ext2rdr)
+ pipeline = Pipeline(self.ext2rdr)
#
self.assertSetEqual(set(pipeline.principals), {
pipeline.schema.predicate(ns.bse.filename),
diff --git a/test/tools/__init__.py b/test/reader/image/__init__.py
index e69de29..e69de29 100644
--- a/test/tools/__init__.py
+++ b/test/reader/image/__init__.py
diff --git a/test/reader/image/load_nef.py b/test/reader/image/load_nef.py
new file mode 100644
index 0000000..02be470
--- /dev/null
+++ b/test/reader/image/load_nef.py
@@ -0,0 +1,23 @@
+
+# standard imports
+import os
+
+# external imports
+import requests
+
+# constants
+IMAGE_URL = 'http://igsor.net/eik7AhvohghaeN5.nef'
+
+## code ##
+
+def get():
+ """Download a raw test image."""
+ target = os.path.join(os.path.dirname(__file__), 'testimage.nef')
+ if not os.path.exists(target):
+ with open(target, 'wb') as ofile:
+ ans = requests.get(IMAGE_URL)
+ ofile.write(ans.content)
+
+
+
+## EOF ##
diff --git a/test/reader/image/test_image.py b/test/reader/image/test_image.py
new file mode 100644
index 0000000..ee9b8f9
--- /dev/null
+++ b/test/reader/image/test_image.py
@@ -0,0 +1,49 @@
+
+# standard imports
+import importlib
+import os
+import unittest
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.image import Image
+
+
+## code ##
+
+class TestImage(unittest.TestCase):
+ def setUp(self):
+ if __package__ is None or __package__ == '': # direct call or local discovery
+ importlib.import_module('load_nef', __package__).get()
+ else: # parent discovery
+ importlib.import_module('.load_nef', __package__).get()
+
+ def test_construct(self):
+ image = Image({})
+ self.assertIsInstance(image, Image)
+ self.assertEqual(len(image._children), 2)
+
+ def test_call(self):
+ image = Image({})
+ # call returns raw image
+ img = image(os.path.join(os.path.dirname(__file__), 'testimage.nef'))
+        self.assertEqual(img.size, (6016, 4016)) # FIXME: update when the test image is replaced
+ img.close()
+ # call returns jpeg image
+ img = image(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ self.assertEqual(img.size, (1, 1))
+ img.close()
+ # call raises error if file cannot be read
+ self.assertRaises(errors.ReaderError, image,
+ os.path.join(os.path.dirname(__file__), 'invalid.nef'))
+ self.assertRaises(errors.ReaderError, image,
+ os.path.join(os.path.dirname(__file__), 'invalid.jpg'))
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/image/test_pillow.py b/test/reader/image/test_pillow.py
new file mode 100644
index 0000000..2cff768
--- /dev/null
+++ b/test/reader/image/test_pillow.py
@@ -0,0 +1,39 @@
+
+# standard imports
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.image._pillow import PillowImage
+
+
+## code ##
+
+class TestPillowImage(unittest.TestCase):
+ def test_call(self):
+ rdr = PillowImage()
+ # returns PIL image
+ img = rdr(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ self.assertEqual(img.size, (1, 1))
+ self.assertEqual(img.getdata().getpixel((0, 0)), (0, 0, 0))
+ img.close()
+ # raises exception when image cannot be read
+ self.assertRaises(errors.ReaderError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.jpg'))
+ # NOTE: PIL can actually read raw image files (returns the thumbnail)
+ #self.assertRaises(errors.ReaderError, rdr,
+ # os.path.join(os.path.dirname(__file__), 'testimage.nef'))
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/image/test_raw_image.py b/test/reader/image/test_raw_image.py
new file mode 100644
index 0000000..3b240d0
--- /dev/null
+++ b/test/reader/image/test_raw_image.py
@@ -0,0 +1,48 @@
+
+# standard imports
+import importlib
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.image._raw import RawImage
+
+
+## code ##
+
+class TestRawImage(unittest.TestCase):
+ def setUp(self):
+ if __package__ is None or __package__ == '': # direct call or local discovery
+ importlib.import_module('load_nef', __package__).get()
+ else: # parent discovery
+ importlib.import_module('.load_nef', __package__).get()
+
+ def test_call(self):
+ rdr = RawImage()
+ # returns PIL image
+ img = rdr(os.path.join(os.path.dirname(__file__), 'testimage.nef'))
+        self.assertEqual(img.size, (6016, 4016)) # FIXME: update when the test image is replaced
+ #self.assertEqual(img.size, (1, 1))
+ #self.assertEqual(img.getdata().getpixel((0, 0)), (0, 0, 0))
+ img.close()
+ # raises exception when image cannot be read
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ self.assertRaises(errors.ReaderError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.nef'))
+
+
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/image/testimage.jpg b/test/reader/image/testimage.jpg
new file mode 100644
index 0000000..ea7af63
--- /dev/null
+++ b/test/reader/image/testimage.jpg
Binary files differ
diff --git a/test/reader/preview/__init__.py b/test/reader/preview/__init__.py
new file mode 100644
index 0000000..e69de29
--- /dev/null
+++ b/test/reader/preview/__init__.py
diff --git a/test/reader/preview/invalid.foo b/test/reader/preview/invalid.foo
new file mode 100644
index 0000000..e69de29
--- /dev/null
+++ b/test/reader/preview/invalid.foo
diff --git a/test/reader/preview/invalid.jpg b/test/reader/preview/invalid.jpg
new file mode 100644
index 0000000..e69de29
--- /dev/null
+++ b/test/reader/preview/invalid.jpg
diff --git a/test/reader/preview/load_nef.py b/test/reader/preview/load_nef.py
new file mode 100644
index 0000000..02be470
--- /dev/null
+++ b/test/reader/preview/load_nef.py
@@ -0,0 +1,23 @@
+
+# standard imports
+import os
+
+# external imports
+import requests
+
+# constants
+IMAGE_URL = 'http://igsor.net/eik7AhvohghaeN5.nef'
+
+## code ##
+
+def get():
+ """Download a raw test image."""
+ target = os.path.join(os.path.dirname(__file__), 'testimage.nef')
+ if not os.path.exists(target):
+ with open(target, 'wb') as ofile:
+ ans = requests.get(IMAGE_URL)
+ ofile.write(ans.content)
+
+
+
+## EOF ##
diff --git a/test/reader/preview/test_pg.py b/test/reader/preview/test_pg.py
new file mode 100644
index 0000000..30095c5
--- /dev/null
+++ b/test/reader/preview/test_pg.py
@@ -0,0 +1,78 @@
+
+# standard imports
+from functools import partial
+import os
+import shutil
+import tempfile
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.preview._pg import PreviewGeneratorReader
+
+
+## code ##
+
+class TestPreviewGeneratorReader(unittest.TestCase):
+ def test_call(self):
+ rdr = PreviewGeneratorReader()
+ # inexistent file raises a ReaderError
+ self.assertRaises(errors.ReaderError, rdr,
+ os.path.join(os.path.dirname(__file__), 'missing.jpg'))
+ # unsupported file type raises an UnsupportedFileFormatError
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.foo'))
+ # invalid file raises a ReaderError
+ self.assertRaises(errors.ReaderError,
+ rdr(os.path.join(os.path.dirname(__file__), 'invalid.jpg')), 100)
+
+ # proper file produces a generator
+ gen = rdr(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ self.assertIsInstance(gen, partial)
+ # generator produces an image
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (10, 10))
+ self.assertEqual(sum(img.getdata()), 0)
+ # cleanup
+ img.close()
+
+ # preview generator can also extract data from non-image files
+ gen = rdr(os.path.join(os.path.dirname(__file__), 'testfile.pdf'))
+ self.assertIsInstance(gen, partial)
+ # generator produces an image
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (8, 10))
+ self.assertEqual(sum(img.getdata()), 20258)
+ # cleanup
+ img.close()
+ del rdr
+
+ # can define a cache dir
+ pg_dir = tempfile.mkdtemp(prefix='bsie-test')
+ self.assertTrue(os.path.exists(pg_dir))
+ rdr = PreviewGeneratorReader(cache=pg_dir)
+ gen = rdr(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (10, 10))
+ self.assertEqual(sum(img.getdata()), 0)
+ img.close()
+ del rdr
+ # cache dir still exists after instance deletion
+ self.assertTrue(os.path.exists(pg_dir))
+ shutil.rmtree(pg_dir, ignore_errors=True)
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/preview/test_pillow.py b/test/reader/preview/test_pillow.py
new file mode 100644
index 0000000..20f08ec
--- /dev/null
+++ b/test/reader/preview/test_pillow.py
@@ -0,0 +1,49 @@
+
+# standard imports
+from functools import partial
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.preview._pillow import PillowPreviewReader
+
+
+## code ##
+
+class TestPillowPreviewReader(unittest.TestCase):
+ def test_call(self):
+ rdr = PillowPreviewReader()
+ # raises exception when image cannot be read
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.jpg'))
+        # raises exception when the image file does not exist
+ self.assertRaises(errors.ReaderError, rdr,
+ os.path.join(os.path.dirname(__file__), 'inexistent.jpg'))
+ # raises exception when image has invalid type
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.foo'))
+
+ # proper file produces a generator
+ gen = rdr(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ self.assertIsInstance(gen, partial)
+ # generator produces an image
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (10, 10))
+ self.assertEqual(sum(band for pix in img.getdata() for band in pix), 0)
+ # cleanup
+ img.close()
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/preview/test_preview.py b/test/reader/preview/test_preview.py
new file mode 100644
index 0000000..e144877
--- /dev/null
+++ b/test/reader/preview/test_preview.py
@@ -0,0 +1,72 @@
+
+# standard imports
+from functools import partial
+import importlib
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.preview import Preview
+
+
+## code ##
+
+class TestPreview(unittest.TestCase):
+ def setUp(self):
+ if __package__ is None or __package__ == '': # direct call or local discovery
+ importlib.import_module('load_nef', __package__).get()
+ else: # parent discovery
+ importlib.import_module('.load_nef', __package__).get()
+
+ def test_construct(self):
+ preview = Preview()
+ self.assertIsInstance(preview, Preview)
+ self.assertEqual(len(preview._children), 3)
+
+ def test_call(self):
+ preview = Preview()
+ # call raises error if file cannot be read
+ self.assertRaises(errors.ReaderError, preview,
+ os.path.join(os.path.dirname(__file__), 'missing.jpg'))
+ self.assertRaises(errors.ReaderError, preview(
+ os.path.join(os.path.dirname(__file__), 'invalid.jpg')), 10)
+ self.assertRaises(errors.UnsupportedFileFormatError, preview,
+ os.path.join(os.path.dirname(__file__), 'invalid.foo'))
+
+ # call returns raw preview
+ gen = preview(os.path.join(os.path.dirname(__file__), 'testimage.nef'))
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (10, 8))
+ self.assertEqual(sum(band for pix in img.getdata() for band in pix), 25287)
+ img.close()
+
+ # call returns jpeg image
+ gen = preview(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (10, 10))
+ self.assertEqual(sum(band for pix in img.getdata() for band in pix), 0)
+ img.close()
+
+ # preview generator can also extract data from non-image files
+ gen = preview(os.path.join(os.path.dirname(__file__), 'testfile.pdf'))
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (8, 10))
+ self.assertEqual(sum(img.getdata()), 20258)
+ img.close()
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/preview/test_rawpy.py b/test/reader/preview/test_rawpy.py
new file mode 100644
index 0000000..11a6f9b
--- /dev/null
+++ b/test/reader/preview/test_rawpy.py
@@ -0,0 +1,54 @@
+
+# standard imports
+from functools import partial
+import importlib
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.preview._rawpy import RawpyPreviewReader
+
+
+## code ##
+
+class TestRawpyPreviewReader(unittest.TestCase):
+ def setUp(self):
+ if __package__ is None or __package__ == '': # direct call or local discovery
+ importlib.import_module('load_nef', __package__).get()
+ else: # parent discovery
+ importlib.import_module('.load_nef', __package__).get()
+
+ def test_call(self):
+ rdr = RawpyPreviewReader()
+ # raises exception when image cannot be read
+ self.assertRaises(errors.ReaderError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.nef'))
+ # raises exception when image has invalid type
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.jpg'))
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.foo'))
+ # proper file produces a generator
+ gen = rdr(os.path.join(os.path.dirname(__file__), 'testimage.nef'))
+ self.assertIsInstance(gen, partial)
+ # generator produces an image
+ img = gen(10)
+ self.assertIsInstance(img, PIL.Image.Image)
+ self.assertEqual(img.size, (10, 7))
+ self.assertEqual(sum(band for pix in img.getdata() for band in pix), 15269)
+ # cleanup
+ img.close()
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/preview/test_utils.py b/test/reader/preview/test_utils.py
new file mode 100644
index 0000000..2b15bc6
--- /dev/null
+++ b/test/reader/preview/test_utils.py
@@ -0,0 +1,39 @@
+
+# standard imports
+import os
+import unittest
+
+# external imports
+import PIL.Image
+
+# objects to test
+from bsie.reader.preview.utils import resize
+
+
+## code ##
+
+class TestUtils(unittest.TestCase):
+
+ def test_resize(self):
+ img = PIL.Image.open(os.path.join(os.path.dirname(__file__), 'testimage.jpg'))
+ landscape = img.resize((100, 80))
+ portrait = img.resize((80, 100))
+ self.assertEqual(img.size, (100, 100))
+ self.assertEqual(landscape.size, (100, 80))
+ self.assertEqual(portrait.size, (80, 100))
+ # resize can downscale
+ self.assertEqual(resize(img, 10).size, (10, 10))
+ self.assertEqual(resize(img, 20).size, (20, 20))
+ # resize can upscale
+ self.assertEqual(resize(img, 200).size, (200, 200))
+ # aspect ratio is preserved
+ self.assertEqual(resize(landscape, 10).size, (10, 8))
+ self.assertEqual(resize(portrait, 10).size, (8, 10))
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/preview/testfile.pdf b/test/reader/preview/testfile.pdf
new file mode 100644
index 0000000..592d448
--- /dev/null
+++ b/test/reader/preview/testfile.pdf
Binary files differ
diff --git a/test/reader/preview/testimage.jpg b/test/reader/preview/testimage.jpg
new file mode 100644
index 0000000..4c2aca5
--- /dev/null
+++ b/test/reader/preview/testimage.jpg
Binary files differ
diff --git a/test/base/test_reader.py b/test/reader/test_base.py
index a907eb9..5dd2855 100644
--- a/test/base/test_reader.py
+++ b/test/reader/test_base.py
@@ -1,19 +1,14 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import unittest
# objects to test
-from bsie import base
+from bsie.reader import Reader
## code ##
-class StubReader(base.Reader):
+class StubReader(Reader):
def __call__(self, path):
raise NotImplementedError()
diff --git a/test/reader/test_builder.py b/test/reader/test_builder.py
new file mode 100644
index 0000000..84e8e7a
--- /dev/null
+++ b/test/reader/test_builder.py
@@ -0,0 +1,49 @@
+
+# standard imports
+import unittest
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader import ReaderBuilder
+
+
+## code ##
+
+class TestReaderBuilder(unittest.TestCase):
+ def test_build(self):
+ builder = ReaderBuilder({'bsie.reader.path.Path': {}})
+ # build configured reader
+ cls = builder.build('bsie.reader.path.Path')
+ import bsie.reader.path
+ self.assertIsInstance(cls, bsie.reader.path.Path)
+ # build unconfigured reader
+ cls = builder.build('bsie.reader.stat.Stat')
+ import bsie.reader.stat
+ self.assertIsInstance(cls, bsie.reader.stat.Stat)
+ # re-build previous reader (test cache)
+ self.assertEqual(cls, builder.build('bsie.reader.stat.Stat'))
+ # test invalid
+ self.assertRaises(TypeError, builder.build, 123)
+ self.assertRaises(TypeError, builder.build, None)
+ self.assertRaises(ValueError, builder.build, '')
+ self.assertRaises(ValueError, builder.build, 'Path')
+ self.assertRaises(errors.BuilderError, builder.build, 'path.Path')
+ # invalid config
+ builder = ReaderBuilder({'bsie.reader.stat.Stat': dict(foo=123)})
+ self.assertRaises(errors.BuilderError, builder.build, 'bsie.reader.stat.Stat')
+ builder = ReaderBuilder({'bsie.reader.stat.Stat': 123})
+ self.assertRaises(TypeError, builder.build, 'bsie.reader.stat.Stat')
+ # no instructions
+ builder = ReaderBuilder({})
+ cls = builder.build('bsie.reader.stat.Stat')
+ self.assertIsInstance(cls, bsie.reader.stat.Stat)
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/test_chain.py b/test/reader/test_chain.py
new file mode 100644
index 0000000..665aabc
--- /dev/null
+++ b/test/reader/test_chain.py
@@ -0,0 +1,80 @@
+
+# standard imports
+import logging
+import os
+import unittest
+
+# bsie imports
+from bsie.utils import errors
+import bsie.reader.path
+import bsie.reader.stat
+
+# objects to test
+from bsie.reader.chain import ReaderChain
+
+
+## code ##
+
+class TestReaderChain(unittest.TestCase):
+ def test_construct(self):
+ # subreaders are built
+ chain = ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.path.Path'], {})
+ self.assertIsInstance(chain, ReaderChain)
+ self.assertEqual(chain._children,
+ (bsie.reader.stat.Stat(), bsie.reader.path.Path()))
+ # subreaders that failed to build are omitted
+ with self.assertLogs(logging.getLogger('bsie.reader.chain'), logging.WARNING):
+ chain = ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.invalid.Invalid'], {})
+ self.assertEqual(chain._children, (bsie.reader.stat.Stat(), ))
+ with self.assertLogs(logging.getLogger('bsie.reader.chain'), logging.WARNING):
+ chain = ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.path.Invalid'], {})
+ self.assertEqual(chain._children, (bsie.reader.stat.Stat(), ))
+ # warning is issued if there are no subreaders
+ with self.assertLogs(logging.getLogger('bsie.reader.chain'), logging.WARNING):
+ chain = ReaderChain([], {})
+ self.assertEqual(chain._children, tuple())
+
+ def test_essentials(self):
+ chain = ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.path.Path'], {})
+ # identity
+ self.assertEqual(chain, chain)
+ self.assertEqual(hash(chain), hash(chain))
+ # comparison works across instances
+ self.assertEqual(chain,
+ ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.path.Path'], {}))
+ self.assertEqual(hash(chain),
+ hash(ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.path.Path'], {})))
+ # comparison respects subreaders
+        self.assertNotEqual(chain,
+            ReaderChain(['bsie.reader.path.Path'], {}))
+ self.assertNotEqual(hash(chain),
+ hash(ReaderChain(['bsie.reader.path.Path'], {})))
+ # comparison respects subreader order
+ self.assertNotEqual(chain,
+ ReaderChain(['bsie.reader.path.Path', 'bsie.reader.stat.Stat'], {}))
+ self.assertNotEqual(hash(chain),
+ hash(ReaderChain(['bsie.reader.path.Path', 'bsie.reader.stat.Stat'], {})))
+ # string representation
+ self.assertEqual(str(chain), 'ReaderChain(Stat, Path)')
+ self.assertEqual(repr(chain), 'ReaderChain((Stat(), Path()))')
+
+ def test_call(self):
+ chain = ReaderChain(['bsie.reader.stat.Stat', 'bsie.reader.path.Path'], {})
+        # chain probes the first child first
+        self.assertEqual(chain(__file__), os.stat(__file__))
+        # chain falls back to the second child if the first one fails
+ self.assertEqual(chain(''), '')
+ self.assertEqual(chain('missing-file'), 'missing-file')
+
+        # chain raises a ReaderError once all children are exhausted
+        chain = ReaderChain(['bsie.reader.stat.Stat'], {})
+        # the only child fails, so the chain raises
+        self.assertRaises(errors.ReaderError, chain, '')
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/test_exif.py b/test/reader/test_exif.py
new file mode 100644
index 0000000..de6e801
--- /dev/null
+++ b/test/reader/test_exif.py
@@ -0,0 +1,52 @@
+
+# standard imports
+import os
+import unittest
+
+# external imports
+import pyexiv2
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.reader.exif import Exif
+
+
+## code ##
+
+class TestExif(unittest.TestCase):
+ def test_call(self):
+ rdr = Exif()
+ # discards non-image files
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.doc'))
+ # raises on invalid image files
+ self.assertRaises(errors.UnsupportedFileFormatError, rdr,
+ os.path.join(os.path.dirname(__file__), 'invalid.jpg'))
+        # raises on corrupted exif metadata
+ pyexiv2.set_log_level(3) # suppress log message
+ self.assertRaises(errors.ReaderError, rdr,
+ os.path.join(os.path.dirname(__file__), 'testimage_exif_corrupted.jpg'))
+ # returns dict with exif info
+ self.assertDictEqual(rdr(os.path.join(os.path.dirname(__file__), 'testimage_exif.jpg')), {
+ 'Exif.Image.Artist': 'nobody',
+ 'Exif.Image.ExifTag': '110',
+ 'Exif.Image.ResolutionUnit': '2',
+ 'Exif.Image.XResolution': '300/1',
+ 'Exif.Image.YCbCrPositioning': '1',
+ 'Exif.Image.YResolution': '300/1',
+ 'Exif.Photo.ColorSpace': '65535',
+ 'Exif.Photo.ComponentsConfiguration': '1 2 3 0',
+ 'Exif.Photo.ExifVersion': '48 50 51 50',
+ 'Exif.Photo.FlashpixVersion': '48 49 48 48',
+ 'Exif.Photo.ISOSpeedRatings': '200',
+ })
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/reader/test_path.py b/test/reader/test_path.py
index fd7bc5a..f2eee06 100644
--- a/test/reader/test_path.py
+++ b/test/reader/test_path.py
@@ -1,10 +1,5 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import unittest
# objects to test
diff --git a/test/reader/test_stat.py b/test/reader/test_stat.py
index d12ad9c..f36b8b3 100644
--- a/test/reader/test_stat.py
+++ b/test/reader/test_stat.py
@@ -1,15 +1,10 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import os
import unittest
# bsie imports
-from bsie.base import errors
+from bsie.utils import errors
# objects to test
from bsie.reader.stat import Stat
diff --git a/test/reader/testimage_exif.jpg b/test/reader/testimage_exif.jpg
new file mode 100644
index 0000000..a774bc2
--- /dev/null
+++ b/test/reader/testimage_exif.jpg
Binary files differ
diff --git a/test/reader/testimage_exif_corrupted.jpg b/test/reader/testimage_exif_corrupted.jpg
new file mode 100644
index 0000000..e51a9dc
--- /dev/null
+++ b/test/reader/testimage_exif_corrupted.jpg
Binary files differ
diff --git a/test/tools/test_builder.py b/test/tools/test_builder.py
deleted file mode 100644
index 62c637c..0000000
--- a/test/tools/test_builder.py
+++ /dev/null
@@ -1,246 +0,0 @@
-"""
-
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
-import logging
-import unittest
-
-# bsie imports
-from bsie import base
-from bsie.utils import bsfs
-
-# objects to test
-from bsie.tools.builder import ExtractorBuilder
-from bsie.tools.builder import PipelineBuilder
-from bsie.tools.builder import ReaderBuilder
-from bsie.tools.builder import _safe_load
-from bsie.tools.builder import _unpack_name
-
-
-## code ##
-
-class TestUtils(unittest.TestCase):
- def test_safe_load(self):
- # invalid module
- self.assertRaises(base.errors.LoaderError, _safe_load, 'dBGHMSAYOoKeKMpywDoKZQycENFPvN', 'foobar')
- self.assertRaises(base.errors.LoaderError, _safe_load, 'dBGHMSAYOoKeKMpywDoKZQycENFPvN.bar', 'foobar')
- # partially valid module
- self.assertRaises(base.errors.LoaderError, _safe_load, 'os.foo', 'foobar')
- # invalid class
- self.assertRaises(base.errors.LoaderError, _safe_load, 'os.path', 'foo')
- # valid module and class
- cls = _safe_load('collections.abc', 'Container')
- import collections.abc
- self.assertEqual(cls, collections.abc.Container)
-
- def test_unpack_name(self):
- self.assertRaises(TypeError, _unpack_name, 123)
- self.assertRaises(TypeError, _unpack_name, None)
- self.assertRaises(ValueError, _unpack_name, '')
- self.assertRaises(ValueError, _unpack_name, 'path')
- self.assertRaises(ValueError, _unpack_name, '.Path')
- self.assertEqual(_unpack_name('path.Path'), ('path', 'Path'))
- self.assertEqual(_unpack_name('path.foo.bar.Path'), ('path.foo.bar', 'Path'))
-
-
-class TestReaderBuilder(unittest.TestCase):
- def test_build(self):
- builder = ReaderBuilder({'bsie.reader.path.Path': {}})
- # build configured reader
- cls = builder.build('bsie.reader.path.Path')
- import bsie.reader.path
- self.assertIsInstance(cls, bsie.reader.path.Path)
- # build unconfigured reader
- cls = builder.build('bsie.reader.stat.Stat')
- import bsie.reader.stat
- self.assertIsInstance(cls, bsie.reader.stat.Stat)
- # re-build previous reader (test cache)
- self.assertEqual(cls, builder.build('bsie.reader.stat.Stat'))
- # test invalid
- self.assertRaises(TypeError, builder.build, 123)
- self.assertRaises(TypeError, builder.build, None)
- self.assertRaises(ValueError, builder.build, '')
- self.assertRaises(ValueError, builder.build, 'Path')
- self.assertRaises(base.errors.BuilderError, builder.build, 'path.Path')
- # invalid config
- builder = ReaderBuilder({'bsie.reader.stat.Stat': dict(foo=123)})
- self.assertRaises(base.errors.BuilderError, builder.build, 'bsie.reader.stat.Stat')
- builder = ReaderBuilder({'bsie.reader.stat.Stat': 123})
- self.assertRaises(TypeError, builder.build, 'bsie.reader.stat.Stat')
- # no instructions
- builder = ReaderBuilder({})
- cls = builder.build('bsie.reader.stat.Stat')
- self.assertIsInstance(cls, bsie.reader.stat.Stat)
-
-
-
-class TestExtractorBuilder(unittest.TestCase):
- def test_iter(self):
- # no specifications
- self.assertListEqual(list(ExtractorBuilder([])), [])
- # some specifications
- builder = ExtractorBuilder([
- {'bsie.extractor.generic.path.Path': {}},
- {'bsie.extractor.generic.stat.Stat': {}},
- {'bsie.extractor.generic.path.Path': {}},
- ])
- self.assertListEqual(list(builder), [0, 1, 2])
-
- def test_build(self):
- # simple and repeated extractors
- builder = ExtractorBuilder([
- {'bsie.extractor.generic.path.Path': {}},
- {'bsie.extractor.generic.stat.Stat': {}},
- {'bsie.extractor.generic.path.Path': {}},
- ])
- ext = [builder.build(0), builder.build(1), builder.build(2)]
- import bsie.extractor.generic.path
- import bsie.extractor.generic.stat
- self.assertListEqual(ext, [
- bsie.extractor.generic.path.Path(),
- bsie.extractor.generic.stat.Stat(),
- bsie.extractor.generic.path.Path(),
- ])
- # out-of-bounds raises KeyError
- self.assertRaises(IndexError, builder.build, 3)
-
- # building with args
- builder = ExtractorBuilder([
- {'bsie.extractor.generic.constant.Constant': {
- 'schema': '''
- bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:string ;
- bsfs:unique "true"^^xsd:boolean .
- bse:rating rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:integer ;
- bsfs:unique "true"^^xsd:boolean .
- ''',
- 'tuples': [
- ('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I'),
- ('http://bsfs.ai/schema/Entity#rating', 123),
- ],
- }}])
- obj = builder.build(0)
- import bsie.extractor.generic.constant
- self.assertEqual(obj, bsie.extractor.generic.constant.Constant('''
- bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:string ;
- bsfs:unique "true"^^xsd:boolean .
- bse:rating rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:integer ;
- bsfs:unique "true"^^xsd:boolean .
- ''', [
- ('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I'),
- ('http://bsfs.ai/schema/Entity#rating', 123),
- ]))
-
- # building with invalid args
- self.assertRaises(base.errors.BuilderError, ExtractorBuilder(
- [{'bsie.extractor.generic.path.Path': {'foo': 123}}]).build, 0)
- # non-dict build specification
- self.assertRaises(TypeError, ExtractorBuilder(
- [('bsie.extractor.generic.path.Path', {})]).build, 0)
- # multiple keys per build specification
- self.assertRaises(TypeError, ExtractorBuilder(
- [{'bsie.extractor.generic.path.Path': {},
- 'bsie.extractor.generic.stat.Stat': {}}]).build, 0)
- # non-dict value for kwargs
- self.assertRaises(TypeError, ExtractorBuilder(
- [{'bsie.extractor.generic.path.Path': 123}]).build, 0)
-
-
-
-
-class TestPipelineBuilder(unittest.TestCase):
- def test_build(self):
- prefix = bsfs.URI('http://example.com/local/file#')
- c_schema = '''
- bse:author rdfs:subClassOf bsfs:Predicate ;
- rdfs:domain bsfs:Entity ;
- rdfs:range xsd:string ;
- bsfs:unique "true"^^xsd:boolean .
- '''
- c_tuples = [('http://bsfs.ai/schema/Entity#author', 'Me, myself, and I')]
- # prepare builders
- rbuild = ReaderBuilder({})
- ebuild = ExtractorBuilder([
- {'bsie.extractor.generic.path.Path': {}},
- {'bsie.extractor.generic.stat.Stat': {}},
- {'bsie.extractor.generic.constant.Constant': dict(
- schema=c_schema,
- tuples=c_tuples,
- )},
- ])
- # build pipeline
- builder = PipelineBuilder(prefix, rbuild, ebuild)
- pipeline = builder.build()
- # delayed import
- import bsie.reader.path
- import bsie.reader.stat
- import bsie.extractor.generic.path
- import bsie.extractor.generic.stat
- import bsie.extractor.generic.constant
- # check pipeline
- self.assertDictEqual(pipeline._ext2rdr, {
- bsie.extractor.generic.path.Path(): bsie.reader.path.Path(),
- bsie.extractor.generic.stat.Stat(): bsie.reader.stat.Stat(),
- bsie.extractor.generic.constant.Constant(c_schema, c_tuples): None,
- })
-
- # fail to load extractor
- ebuild_err = ExtractorBuilder([
- {'bsie.extractor.generic.foo.Foo': {}},
- {'bsie.extractor.generic.path.Path': {}},
- ])
- with self.assertLogs(logging.getLogger('bsie.tools.builder'), logging.ERROR):
- pipeline = PipelineBuilder(prefix, rbuild, ebuild_err).build()
- self.assertDictEqual(pipeline._ext2rdr, {
- bsie.extractor.generic.path.Path(): bsie.reader.path.Path()})
-
- # fail to build extractor
- ebuild_err = ExtractorBuilder([
- {'bsie.extractor.generic.path.Path': {'foo': 123}},
- {'bsie.extractor.generic.path.Path': {}},
- ])
- with self.assertLogs(logging.getLogger('bsie.tools.builder'), logging.ERROR):
- pipeline = PipelineBuilder(prefix, rbuild, ebuild_err).build()
- self.assertDictEqual(pipeline._ext2rdr, {
- bsie.extractor.generic.path.Path(): bsie.reader.path.Path()})
-
- # fail to load reader
- with self.assertLogs(logging.getLogger('bsie.tools.builder'), logging.ERROR):
- # switch reader of an extractor
- old_reader = bsie.extractor.generic.path.Path.CONTENT_READER
- bsie.extractor.generic.path.Path.CONTENT_READER = 'bsie.reader.foo.Foo'
- # build pipeline with invalid reader reference
- pipeline = PipelineBuilder(prefix, rbuild, ebuild).build()
- self.assertDictEqual(pipeline._ext2rdr, {
- bsie.extractor.generic.stat.Stat(): bsie.reader.stat.Stat(),
- bsie.extractor.generic.constant.Constant(c_schema, c_tuples): None,
- })
- # switch back
- bsie.extractor.generic.path.Path.CONTENT_READER = old_reader
-
- # fail to build reader
- rbuild_err = ReaderBuilder({'bsie.reader.stat.Stat': dict(foo=123)})
- with self.assertLogs(logging.getLogger('bsie.tools.builder'), logging.ERROR):
- pipeline = PipelineBuilder(prefix, rbuild_err, ebuild).build()
- self.assertDictEqual(pipeline._ext2rdr, {
- bsie.extractor.generic.path.Path(): bsie.reader.path.Path(),
- bsie.extractor.generic.constant.Constant(c_schema, c_tuples): None,
- })
-
-
-## main ##
-
-if __name__ == '__main__':
- unittest.main()
-
-## EOF ##
diff --git a/test/tools/testfile.t b/test/tools/testfile.t
deleted file mode 100644
index 3b18e51..0000000
--- a/test/tools/testfile.t
+++ /dev/null
@@ -1 +0,0 @@
-hello world
diff --git a/test/utils/filematcher/__init__.py b/test/utils/filematcher/__init__.py
new file mode 100644
index 0000000..e69de29
--- /dev/null
+++ b/test/utils/filematcher/__init__.py
diff --git a/test/utils/filematcher/empty b/test/utils/filematcher/empty
new file mode 100644
index 0000000..e69de29
--- /dev/null
+++ b/test/utils/filematcher/empty
diff --git a/test/utils/filematcher/test_matcher.py b/test/utils/filematcher/test_matcher.py
new file mode 100644
index 0000000..88e96c2
--- /dev/null
+++ b/test/utils/filematcher/test_matcher.py
@@ -0,0 +1,227 @@
+
+# standard imports
+import os
+import stat
+import tempfile
+import unittest
+
+# objects to test
+from bsie.utils.filematcher import matcher
+
+
+## code ##
+
+class FakeMatcher(matcher.Matcher):
+ def __call__(self, *args, **kwargs):
+ pass
+
+class FakeCriterion(matcher.Criterion):
+ def __call__(self, *args, **kwargs):
+ pass
+
+class FakeAggregate(matcher.Aggregate):
+ def __call__(self, *args, **kwargs):
+ pass
+
+class TestMatcher(unittest.TestCase):
+ def setUp(self):
+ # paths
+ self.image = os.path.join(os.path.dirname(__file__), 'testimage.jpg')
+        self.text = os.path.join(os.path.dirname(__file__), 'textfile.t')
+ self.empty = os.path.join(os.path.dirname(__file__), 'empty')
+ self.missing = os.path.join(os.path.dirname(__file__), 'missing.jpg')
+
+ def test_matcher_skeleton(self):
+ # node: iteration and length
+ self.assertSetEqual(set(iter(FakeMatcher(1,2,3))), {1,2,3})
+ self.assertSetEqual(set(iter(FakeMatcher([1,2,3]))), {1,2,3})
+ self.assertEqual(len(FakeMatcher([1,2,3])), 3)
+ self.assertEqual(len(FakeMatcher(1,2,3)), 3)
+ self.assertEqual(len(FakeMatcher()), 0)
+ self.assertIn(1, FakeMatcher(1,2,3))
+ self.assertIn(3, FakeMatcher([1,2,3]))
+ self.assertNotIn(0, FakeMatcher(1,2,3))
+ self.assertNotIn(4, FakeMatcher([1,2,3]))
+ # node: comparison
+ self.assertEqual(FakeMatcher([1,2,3]), FakeMatcher([1,2,3]))
+ self.assertEqual(FakeMatcher(1,2,3), FakeMatcher(1,2,3))
+ self.assertEqual(FakeMatcher(1,2,3), FakeMatcher([1,2,3]))
+ self.assertEqual(FakeMatcher(1,2,3), FakeMatcher((1,2,3)))
+ self.assertNotEqual(FakeMatcher(1,2,3), FakeMatcher(1,2,4))
+ self.assertNotEqual(FakeMatcher(1,2,3), FakeMatcher(1,2,3,4))
+ self.assertNotEqual(FakeMatcher(1,2,3), FakeMatcher(1,2))
+ self.assertEqual(hash(FakeMatcher([1,2,3])), hash(FakeMatcher([1,2,3])))
+ self.assertEqual(hash(FakeMatcher(1,2,3)), hash(FakeMatcher(1,2,3)))
+ self.assertEqual(hash(FakeMatcher(1,2,3)), hash(FakeMatcher([1,2,3])))
+ self.assertEqual(hash(FakeMatcher(1,2,3)), hash(FakeMatcher((1,2,3))))
+ # node: representation
+ self.assertEqual(repr(FakeMatcher(1,2,3)), 'FakeMatcher({1, 2, 3})')
+
+ # criterion
+ self.assertEqual(repr(FakeCriterion(1,2,3)), 'FakeCriterion({1, 2, 3})')
+ self.assertEqual(hash(FakeCriterion(1,2,3)), hash(FakeCriterion(1,2,3)))
+ self.assertEqual(FakeCriterion(1,2,3), FakeCriterion([1,2,3]))
+ self.assertNotEqual(FakeCriterion(1,2,3), FakeCriterion(1,2))
+ self.assertNotEqual(FakeCriterion(1,2,3), FakeMatcher(1,2,3))
+ self.assertSetEqual(FakeCriterion(1,2,3).accepted(), {1,2,3})
+
+ # aggregate
+ self.assertEqual(repr(FakeAggregate(1,2,3)), 'FakeAggregate({1, 2, 3})')
+ self.assertNotEqual(FakeAggregate(1,2,3), FakeMatcher(1,2,3))
+
+ def test_any(self):
+ self.assertTrue(matcher.Any()(self.image))
+ self.assertTrue(matcher.Any()(self.text))
+ self.assertTrue(matcher.Any()(self.missing))
+ self.assertTrue(matcher.Any()(self.empty))
+
+ def test_nothing(self):
+ self.assertFalse(matcher.Nothing()(self.image))
+ self.assertFalse(matcher.Nothing()(self.text))
+ self.assertFalse(matcher.Nothing()(self.missing))
+ self.assertFalse(matcher.Nothing()(self.empty))
+
+ def test_exists(self):
+ self.assertTrue(matcher.Exists()(self.image))
+ self.assertTrue(matcher.Exists()(self.text))
+ self.assertTrue(matcher.Exists()(self.empty))
+ self.assertFalse(matcher.Exists()(self.missing))
+
+ def test_isfile(self):
+ self.assertTrue(matcher.IsFile()(self.image))
+ self.assertTrue(matcher.IsFile()(self.text))
+ self.assertFalse(matcher.IsFile()(self.missing))
+ self.assertFalse(matcher.IsFile()(os.path.dirname(self.image)))
+
+ def test_isdir(self):
+ self.assertTrue(matcher.IsDir()(os.path.dirname(self.image)))
+ self.assertFalse(matcher.IsDir()(self.image))
+ self.assertFalse(matcher.IsDir()(self.text))
+ self.assertFalse(matcher.IsDir()(self.missing))
+
+ def test_islink(self):
+ self.assertFalse(matcher.IsLink()(os.path.dirname(self.image)))
+ self.assertFalse(matcher.IsLink()(self.image))
+ self.assertFalse(matcher.IsLink()(self.text))
+ _, temp = tempfile.mkstemp(prefix='bsie-test-')
+ templink = temp + '-link'
+ os.symlink(temp, templink)
+ self.assertTrue(matcher.IsLink()(templink))
+ os.unlink(templink)
+ os.unlink(temp)
+
+ def test_isabs(self):
+ self.assertTrue(matcher.IsAbs()(os.path.abspath(self.image)))
+ self.assertTrue(matcher.IsAbs()(os.path.abspath(self.text)))
+ self.assertFalse(matcher.IsAbs()(os.path.relpath(self.text, os.path.dirname(self.text))))
+
+ def test_isrel(self):
+ self.assertFalse(matcher.IsRel()(os.path.abspath(self.image)))
+ self.assertFalse(matcher.IsRel()(os.path.abspath(self.text)))
+ self.assertTrue(matcher.IsRel()(os.path.relpath(self.text, os.path.dirname(self.text))))
+ self.assertTrue(matcher.IsRel()(os.path.basename(self.text)))
+
+ def test_ismount(self):
+ self.assertFalse(matcher.IsMount()(self.image))
+ self.assertFalse(matcher.IsMount()(self.text))
+ self.assertFalse(matcher.IsMount()(self.missing))
+ # there's no reasonable way to test a positive case
+
+ def test_isempty(self):
+ self.assertTrue(matcher.IsEmpty()(self.empty))
+ self.assertFalse(matcher.IsEmpty()(self.image))
+ self.assertFalse(matcher.IsEmpty()(self.text))
+ self.assertFalse(matcher.IsEmpty()(self.missing))
+
+ def test_isreadable(self):
+ self.assertTrue(matcher.IsReadable()(self.empty))
+ self.assertTrue(matcher.IsReadable()(self.image))
+ self.assertFalse(matcher.IsReadable()(self.missing))
+ _, temp = tempfile.mkstemp(prefix='bsie-test-')
+ os.chmod(temp, 0)
+ self.assertFalse(matcher.IsReadable()(temp))
+ os.unlink(temp)
+
+ def test_iswritable(self):
+ self.assertTrue(matcher.IsWritable()(self.empty))
+ self.assertTrue(matcher.IsWritable()(self.image))
+ self.assertFalse(matcher.IsWritable()(self.missing))
+ _, temp = tempfile.mkstemp(prefix='bsie-test-')
+ os.chmod(temp, 0)
+ self.assertFalse(matcher.IsWritable()(temp))
+ os.unlink(temp)
+
+ def test_isexecutable(self):
+ self.assertFalse(matcher.IsExecutable()(self.empty))
+ self.assertFalse(matcher.IsExecutable()(self.image))
+ self.assertFalse(matcher.IsExecutable()(self.missing))
+ _, temp = tempfile.mkstemp(prefix='bsie-test-')
+ os.chmod(temp, stat.S_IEXEC)
+ self.assertTrue(matcher.IsExecutable()(temp))
+ os.unlink(temp)
+
+ def test_extension(self):
+ self.assertTrue(matcher.Extension('jpg')(self.image))
+ self.assertTrue(matcher.Extension('jpg', 'png')(self.image))
+ self.assertTrue(matcher.Extension('jpg', 't')(self.text))
+ self.assertTrue(matcher.Extension('jpg', 'png', 't')(self.missing))
+ self.assertTrue(matcher.Extension('')(self.empty))
+
+ self.assertFalse(matcher.Extension()(self.image))
+ self.assertFalse(matcher.Extension('jpeg')(self.image))
+ self.assertFalse(matcher.Extension('.t')(self.text))
+ self.assertFalse(matcher.Extension('png', 't')(self.missing))
+ self.assertFalse(matcher.Extension('tiff')(self.empty))
+
+ def test_mime(self):
+ self.assertTrue(matcher.Mime('image/jpeg')(self.image))
+ self.assertTrue(matcher.Mime('image/tiff', 'image/jpeg')(self.image))
+ self.assertTrue(matcher.Mime('text/plain', 'image/jpeg')(self.text))
+ self.assertTrue(matcher.Mime('inode/x-empty')(self.empty))
+
+ self.assertFalse(matcher.Mime()(self.image))
+ self.assertFalse(matcher.Mime('image')(self.image))
+ self.assertFalse(matcher.Mime('image/tiff', 'image/png')(self.image))
+ self.assertFalse(matcher.Mime('')(self.text))
+ self.assertFalse(matcher.Mime('text')(self.text))
+ self.assertFalse(matcher.Mime('tiff')(self.empty))
+ self.assertFalse(matcher.Mime()(self.empty))
+ self.assertFalse(matcher.Mime('')(self.empty))
+ self.assertFalse(matcher.Mime()(self.missing))
+ self.assertFalse(matcher.Mime('')(self.missing))
+ self.assertFalse(matcher.Mime('inode/x-empty')(self.missing))
+
+ def test_not(self):
+ self.assertFalse(matcher.NOT(matcher.Mime('image/jpeg'))(self.image))
+ self.assertTrue(matcher.NOT(matcher.Mime('text/plain'))(self.image))
+
+ def test_and(self):
+ self.assertTrue(matcher.And(matcher.Mime('image/jpeg'), matcher.Extension('jpg'))(self.image))
+ self.assertTrue(matcher.And(matcher.Mime('image/jpeg'), matcher.Extension('jpg', 'tiff'))(self.image))
+ self.assertTrue(matcher.And(matcher.Mime('text/plain'), matcher.Extension('t', 'tiff'))(self.text))
+
+ self.assertFalse(matcher.And(matcher.Mime('image/jpeg'), matcher.Extension('tiff'))(self.image))
+ self.assertFalse(matcher.And(matcher.Mime('text/plain'), matcher.Extension('jpg'))(self.image))
+ self.assertFalse(matcher.And(matcher.Mime('inode/x-empty'), matcher.Extension('jpg'))(self.missing))
+ self.assertFalse(matcher.And(matcher.Mime('image/jpeg'), matcher.Extension('jpg', 't'))(self.text))
+
+ def test_or(self):
+ self.assertTrue(matcher.Or(matcher.Mime('image/jpeg'))(self.image))
+ self.assertFalse(matcher.Or(matcher.Mime('text/plain'))(self.image))
+
+ self.assertTrue(matcher.Or(matcher.Mime('image/jpeg'), matcher.Extension('jpg'))(self.image))
+ self.assertTrue(matcher.Or(matcher.Mime('image/jpeg'), matcher.Extension('t'))(self.image))
+ self.assertTrue(matcher.Or(matcher.Mime('text/plain'), matcher.Extension('jpg', 'tiff'))(self.image))
+ self.assertTrue(matcher.Or(matcher.Mime('text/plain'), matcher.Extension('tiff'))(self.text))
+ self.assertTrue(matcher.Or(matcher.Mime('image/jpeg'), matcher.Extension('jpg'))(self.missing))
+
+ self.assertFalse(matcher.Or(matcher.Mime('text/plain'), matcher.Extension('tiff'))(self.image))
+ self.assertFalse(matcher.Or(matcher.Mime('inode/x-empty'), matcher.Extension('jpg', 'tiff'))(self.text))
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/utils/filematcher/test_parser.py b/test/utils/filematcher/test_parser.py
new file mode 100644
index 0000000..536db00
--- /dev/null
+++ b/test/utils/filematcher/test_parser.py
@@ -0,0 +1,141 @@
+
+# standard imports
+import unittest
+
+# bsie imports
+from bsie.utils import errors
+from bsie.utils.filematcher import matcher
+
+# objects to test
+from bsie.utils.filematcher import parse
+
+
+## code ##
+
+class TestFileMatcherParser(unittest.TestCase):
+ def test_empty(self):
+ # no criterion
+ self.assertEqual(parse(''), matcher.Any())
+
+ def test_ruleone(self):
+ # single criterion, single value
+ self.assertEqual(parse('mime=text'), matcher.Mime('text'))
+ self.assertEqual(parse('MIME=text'), matcher.Mime('text'))
+ self.assertEqual(parse('MiMe=text'), matcher.Mime('text'))
+ self.assertEqual(parse('MIME=TEXT'), matcher.Mime('TEXT'))
+ self.assertEqual(parse('mime={text}'), matcher.Mime('text'))
+ self.assertEqual(parse('mime=image/jpeg'), matcher.Mime('image/jpeg'))
+ self.assertEqual(parse('mime="image/jpeg"'), matcher.Mime('image/jpeg'))
+ self.assertEqual(parse('extension=pdf'), matcher.Extension('pdf'))
+ self.assertEqual(parse('extension={pdf}'), matcher.Extension('pdf'))
+ self.assertEqual(parse('extension="pdf"'), matcher.Extension('pdf'))
+ self.assertEqual(parse('extension="foo,bar"'), matcher.Extension('foo,bar'))
+ self.assertEqual(parse('extension="f{oo|ba}r"'), matcher.Extension('f{oo|ba}r'))
+ self.assertEqual(parse('extension=""'), matcher.Extension(''))
+ self.assertEqual(parse('extension="foo'), matcher.Extension('"foo'))
+ self.assertRaises(errors.ParserError, parse, 'extension=foo=bar')
+ self.assertRaises(errors.ParserError, parse, 'extension=')
+ self.assertRaises(errors.ParserError, parse, 'extension={}')
+ self.assertRaises(errors.ParserError, parse, 'extension={foo')
+
+ # valueless
+ self.assertEqual(parse('any'), matcher.Any())
+ self.assertEqual(parse('nothing'), matcher.Nothing())
+ self.assertEqual(parse('exists'), matcher.Exists())
+ self.assertEqual(parse('any, nothing'), matcher.And(matcher.Any(), matcher.Nothing()))
+ self.assertEqual(parse('any, nothing, exists'),
+ matcher.And(matcher.Any(), matcher.Nothing(), matcher.Exists()))
+ self.assertEqual(parse('any, extension=jpg'), matcher.And(matcher.Any(), matcher.Extension('jpg')))
+ self.assertRaises(errors.ParserError, parse, 'mime')
+ self.assertRaises(errors.ParserError, parse, 'extension')
+ self.assertRaises(errors.ParserError, parse, 'exists=True')
+ self.assertRaises(errors.ParserError, parse, 'exists=foo')
+ self.assertEqual(parse('!any'), matcher.NOT(matcher.Any()))
+ self.assertEqual(parse('!any, nothing'), matcher.And(matcher.NOT(matcher.Any()), matcher.Nothing()))
+ self.assertEqual(parse('!any, extension=jpg'),
+ matcher.And(matcher.NOT(matcher.Any()), matcher.Extension('jpg')))
+ self.assertRaises(errors.ParserError, parse, '!mime')
+ self.assertRaises(errors.ParserError, parse, '!extension')
+
+ def test_rulefew(self):
+ # single criterion, multiple values
+ self.assertEqual(parse('extension={jpg, jpeg}'), matcher.Extension('jpg', 'jpeg'))
+ self.assertEqual(parse('mime={image/jpeg, image/png}'),
+ matcher.Mime('image/jpeg', 'image/png'))
+ self.assertRaises(errors.ParserError, parse, 'mime=image/png, image/jpeg')
+ self.assertRaises(errors.ParserError, parse, 'extension=jpg, jpeg')
+
+ def test_rulesets_ruleone(self):
+        # multiple criteria, single value
+ self.assertEqual(parse('mime=text, extension=t'),
+ matcher.And(matcher.Mime('text'), matcher.Extension('t')))
+ self.assertEqual(parse('mime=text/plain, extension=t'),
+ matcher.And(matcher.Mime('text/plain'), matcher.Extension('t')))
+ self.assertRaises(errors.ParserError, parse, 'mime=text/plain extension=t')
+        self.assertRaises(errors.ParserError, parse, 'mime={image/jpeg, extension=jpg')
+
+ def test_rulesets_rulefew(self):
+ # multiple criteria, multiple values
+ self.assertEqual(parse('mime=image/jpeg, extension={jpg, jpeg}'),
+ matcher.And(matcher.Mime('image/jpeg'), matcher.Extension('jpg', 'jpeg')))
+ self.assertEqual(parse('mime={image/jpeg, image/tiff}, extension={jpg, jpeg}'),
+ matcher.And(matcher.Mime('image/jpeg', 'image/tiff'), matcher.Extension('jpg', 'jpeg')))
+ self.assertEqual(parse('mime={image/jpeg, image/tiff}, extension=jpg'),
+ matcher.And(matcher.Mime('image/jpeg', 'image/tiff'), matcher.Extension('jpg')))
+ self.assertRaises(errors.ParserError, parse, 'mime={image/jpeg, image/tiff, extension=jpg')
+ self.assertRaises(errors.ParserError, parse, 'mime=image/jpeg, image/tiff, extension=jpg')
+ self.assertRaises(errors.ParserError, parse, 'mime=image/jpeg, extension=jpg, ')
+
+ def test_not(self):
+ self.assertEqual(parse('extension!=jpg'), matcher.NOT(matcher.Extension('jpg')))
+ self.assertEqual(parse('extension!={jpg, jpeg}'),
+ matcher.NOT(matcher.Extension('jpg', 'jpeg')))
+ self.assertEqual(parse('extension!=jpg, mime=image/jpeg'),
+ matcher.And(matcher.NOT(matcher.Extension('jpg')), matcher.Mime('image/jpeg')))
+ self.assertEqual(parse('extension!=jpg, mime!=image/jpeg'),
+ matcher.And(matcher.NOT(matcher.Extension('jpg')), matcher.NOT(matcher.Mime('image/jpeg'))))
+ self.assertEqual(parse('extension!=jpg | mime=image/jpeg'),
+ matcher.Or(matcher.NOT(matcher.Extension('jpg')), matcher.Mime('image/jpeg')))
+ self.assertEqual(parse('extension!=jpg | mime!=image/jpeg'),
+ matcher.Or(matcher.NOT(matcher.Extension('jpg')), matcher.NOT(matcher.Mime('image/jpeg'))))
+
+ def test_expr(self):
+ # multiple rulesets
+ self.assertEqual(parse('mime=image/jpeg | extension=jpg'),
+ matcher.Or(matcher.Mime('image/jpeg'), matcher.Extension('jpg')))
+ self.assertEqual(parse('mime=image/jpeg | extension={jpg, jpeg}'),
+ matcher.Or(matcher.Mime('image/jpeg'), matcher.Extension('jpg', 'jpeg')))
+ self.assertEqual(parse('mime={image/jpeg, image/png} | extension={jpg, jpeg}'),
+ matcher.Or(matcher.Mime('image/jpeg', 'image/png'), matcher.Extension('jpg', 'jpeg')))
+ self.assertEqual(parse('mime=image/jpeg , extension=jpg | extension=jpg'),
+ matcher.Or(matcher.And(matcher.Mime('image/jpeg'), matcher.Extension('jpg')), matcher.Extension('jpg')))
+ self.assertEqual(parse(
+ 'mime={jpeg, text}, extension={jpg,t} | extension={png,txt}, mime={png, tiff}'),
+ matcher.Or(
+ matcher.And(matcher.Mime('jpeg', 'text'), matcher.Extension('jpg', 't')),
+ matcher.And(matcher.Extension('png', 'txt'), matcher.Mime('png', 'tiff'))))
+ self.assertEqual(parse('mime=text | extension=jpg | extension=png | mime=png'),
+ matcher.Or(matcher.Mime('text'), matcher.Extension('jpg'), matcher.Extension('png'), matcher.Mime('png')))
+ self.assertRaises(errors.ParserError, parse, 'mime=text |')
+ self.assertRaises(errors.ParserError, parse, '| mime=text')
+ self.assertRaises(errors.ParserError, parse, 'extension=png | mime=text, ')
+
+ def test_invalid(self):
+ # Invalid parses
+ self.assertRaises(errors.ParserError, parse, "extension=") # Empty value
+ self.assertRaises(errors.ParserError, parse, "mime=foo,bar") # Escaping
+ self.assertRaises(errors.ParserError, parse, "mime='foo,bar") # Quoting
+ self.assertRaises(errors.ParserError, parse, "mime=\"foo,bar") # Quoting
+
+ # Invalid input
+ self.assertRaises(AttributeError, parse, None)
+ self.assertRaises(AttributeError, parse, 123)
+ self.assertRaises(AttributeError, parse, [123,321])
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/utils/filematcher/testimage.jpg b/test/utils/filematcher/testimage.jpg
new file mode 100644
index 0000000..ea7af63
--- /dev/null
+++ b/test/utils/filematcher/testimage.jpg
Binary files differ
diff --git a/test/utils/filematcher/textfile.t b/test/utils/filematcher/textfile.t
new file mode 100644
index 0000000..c389011
--- /dev/null
+++ b/test/utils/filematcher/textfile.t
@@ -0,0 +1,4 @@
+Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.
+Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.
+Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
+Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.
diff --git a/test/utils/test_loading.py b/test/utils/test_loading.py
new file mode 100644
index 0000000..b8773ab
--- /dev/null
+++ b/test/utils/test_loading.py
@@ -0,0 +1,43 @@
+
+# standard imports
+import unittest
+
+# bsie imports
+from bsie.utils import errors
+
+# objects to test
+from bsie.utils.loading import safe_load, unpack_qualified_name
+
+
+## code ##
+
+class TestUtils(unittest.TestCase):
+ def test_safe_load(self):
+ # invalid module
+ self.assertRaises(errors.LoaderError, safe_load, 'dBGHMSAYOoKeKMpywDoKZQycENFPvN', 'foobar')
+ self.assertRaises(errors.LoaderError, safe_load, 'dBGHMSAYOoKeKMpywDoKZQycENFPvN.bar', 'foobar')
+ # partially valid module
+ self.assertRaises(errors.LoaderError, safe_load, 'os.foo', 'foobar')
+ # invalid class
+ self.assertRaises(errors.LoaderError, safe_load, 'os.path', 'foo')
+ # valid module and class
+ cls = safe_load('collections.abc', 'Container')
+ import collections.abc
+ self.assertEqual(cls, collections.abc.Container)
+
+ def test_unpack_qualified_name(self):
+ self.assertRaises(TypeError, unpack_qualified_name, 123)
+ self.assertRaises(TypeError, unpack_qualified_name, None)
+ self.assertRaises(ValueError, unpack_qualified_name, '')
+ self.assertRaises(ValueError, unpack_qualified_name, 'path')
+ self.assertRaises(ValueError, unpack_qualified_name, '.Path')
+ self.assertEqual(unpack_qualified_name('path.Path'), ('path', 'Path'))
+ self.assertEqual(unpack_qualified_name('path.foo.bar.Path'), ('path.foo.bar', 'Path'))
+
+
+## main ##
+
+if __name__ == '__main__':
+ unittest.main()
+
+## EOF ##
diff --git a/test/utils/test_node.py b/test/utils/test_node.py
index c70f0b8..c0662a1 100644
--- a/test/utils/test_node.py
+++ b/test/utils/test_node.py
@@ -1,10 +1,5 @@
-"""
-Part of the bsie test suite.
-A copy of the license is provided with the project.
-Author: Matthias Baumgartner, 2022
-"""
-# imports
+# standard imports
import unittest
# bsie imports
@@ -18,22 +13,54 @@ from bsie.utils.node import Node
class TestNode(unittest.TestCase):
def test_equality(self):
- uri = bsfs.URI('http://example.com/me/entity#1234')
- node = Node(ns.bsfs.Entity, uri)
- # basic equivalence
- self.assertEqual(node, Node(ns.bsfs.Entity, bsfs.URI('http://example.com/me/entity#1234')))
- self.assertEqual(hash(node), hash(Node(ns.bsfs.Entity, bsfs.URI('http://example.com/me/entity#1234'))))
+ uri1 = bsfs.URI('http://example.com/me/entity#1234')
+ uri2 = bsfs.URI('http://example.com/me/entity#4321')
+ node = Node(ns.bsfs.Entity, uri1)
# equality respects uri
- self.assertNotEqual(node, Node(ns.bsfs.Entity, bsfs.URI('http://example.com/me/entity#4321')))
- self.assertNotEqual(hash(node), hash(Node(ns.bsfs.Entity, bsfs.URI('http://example.com/me/entity#4321'))))
+ self.assertEqual(node, Node(ns.bsfs.Entity, uri1))
+ self.assertEqual(hash(node), hash(Node(ns.bsfs.Entity, uri1)))
+ self.assertNotEqual(node, Node(ns.bsfs.Entity, uri2))
+ self.assertNotEqual(hash(node), hash(Node(ns.bsfs.Entity, uri2)))
+ # equality respects hints
+ self.assertEqual(
+ Node(ns.bsfs.Entity, foo='foo'),
+ Node(ns.bsfs.Entity, foo='foo'))
+ self.assertEqual(
+ hash(Node(ns.bsfs.Entity, foo='foo')),
+ hash(Node(ns.bsfs.Entity, foo='foo')))
+ self.assertNotEqual(
+ Node(ns.bsfs.Entity, foo='foo'),
+ Node(ns.bsfs.Entity, foo='bar'))
+ self.assertNotEqual(
+ hash(Node(ns.bsfs.Entity, foo='foo')),
+ hash(Node(ns.bsfs.Entity, foo='bar')))
+ self.assertNotEqual(
+ Node(ns.bsfs.Entity, foo='bar'),
+ Node(ns.bsfs.Entity, bar='foo'))
+ self.assertNotEqual(
+ hash(Node(ns.bsfs.Entity, foo='bar')),
+ hash(Node(ns.bsfs.Entity, bar='foo')))
+ # hints are irrelevant if uri is set
+ self.assertEqual(
+ Node(ns.bsfs.Entity, uri=uri1, foo='bar'),
+ Node(ns.bsfs.Entity, uri=uri1, bar='foo'))
+ self.assertEqual(
+ hash(Node(ns.bsfs.Entity, uri=uri1, foo='bar')),
+ hash(Node(ns.bsfs.Entity, uri=uri1, bar='foo')))
+ self.assertNotEqual(
+ Node(ns.bsfs.Entity, uri=uri1, foo='bar'),
+ Node(ns.bsfs.Entity, uri=uri2, bar='foo'))
+ self.assertNotEqual(
+ hash(Node(ns.bsfs.Entity, uri=uri1, foo='bar')),
+ hash(Node(ns.bsfs.Entity, uri=uri2, bar='foo')))
# equality respects node_type
- self.assertNotEqual(node, Node(ns.bsfs.Foo, uri))
- self.assertNotEqual(hash(node), hash(Node(ns.bsfs.Foo, uri)))
+ self.assertNotEqual(node, Node(ns.bsfs.Foo, uri1))
+ self.assertNotEqual(hash(node), hash(Node(ns.bsfs.Foo, uri1)))
# not equal to other types
self.assertNotEqual(node, 1234)
self.assertNotEqual(hash(node), hash(1234))
- self.assertNotEqual(node, uri)
- self.assertNotEqual(hash(node), hash(uri))
+ self.assertNotEqual(node, uri1)
+ self.assertNotEqual(hash(node), hash(uri1))
self.assertNotEqual(node, ns.bsfs.Entity)
self.assertNotEqual(hash(node), hash(ns.bsfs.Entity))
class Foo(): pass
@@ -43,17 +70,17 @@ class TestNode(unittest.TestCase):
def test_str(self):
uri = bsfs.URI('http://example.com/me/entity#1234')
# basic string conversion
- node = Node(ns.bsfs.Entity, uri)
- self.assertEqual(str(node), 'Node(http://bsfs.ai/schema/Entity, http://example.com/me/entity#1234)')
- self.assertEqual(repr(node), 'Node(http://bsfs.ai/schema/Entity, http://example.com/me/entity#1234)')
+ node = Node(ns.bsn.Entity, uri)
+ self.assertEqual(str(node), 'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/entity#1234)')
+ self.assertEqual(repr(node), 'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/entity#1234)')
# string conversion respects node_type
- node = Node(ns.bsfs.Foo, uri)
- self.assertEqual(str(node), 'Node(http://bsfs.ai/schema/Foo, http://example.com/me/entity#1234)')
- self.assertEqual(repr(node), 'Node(http://bsfs.ai/schema/Foo, http://example.com/me/entity#1234)')
+ node = Node(ns.bsn.Foo, uri)
+ self.assertEqual(str(node), 'Node(https://schema.bsfs.io/ie/Node/Foo, http://example.com/me/entity#1234)')
+ self.assertEqual(repr(node), 'Node(https://schema.bsfs.io/ie/Node/Foo, http://example.com/me/entity#1234)')
# string conversion respects uri
- node = Node(ns.bsfs.Entity, bsfs.URI('http://example.com/me/entity#4321'))
- self.assertEqual(str(node), 'Node(http://bsfs.ai/schema/Entity, http://example.com/me/entity#4321)')
- self.assertEqual(repr(node), 'Node(http://bsfs.ai/schema/Entity, http://example.com/me/entity#4321)')
+ node = Node(ns.bsn.Entity, bsfs.URI('http://example.com/me/entity#4321'))
+ self.assertEqual(str(node), 'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/entity#4321)')
+ self.assertEqual(repr(node), 'Node(https://schema.bsfs.io/ie/Node/Entity, http://example.com/me/entity#4321)')