Metadata-Version: 2.1
Name: filoc
Version: 0.0.9
Summary: eases the saving and reading of files within a structured folder tree
Home-page: https://github.com/jeromerg/filoc
Author: jeromerg
Author-email: jeromerg@gmx.net
License: UNKNOWN
Description: ![Tests](https://github.com/jeromerg/filoc/workflows/Tests/badge.svg)
        
        FiLoc File Locator
        ==================
        
        Save, read and analyze files in structured folders: Your files are the database. 
        
        Machine Learning Swiss Knife
        ----------------------------
        
        See the following [example](https://htmlpreview.github.io/?https://github.com/jeromerg/filoc/blob/master/examples/example_ml.html)
        
        In machine learning, we use it to create simulation hyperparameters set, schedule simulations, record results, and analyze hundreds of machine learning experiences 
        without the need of any database nor external services.
        
        
        Basic
        -----
        
        ```python
        import os
        from filoc import Filoc
        
        loc = Filoc('/data/simid={simid:d}/epid={epid:d}/settings.json') 
        
        #--------------------------------------------------------------
        # Build the path for simid=0 and epid=1, then write to the file
        #--------------------------------------------------------------
        path1 = loc.get_path(simid=0, epid=1)  # /data/simid=0/epid=1/settings.json
        os.makedirs(os.path.dirname(path1))
        with open(path1, 'w') as f:
            f.write('Coucou')
        
        #--------------------------------------------------------------
        # Same for simid=0 and epid=1 with a more compact syntax
        #--------------------------------------------------------------
        with loc.open(dict(simid=0, epid=2), 'w') as f:  # file handle to /data/simid=0/epid=2/settings.json
            f.write('Salut')
        
        #--------------------------------------------------------------
        # Find all files on file system having simid=0
        #--------------------------------------------------------------
        paths = loc.find_paths(simid=0)          # ['/data/simid=0/epid=1/settings.json', '/data/simid=0/epid=2/settings.json']
        
        #--------------------------------------------------------------
        # Extract properties for path paths[0]
        #--------------------------------------------------------------
        props = loc.get_path_properties(paths[0]) # { 'simid': 0, 'epid': 1 }
        
        #--------------------------------------------------------------
        # More compact form to get both the paths and their related properties
        #--------------------------------------------------------------
        for path, props in loc.find_paths_and_properties(simid=0):
            print(path, props)
        
        #--------------------------------------------------------------
        # Analyse all files and build a report
        #--------------------------------------------------------------
        def get_file_report(f):
            content = f.read()
            return { 'content_length' : len(content) }
        
        report = loc.report(dict(simid=0), get_file_report, 'r')
        
        print(report) #-> [{"simid":0, "epid":1, "content_length":6}, {"simid":0, "epid":2, "content_length":5}]
        
        from pandas import DataFrame
        print(DataFrame(report)) #-> convert report to pandas DataFrame 
        
        #--------------------------------------------------------------
        # Analyse all files and build a report
        #--------------------------------------------------------------
        def my_analysis(f):
            content = f.read()
            return { 'content_length' : len(content) }
        
        fimap = Filoc(loc, my_analysis)
        
        report = fimap.read_contents(simid=0)
        
        print(report) #-> [{"simid":0, "epid":1, "content_length":6}, {"simid":0, "epid":2, "content_length":5}]
        
        from pandas import DataFrame
        print(DataFrame(report)) #-> convert report to pandas DataFrame 
        
        ```    
        
        Install
        -------
        
            pip install filoc
        
        Syntax
        ------
        
        The FiLoc constructor accepts a file path/url, which will finally be interpreted by [fsspec](https://pypi.org/project/fsspec). 
        That way, it is possible to access ftp, HDFS or any other file repository supported by fsspec. 
        The path is at the same time a format string with named placeholder, which will be parsed by the [parse library](https://pypi.org/project/parse/).
        Each placeholder defines a *property* associated to the files to save, retrieve or analyse.
         
        ToDos
        -----
        
        - Readme Doc
        - API Doc 
        - File locking
        - Unit Test 
            - Multiloc with different aggregation layers (snowflake)
            - File locking
        - Dev: Logging of edited files
        - Dev: File watcher
        - Dev: Excel File watcher / binder
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3
Description-Content-Type: text/markdown
