Metadata-Version: 1.1
Name: qalsadi
Version: 0.4.2
Summary: Qalsadi Arabic Morphological Analyzer and lemmatizer for Python
Home-page: http://qalsadi.sourceforge.net/
Author: Taha Zerrouki
Author-email: taha. zerrouki@gmail .com
License: GPL
Description: Qalsadi Arabic Morphological Analyzer for Python
        ================================================
        
        Developpers: Taha Zerrouki: http://tahadz.com taha dot zerrouki at gmail
        dot com
        
        +---------+------------------------------------------------------------------+
        | Feature | value                                                            |
        | s       |                                                                  |
        +=========+==================================================================+
        | Authors | `Authors.md <https://github.com/linuxscout/qalsadi/master/AUTHOR |
        |         | S.md>`__                                                         |
        +---------+------------------------------------------------------------------+
        | Release | 0.3.6                                                            |
        +---------+------------------------------------------------------------------+
        | License | `GPL <https://github.com/linuxscout/qalsadi/master/LICENSE>`__   |
        +---------+------------------------------------------------------------------+
        | Tracker | `linuxscout/qalsadi/Issues <https://github.com/linuxscout/qalsad |
        |         | i/issues>`__                                                     |
        +---------+------------------------------------------------------------------+
        | Website | https://pypi.python.org/pypi/qalsadi                             |
        +---------+------------------------------------------------------------------+
        | Doc     | `package Documentaion <http://pythonhosted.org/qalsadi/>`__      |
        +---------+------------------------------------------------------------------+
        | Source  | `Github <http://github.com/linuxscout/qalsadi>`__                |
        +---------+------------------------------------------------------------------+
        | Downloa | `sourceforge <http://qalsadi.sourceforge.net>`__                 |
        | d       |                                                                  |
        +---------+------------------------------------------------------------------+
        | Feedbac | `Comments <http://tahadz.com/qalsadi/contact>`__                 |
        | ks      |                                                                  |
        +---------+------------------------------------------------------------------+
        | Account | [@Twitter](https://twitter.com/linuxscout)                       |
        | s       | [@Sourceforge](http://sourceforge.net/projects/qalsadi/)         |
        +---------+------------------------------------------------------------------+
        
        Citation
        --------
        
        If you would cite it in academic work, can you use this citation
        
        ::
        
            T. Zerrouki‏, Qalsadi, Arabic mophological analyzer Library for python.,  https://pypi.python.org/pypi/qalsadi/
        
        
        Features مزايا
        --------------
        
        -  Arabic word Light Stemming.
        -  Features:
        
           -  Lemmatization
           -  Vocalized Text Analyzer,
           -  Use Qutrub library to analyze verbs.
           -  give word frequency in arabic modern use.
        
        -  Requirement:
        
           -  libQutrub: Qutrub verb conjugation library:
              http://pypi.pyton/LibQutrub
           -  PyArabic: Arabic language tools library :
              http://pypi.pyton/pyarabic
           -  Tashaphyne;Arabic Light Stemmer library :
              http://pypi.python.org/pypi/Tashaphyne/
        
        Applications
        ============
        
        -  Stemming texts
        -  Text Classification and categorization
        -  Sentiment Analysis
        -  Named Entities Recognition
        
        Installation
        ============
        
        ::
        
            pip install qalsadi
        
        Requirements
        ------------
        
        ::
        
            pip install -r requirements.txt 
        
        -  libQutrub: Qutrub verb conjugation library:
           http://pypi.pyton/LibQutrub
        -  PyArabic: Arabic language tools library : http://pypi.pyton/pyarabic
        -  Tashaphyne;Arabic Light Stemmer library :
           http://pypi.python.org/pypi/Tashaphyne/
        -  Naftawayh : Arabic words tagger: :
           http://pypi.python.org/pypi/Naftawayh/
        -  Arramooz-pysqlite : Arabic dictionary
        -  CodernityDB : No Sql native python database
        
        Usage
        =====
        
        Example
        -------
        
        .. code:: python
        
            >>> import qalsadi.lemmatizer 
        	>>> text = u"""هل تحتاج إلى ترجمة كي تفهم خطاب الملك؟ اللغة "الكلاسيكية" (الفصحى) موجودة في كل اللغات وكذلك اللغة "الدارجة" .. الفرنسية التي ندرس في المدرسة ليست الفرنسية التي يستخدمها الناس في شوارع باريس .. وملكة بريطانيا لا تخطب بلغة شوارع لندن .. لكل مقام مقال"""
        	>>> lemmer = qalsadi.lemmatizer.Lemmatizer()
        	>>> # lemmatize a word
        	... lemmer.lemmatize("يحتاج")
        	'احتاج'
        	>>> # lemmatize a word with a specific pos
        	>>> lemmer.lemmatize("وفي")
        	'في'
        	>>> lemmer.lemmatize("وفي", pos="v")
        	'وفى'
        
        	>>> 
        	>>> lemmas = lemmer.lemmatize_text(text)
        	>>> print(lemmas)
        	['هل', 'احتاج', 'إلى', 'ترجمة', 'كي', 'تفهم', 'خطاب', 'ملك', '؟', 'لغة', '"', 'كلاسيكي', '"(', 'فصحى', ')', 'موجود', 'في', 'كل', 'لغة', 'ذلك', 'لغة', '"', 'دارج', '"..', 'فرنسي', 'التي', 'درس', 'في', 'مدرسة', 'ليست', 'فرنسي', 'التي', 'استخدم', 'ناس', 'في', 'شوارع', 'باريس', '..', 'ملك', 'بريطانيا', 'لا', 'خطب', 'بلغة', 'شوارع', 'دنو', '..', 'كل', 'مقام', 'مقالي']
        	>>> # lemmatize a text and return lemma pos
        	... lemmas = lemmer.lemmatize_text(text, return_pos=True)
        	>>> print(lemmas)
        	[('هل', 'stopword'), ('احتاج', 'verb'), ('إلى', 'stopword'), ('ترجمة', 'noun'), ('كي', 'stopword'), ('تفهم', 'noun'), ('خطاب', 'noun'), ('ملك', 'noun'), '؟', ('لغة', 'noun'), '"', ('كلاسيكي', 'noun'), '"(', ('فصحى', 'noun'), ')', ('موجود', 'noun'), ('في', 'stopword'), ('كل', 'stopword'), ('لغة', 'noun'), ('ذلك', 'stopword'), ('لغة', 'noun'), '"', ('دارج', 'noun'), '"..', ('فرنسي', 'noun'), ('التي', 'stopword'), ('درس', 'verb'), ('في', 'stopword'), ('مدرسة', 'noun'), ('ليست', 'stopword'), ('فرنسي', 'noun'), ('التي', 'stopword'), ('استخدم', 'verb'), ('ناس', 'noun'), ('في', 'stopword'), ('شوارع', 'noun'), ('باريس', 'all'), '..', ('ملك', 'noun'), ('بريطانيا', 'noun'), ('لا', 'stopword'), ('خطب', 'verb'), ('بلغة', 'noun'), ('شوارع', 'noun'), ('دنو', 'verb'), '..', ('كل', 'stopword'), ('مقام', 'noun'), ('مقالي', 'noun')]
        
        	>>> 
        
        
        .. code:: python
        
            filename="samples/text.txt"
            import qalsadi.analex as qa
            try:
                myfile=open(filename)
                text=(myfile.read()).decode('utf8');
        
                if text == None:
                    text=u"السلام عليكم"
            except:
                text=u"أسلم"
                print " given text"
        
            debug=False;
            limit=500
            analyzer = qa.Analex()
            analyzer.set_debug(debug);
            result = analyzer.check_text(text);
            print '----------------python format result-------'
            print result
            for i in range(len(result)):
            #       print "--------تحليل كلمة  ------------", word.encode('utf8');
                print "-------------One word detailed case------";
                for analyzed in  result[i]:
                    print "-------------one case for word------";
                    print repr(analyzed);
        
        .. ~ Output description
        .. ~ ------------------
        
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | Category     | Applied on   | feature                 | شرح                                                       | example                                        |
        .. ~ +==============+==============+=========================+===========================================================+================================================+============+
        .. ~ | affix        | all          | affix\_key              | مفتاح الزوائد                                             | ال--َاتُ-                                      | البيانات   |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | affix        | all          | affix                   | الزوائد                                                   |                                                |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | input        | all          | word                    | الكلمة المدخلة                                            | البيانات                                       |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | input        | all          | unvocalized             | غير مشكول                                                 |                                                |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | noun         | tag\_mamnou3            | ممنوع من الصرف                                            | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | verb         | tag\_confirmed          | خاصية الفعل المؤكد                                        | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | verb         | tag\_mood               | حالة الفعل المضارع (منصوب، مجزوم، مرفوع)                  | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | verb         | tag\_pronoun            | الضمير                                                    | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | verb         | tag\_transitive         | التعدي اللزوم                                             | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | verb         | tag\_voice              | البناء للمعلوم/ البناء للمجهول                            | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | noun         | tag\_regular            | قياسي/ سماعي                                              | 1                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | noun/verb    | tag\_gender             | النوع ( مؤنث مذكر)                                        | 3                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | verb         | tag\_person             | الشخص (المتكلم الغائب المخاطب)                            | 4                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | morphology   | noun         | tag\_number             | العدد(فرد/مثنى/جمع)                                       | 21                                             |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | original     | noun/verb    | freq                    | درجة شيوع الكلمة                                          | 694644                                         |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | original     | all          | original\_tags          | خصائص الكلمة الأصلية                                      | (u                                             |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | original     | all          | original                | الكلمة الأصلية                                            | بَيَانٌ                                        |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | original     | all          | root                    | الجذر                                                     | بين                                            |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | original     | all          | tag\_original\_gender   | جنس الكلمة الأصلية                                        | مذكر                                           |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | original     | noun         | tag\_original\_number   | عدد الكلمة الأصلية                                        | مفرد                                           |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | output       | all          | type                    | نوع الكلمة                                                | Noun:مصدر                                      |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | output       | all          | semivocalized           | الكلمة مشكولة بدون علامة الإعراب                          | الْبَيَانَات                                   |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | output       | all          | vocalized               | الكلمةمشكولة                                              | الْبَيَانَاتُ                                  |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | output       | all          | stem                    | الجذع                                                     | بيان                                           |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | output       | all          | tags                    |                                                           | تعريف::جمع مؤنث سالم:مرفوع:متحرك:ينون:جمع:::   |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | all          | tag\_break              | الكلمة منفصلة عمّا قبلها                                  | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | all          | tag\_initial            | خاصية نحوية، الكلمة في بداية الجملة                       | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | all          | tag\_transparent        | البدل                                                     | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | noun         | tag\_added              | خاصية نحوية، الكلمة مضاف                                  | 0                                              |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | all          | need                    | الكلمة تحتاج إلى كلمة أخرى (المتعدي، العوامل) غير منجزة   |                                                |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | tool         | action                  | العمل                                                     |                                                |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        .. ~ | syntax       | tool         | object\_type            | نوع المعمول، بالنسبة للعامل، مثلا اسم لحرف الجر           |                                                |
        .. ~ +--------------+--------------+-------------------------+-----------------------------------------------------------+------------------------------------------------+------------+
        
        .. ~ Files
        .. ~ ~~~~~
        
        .. ~ -  file/directory category description
        
        .. ~ Featured Posts
        .. ~ --------------
        
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Natural Language :: Arabic
Classifier: Intended Audience :: Developers
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Topic :: Text Processing :: Linguistic
