Metadata-Version: 2.2
Name: gca_analyzer
Version: 0.4.0
Summary: A package for Group Conversation Analysis with improved text processing and visualization
Home-page: https://github.com/etShaw-zh/gca_analyzer
Author: Jianjun Xiao
Author-email: et_shaw@126.com
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.12
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas>=1.3.0
Requires-Dist: numpy>=1.20.0
Requires-Dist: jieba>=0.42.1
Requires-Dist: scikit-learn>=1.0.0
Requires-Dist: matplotlib>=3.4.0
Requires-Dist: seaborn>=0.11.0
Requires-Dist: networkx>=2.6.0
Requires-Dist: plotly>=5.3.0
Requires-Dist: loguru>=0.7.0
Requires-Dist: torch>=2.0.0
Requires-Dist: transformers>=4.30.0
Requires-Dist: sentence-transformers>=2.2.0
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# GCA Analyzer

A Python package for analyzing group conversation dynamics using NLP techniques and quantitative metrics.

English | [中文](README_zh.md) | [日本語](README_ja.md) | [한국어](README_ko.md)

## Features

- **Multi-language Support**: Built-in support for Chinese and other languages through LLM models
- **Comprehensive Metrics**: Analyzes group interactions through multiple dimensions
- **Automated Analysis**: Finds optimal analysis windows and generates detailed statistics
- **Flexible Configuration**: Customizable parameters for different analysis needs
- **Easy Integration**: Command-line interface and Python API support

## Quick Start

### Installation

```bash
# Install from PyPI
pip install gca_analyzer

# For development
git clone https://github.com/etShaw-zh/gca_analyzer.git
cd gca_analyzer
pip install -e .
```

### Basic Usage

1. Prepare your conversation data in CSV format with required columns:
```
conversation_id,person_id,time,text
1A,student1,0:08,Hello teacher!
1A,teacher,0:10,Hello everyone!
```

2. Run analysis:
```bash
python -m gca_analyzer --data your_data.csv
```

3. Descriptive statistics for GCA measures:

The analyzer generates comprehensive statistics for the following measures:

![Descriptive Statistics](/docs/_static/gca_results.jpg)

- **Participation**
   - Measures relative contribution frequency
   - Negative values indicate below-average participation
   - Positive values indicate above-average participation

- **Responsivity**
   - Measures how well participants respond to others
   - Higher values indicate better response behavior

- **Internal Cohesion**
   - Measures consistency in individual contributions
   - Higher values indicate more coherent messaging

- **Social Impact**
   - Measures influence on group discussion
   - Higher values indicate stronger impact on others

- **Newness**
   - Measures introduction of new content
   - Higher values indicate more novel contributions

- **Communication Density**
   - Measures information content per message
   - Higher values indicate more information-rich messages

Results are saved as CSV files in the specified output directory.

## Citation

If you use this tool in your research, please cite:

```bibtex
@software{gca_analyzer,
  title = {GCA Analyzer: Group Conversation Analysis Tool},
  author = {Xiao, Jianjun},
  year = {2025},
  url = {https://github.com/etShaw-zh/gca_analyzer}
}
