Introducing CorpusCraft Beta: Professional Corpus Linguistics in Your Browser

Yaroslav Mar
We're thrilled to announce the beta launch of CorpusCraft, a comprehensive platform designed specifically for linguists and researchers who work with text corpora.

What is CorpusCraft?

CorpusCraft is a professional browser-based corpus linguistics platform that brings powerful research tools to your fingertips without requiring software installation. Whether you're conducting lexical analysis, studying language change over time, or exploring linguistic patterns, CorpusCraft provides the tools you need.

Key Features

Full-Text Search with Advanced Operators

Our search engine supports regex patterns, boolean operators, and proximity searches, giving you precise control over your queries. Find exactly what you're looking for with powerful pattern matching and filtering capabilities.

KWIC Concordance Analysis

Generate customizable Key Word In Context displays to examine how words and phrases are used in their natural context. Perfect for discourse analysis and usage studies. See how words function in different contexts, identify patterns, and extract meaningful insights from your corpus.

Multi-Language NLP Processing

Process texts in 8 languages (English, Spanish, Russian, French, German, Chinese, Japanese, and Arabic) with complete linguistic analysis including:

  • Lemmatization and POS tagging - Understand grammatical structure and reduce words to their base forms
  • Dependency parsing - Analyze syntactic relationships between words in sentences
  • Named Entity Recognition - Automatically identify and classify people, places, organizations, and more
  • Morphological feature analysis - Extract detailed linguistic features like tense, case, number, and gender
  • Stopword filtering - Focus on meaningful content by removing common function words

AI-Powered Analysis

Leverage cutting-edge AI for advanced linguistic research:

  • Auto-classification - Automatically categorize documents by topic, genre, or custom taxonomies
  • Theme discovery - Identify recurring themes and patterns across your corpus
  • Smart document summarization - Generate concise summaries of long texts
  • Natural language queries - Ask questions about your corpus in plain English
  • Writing style analysis - Compare authorial styles and detect stylistic patterns
  • Semantic similarity - Find documents with similar meanings, even with different wording
  • Sentiment analysis - Track emotional tone and attitudes across texts
  • Diachronic language change detection - Study how language evolves over time

Advanced Visualizations

Create word clouds, frequency distribution charts, collocation networks, and lexical dispersion plots to visualize your findings and identify patterns at a glance. Export all visualizations in high-resolution formats for publications and presentations.

Flexible Metadata Management

Define custom metadata schemas to organize your corpus according to your research needs. Tag documents with publication date, author, genre, register, geographic location, or any custom field relevant to your study. Use metadata to filter searches and create targeted sub-corpora.

Collaboration Tools

Share corpora with colleagues, manage role-based permissions (viewer, editor, admin), and work together on research projects in real-time. Add comments and annotations to documents, and track changes across team members.

Pricing Plans

We offer flexible pricing to suit different research needs:

Free Plan - $0/month

  • 10 documents (50K tokens)
  • Search & KWIC
  • Frequency analysis
  • NLP processing
  • AI features not included

Academic Plan - $99/year - Perfect for verified academics

  • 500K tokens corpus
  • 25 AI analyses per month
  • 5 advanced GPT-5.1 queries
  • NLP processing
  • Requires verification

Researcher Plan - $39/month

  • 500K tokens corpus
  • 55 AI analyses per month
  • 5 advanced GPT-5.1 queries
  • NLP processing
  • Collaboration (3 users)

Professional Plan - $99/month

  • 2M tokens corpus
  • 220 AI analyses per month
  • 20 advanced GPT-5.1 queries
  • Priority processing
  • Collaboration (10 users)

Institution Plan - $249/month

  • 10M tokens corpus
  • 1100 AI analyses per month
  • 100 advanced GPT-5.1 queries
  • REST API access
  • Unlimited collaborators

Try It Now - Interactive Demo

Want to see CorpusCraft in action before signing up? We've created an interactive demo corpus with sample texts including Shakespeare, academic papers on linguistics and AI, and scientific research articles.

In the demo, you can:

  • Perform full-text searches with advanced operators
  • Generate KWIC concordances
  • Analyze word frequencies and n-grams
  • Run complete NLP processing across 8 languages
  • Create visualizations like word clouds and frequency charts

No signup required to try the demo - just click and start exploring!

Get Started Today

Ready to transform your corpus linguistics research?

Start building your first corpus today at corpuscraft.org. No credit card required for the free plan.

Built for Researchers

CorpusCraft was designed from the ground up for academic researchers and professional linguists. We understand the challenges of corpus linguistics research because we're linguists ourselves. Every feature has been carefully crafted to support rigorous linguistic analysis while remaining accessible and easy to use.

Whether you're a PhD student working on your dissertation, a professor conducting research, or a professional linguist analyzing real-world data, CorpusCraft provides the tools you need to do your best work.

We're Here to Help

We're continuously improving CorpusCraft based on user feedback. If you have suggestions, feature requests, or encounter any issues, please don't hesitate to reach out through our contact page.

Join our growing community of researchers and discover what CorpusCraft can do for your linguistic research.

Happy researching!

The CorpusCraft Team

Try CorpusCraft

Start building and analyzing your text corpora with research-grade tools. No installation required.