Skip to Content

FAQ

Quick answers to common questions about NEO. Contact support: support@heyneo.so.


Quick Navigation


Getting Started

What is NEO and how does it work?

NEO is an autonomous ML agent that automates most stages of the pipeline, from data preparation to deployment.

Workflow:

  1. Describe your ML task in natural language
  2. Provide data (upload, URL, or cloud)
  3. NEO analyzes data and selects models
  4. Receive production-ready artifacts with documentation

Example task:

Build a customer churn prediction model using customer_data.csv. Optimize for recall since missing churners is costly.

NEO handles preprocessing, feature engineering, training, evaluation, and artifact generation automatically.

What ML tasks does NEO support?

NEO supports multiple ML domains:

  • Tabular ML – Classification, regression, clustering, time series
  • Computer vision – Image classification, object detection, OCR
  • NLP – Text classification, sentiment analysis, NER, summarization
  • Audio and speech – Speech recognition, audio classification
  • LLM fine-tuning – Instruction tuning, LoRA, domain adaptation
  • Anomaly detection – Outlier detection, fraud detection

Do I need ML expertise?

No. NEO is designed for all skill levels.

BeginnersML practitioners
Use task templatesSpecify models and constraints
Step-by-step explanationsAccess detailed reports
Start simple, progressCustomize deployments and evaluation
Describe your business goalModify generated code in VS Code

The key is clear goal description, not prior ML knowledge.

How long does a typical project take?

Task typeDuration
Simple tabular models15-30 min
Image classification30-60 min
Large datasets (>1GB)1-3 hrs
NLP fine-tuning2-6 hrs
Custom deep learning4-12 hrs

Tip: Start with a small sample, then scale.


Data and File Handling

What file formats does NEO support?

FormatUse casePlatformVS Code
CSVTabular and time series
ParquetLarge datasets
JSONStructured and log data
ImagesComputer vision tasks✅ (50MB)
AudioSpeech and music✅ (50MB)

How do I handle large datasets?

Approach:

  1. Platform – Use cloud storage (S3, Google Cloud Storage, Azure Blob Storage)
  2. Convert to Parquet – Faster processing
  3. Test first – Use 10% sample

File limits:

  • Platform upload: 50MB per file
  • Platform cloud storage: unlimited
  • VS Code: unlimited local files

Does NEO handle missing data?

Yes. Automatic detection and imputation:

Data typeStrategy
NumericalMean, median, predictive
CategoricalMode, “Unknown”
Time seriesForward fill, interpolation
AdvancedML-based imputation

Platform Mode vs VS Code Extension

What’s the difference between platform mode and VS Code extension?

FeaturePlatformVS Code extension
AccessBrowserVS Code editor
SetupQuick, no installInstall once
DataUpload ≤50MB or cloudLocal and cloud
ArtifactsDownloadableGenerated in workspace
Code editingView onlyFull IDE and Git
Best forPrototyping, collaborationCustomization, local dev, large datasets

Which mode should I use?

Platform mode: Quick results, no setup, collaborative testing

VS Code extension: Edit code, work with large local files, full IDE features, version control


Task Submission

How do I write an effective task?

Include:

  1. Goal – What to predict or classify
  2. Data – Files, size, key columns
  3. Metrics – How to measure success
  4. Context – Business relevance

Example comparison:

GoodPoor
Predict customer churn using customer_data.csv (50k rows). Optimize for precision-recall balance.Do some ML with my data

What metrics should I use?

TaskMetric
RegressionRMSE, MAE, R²
ClassificationAccuracy, F1, AUC-ROC
Time seriesMAPE, SMAPE, directional accuracy
RankingNDCG, MAP, precision@k

Map metrics to business goals:

  • Minimize false positives → precision
  • Catch all fraud → recall
  • Balance speed and accuracy → F1-score

Technical and Security

Is my data secure?

FeatureDescription
Data encryptionAt rest and in transit
No sharingNever shared with third parties
Complete controlDelete or export anytime

Platform: Cloud encrypted, deletion on request, no sharing

VS Code: Local files never leave machine, secure cloud access via credentials, full control

Can I see generated code?

Yes, NEO provides:

  • Preprocessing and modeling code
  • Step-by-step notebooks
  • Deployment scripts
  • Documentation and methodology

What if model performance is low?

Improvement steps:

  1. Improve data quality and features
  2. Adjust metrics and constraints
  3. Provide domain knowledge
  4. Request specific approaches (ensembles, deep learning)

Focus on practical business impact, not perfect accuracy.


Need More Help?