Upload Files and Parse Files

This guide walks you through uploading files to UnDatasIO and parsing them. It covers CSV files, JSON data, Excel spreadsheets, and the other supported formats.

UnDatasIO provides an interface for uploading and parsing a variety of file formats. The process involves three main steps:

  1. Upload your file to the platform
  2. Configure parsing options based on your file type
  3. Parse and review the results

UnDatasIO supports a wide range of file formats:

  • CSV (.csv) - Comma-separated values
  • TSV (.tsv) - Tab-separated values
  • JSON (.json) - JavaScript Object Notation
  • XML (.xml) - Extensible Markup Language
  • TXT (.txt) - Plain text files
  • Excel (.xlsx, .xls) - Microsoft Excel files
  • Google Sheets - Exported as CSV or Excel
  • PDF (.pdf) - Portable Document Format
  • Word (.docx, .doc) - Microsoft Word documents
  • SQL (.sql) - SQL dump files
  • Database exports - Various database export formats
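
To check a file name against this list before uploading, a small lookup is enough. The sketch below is only an illustration: `SUPPORTED_EXTENSIONS` and `detectFormat` are hypothetical names, not part of UnDatasIO, and the extensions are copied from the list above.

```javascript
// Hypothetical pre-upload check; the extensions come from the supported-formats list.
const SUPPORTED_EXTENSIONS = new Map([
  ["csv", "CSV"], ["tsv", "TSV"], ["json", "JSON"], ["xml", "XML"],
  ["txt", "TXT"], ["xlsx", "Excel"], ["xls", "Excel"], ["pdf", "PDF"],
  ["docx", "Word"], ["doc", "Word"], ["sql", "SQL"],
]);

// Returns the format label for a file name, or null if the extension is not listed.
function detectFormat(fileName) {
  const dot = fileName.lastIndexOf(".");
  if (dot === -1 || dot === fileName.length - 1) return null;
  const extension = fileName.slice(dot + 1).toLowerCase();
  return SUPPORTED_EXTENSIONS.get(extension) ?? null;
}
```

For example, `detectFormat("report.XLSX")` returns "Excel", while a name with no extension returns null. An extension check says nothing about the actual content, so the platform's own detection still applies after upload.
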

Accessing the Upload Interface:

  1. Log in to your UnDatasIO account at app.undatasio.com
  2. Navigate to the main dashboard
  3. Click the “Upload Files” button or drag files directly to the upload area

Drag and Drop:

  1. Open your file explorer
  2. Select the file you want to upload
  3. Drag the file to the upload area on the UnDatasIO interface
  4. Release to start the upload

File Browser:

  1. Click “Choose File” or “Browse”
  2. Navigate to your file location
  3. Select the file
  4. Click “Open” to start the upload

Multiple Files:

  1. Hold Ctrl (Windows) or Cmd (Mac) to select multiple files
  2. Drag all selected files to the upload area
  3. Release to upload all files simultaneously

Once your file is uploaded, UnDatasIO will:

  1. Analyze the file content and format
  2. Detect the file type automatically
  3. Display a preview of the file content
  4. Suggest optimal parsing settings

Basic Settings (CSV and TSV files):

  • Delimiter: Choose the character that separates values

    • Comma (,)
    • Semicolon (;)
    • Tab (\t)
    • Pipe (|)
    • Custom delimiter
  • Header Row: Specify if the first row contains column names

    • Check “Has Header” if your file has column names
    • Uncheck if the first row contains data
  • Encoding: Select the file encoding

    • UTF-8 (recommended for international characters)
    • ISO-8859-1 (Latin-1)
    • Windows-1252 (Windows default)

Advanced Settings (CSV and TSV files):

  • Quote Character: Character used to enclose text fields
  • Escape Character: Character used to escape special characters
  • Skip Empty Lines: Remove rows with no data
  • Trim Whitespace: Remove leading/trailing spaces
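
To show how these options interact, here is a minimal sketch of a delimited-text parser. It is not UnDatasIO's implementation: `splitLine`, `parseDelimited`, and the option names are hypothetical, and a doubled quote inside a quoted field is assumed to stand for a literal quote. Line breaks inside quoted fields are not handled.

```javascript
// Split one line into fields, honoring a quote character.
// A doubled quote inside a quoted field ("") is read as one literal quote.
function splitLine(line, delimiter = ",", quote = '"') {
  const fields = [];
  let current = "";
  let inQuotes = false;
  for (let i = 0; i < line.length; i++) {
    const ch = line[i];
    if (inQuotes) {
      if (ch === quote && line[i + 1] === quote) {
        current += quote;
        i++; // skip the second quote of the pair
      } else if (ch === quote) {
        inQuotes = false;
      } else {
        current += ch;
      }
    } else if (ch === quote) {
      inQuotes = true;
    } else if (ch === delimiter) {
      fields.push(current);
      current = "";
    } else {
      current += ch;
    }
  }
  fields.push(current);
  return fields;
}

// Apply the header, empty-line, and whitespace options to a whole text.
function parseDelimited(text, options = {}) {
  const {
    delimiter = ",",
    quote = '"',
    hasHeader = true,
    skipEmptyLines = true,
    trim = true,
  } = options;
  let lines = text.split(/\r?\n/);
  if (skipEmptyLines) lines = lines.filter((line) => line.trim() !== "");
  const rows = lines.map((line) => {
    const fields = splitLine(line, delimiter, quote);
    return trim ? fields.map((field) => field.trim()) : fields;
  });
  if (!hasHeader) return { header: null, rows };
  return { header: rows[0] ?? null, rows: rows.slice(1) };
}
```

Calling `parseDelimited(text, { delimiter: ";" })` on a semicolon-separated file keeps a quoted value such as "a;b" in a single field, which is why the quote character matters as soon as values can contain the delimiter.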

Basic Settings (JSON files):

  • Root Path: Specify the path to your data array/object

    • Leave empty if data is at the root level
    • Use dot notation (e.g., “data.users”) for nested structures
  • Schema Validation: Enable to validate against a JSON schema

    • Upload a JSON schema file
    • Or define schema inline

Advanced Settings (JSON files):

  • Pretty Print: Format JSON for better readability
  • Max Depth: Limit the depth of nested objects
  • Array Handling: Choose how to handle arrays
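
The Root Path setting can be pictured as walking the parsed document one key at a time. The following sketch assumes plain dot notation with no escaping; `resolveRootPath` is a hypothetical helper, not an UnDatasIO function.

```javascript
// Follow a dot-notation path such as "data.users" into a parsed JSON document.
// An empty path means the data is at the root level.
function resolveRootPath(document, rootPath = "") {
  if (rootPath === "") return document;
  let node = document;
  for (const key of rootPath.split(".")) {
    const isContainer = node !== null && typeof node === "object";
    if (!isContainer || !Object.prototype.hasOwnProperty.call(node, key)) {
      return undefined; // the path does not exist in this document
    }
    node = node[key];
  }
  return node;
}
```

With a document like `{ "data": { "users": [...] } }`, the path "data.users" yields the users array, and an empty path yields the whole document.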

Basic Settings (Excel files):

  • Sheet Selection: Choose which worksheet to parse

    • Select from available sheets
    • Or specify sheet by name/number
  • Range: Specify the cell range to parse

    • Use Excel notation (e.g., “A1:D100”)
    • Leave empty to parse entire sheet
  • Header Row: Specify if the first row contains column names

Advanced Settings (Excel files):

  • Skip Rows: Number of rows to skip at the beginning
  • Skip Columns: Number of columns to skip from the left
  • Formula Evaluation: Whether to evaluate formulas
  • Date Format: Specify date format for date columns
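
To make the Range notation concrete, the sketch below converts a range such as “A1:D100” into numeric row and column bounds. `columnToIndex` and `parseRange` are hypothetical helpers written for illustration; they cover only the simple `A1:D100` form, without sheet names or absolute markers.

```javascript
// Convert column letters to a 1-based index: A -> 1, Z -> 26, AA -> 27.
function columnToIndex(letters) {
  let index = 0;
  for (const ch of letters.toUpperCase()) {
    index = index * 26 + (ch.charCodeAt(0) - 64); // "A" has char code 65
  }
  return index;
}

// Parse a range like "A1:D100" into 1-based row and column bounds.
function parseRange(range) {
  const match = /^([A-Za-z]+)(\d+):([A-Za-z]+)(\d+)$/.exec(range.trim());
  if (!match) throw new Error(`Invalid range: ${range}`);
  const [, startCol, startRow, endCol, endRow] = match;
  return {
    startColumn: columnToIndex(startCol),
    startRow: Number(startRow),
    endColumn: columnToIndex(endCol),
    endRow: Number(endRow),
  };
}
```

Here “A1:D100” covers columns 1 through 4 and rows 1 through 100.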

Basic Settings (text files):

  • Delimiter: Character that separates values
  • Line Separator: Character that separates lines
  • Encoding: File encoding

Advanced Settings (text files):

  • Custom Parsing Rules: Define custom parsing patterns
  • Regular Expressions: Use regex for complex parsing
  • Multi-line Records: Handle records spanning multiple lines
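
As an illustration of regex-based parsing with multi-line records, the sketch below groups continuation lines with the record they belong to and then extracts fields with named groups. The log-style layout (date, level, message) is a made-up example, and `groupRecords`, `parseRecord`, and `RECORD_PATTERN` are hypothetical names.

```javascript
// Group lines into records: a line matching `recordStart` begins a new record,
// and any other line is appended to the record before it.
// Pass a regex without the "g" flag so repeated test() calls stay stateless.
function groupRecords(text, recordStart) {
  const records = [];
  for (const line of text.split(/\r?\n/)) {
    if (recordStart.test(line) || records.length === 0) {
      records.push(line);
    } else {
      records[records.length - 1] += "\n" + line;
    }
  }
  return records;
}

// Example pattern for records shaped like "2024-01-05 ERROR some message".
const RECORD_PATTERN = /^(?<date>\d{4}-\d{2}-\d{2}) (?<level>[A-Z]+) (?<message>[\s\S]*)$/;

// Extract named fields from one record, or return null if it does not match.
function parseRecord(record) {
  const match = RECORD_PATTERN.exec(record);
  return match ? { ...match.groups } : null;
}
```

A stack trace that follows an error line ends up inside that record's `message` field instead of being treated as separate rows.
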

Parsing the File:

  1. Click “Parse File” to start the parsing process
  2. Wait for the parsing to complete
  3. Review the results in the preview panel

Data Preview:

  • First 10 rows of parsed data
  • Column names and data types
  • Data quality indicators

Parsing Statistics:

  • Total rows processed
  • Total columns detected
  • Parsing errors (if any)
  • Processing time

Data Quality Report:

  • Missing values count
  • Data type consistency
  • Format validation results
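
The kind of checks behind a data quality report can be sketched in a few lines. This is an illustration under simple assumptions (values arrive as strings, and an empty string counts as missing), not the platform's actual rules; `inferType` and `columnReport` are hypothetical names.

```javascript
// Classify a single cell value; empty, null, and undefined count as missing.
function inferType(value) {
  if (value === "" || value === null || value === undefined) return "missing";
  if (/^-?\d+$/.test(value)) return "integer";
  if (/^-?\d+\.\d+$/.test(value)) return "decimal";
  if (/^(true|false)$/i.test(value)) return "boolean";
  return "string";
}

// For each column, count missing values and list the distinct types seen.
function columnReport(header, rows) {
  return header.map((name, index) => {
    const counts = {};
    for (const row of rows) {
      const type = inferType(row[index]);
      counts[type] = (counts[type] ?? 0) + 1;
    }
    const types = Object.keys(counts).filter((type) => type !== "missing");
    return {
      name,
      missing: counts.missing ?? 0,
      types,
      consistent: types.length <= 1, // at most one non-missing type
    };
  });
}
```

A column that reports more than one type, for example integers mixed with free text, is a candidate for cleaning before further processing.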

Encoding Problems:

  • Symptoms: Garbled characters, missing text
  • Solution: Try different encoding options (UTF-8, ISO-8859-1, Windows-1252)
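
Trying encodings in order can be automated when you have the raw bytes. The sketch below uses the standard `TextDecoder` with `fatal: true`, so that bytes that are not valid UTF-8 cause a fallback to the next encoding instead of producing garbled characters. `decodeWithFallback` is a hypothetical helper, and the encoding order is an assumption.

```javascript
// Try each encoding in order and return the first one that decodes cleanly.
// With fatal: true, TextDecoder throws on byte sequences invalid for the encoding.
function decodeWithFallback(bytes, encodings = ["utf-8", "windows-1252"]) {
  for (const encoding of encodings) {
    try {
      const text = new TextDecoder(encoding, { fatal: true }).decode(bytes);
      return { encoding, text };
    } catch {
      // Invalid byte sequence for this encoding; try the next one.
    }
  }
  return null;
}
```

UTF-8 goes first because it is strict: text saved as Windows-1252 with accented characters usually fails UTF-8 validation, whereas single-byte encodings accept almost any input and would mask the problem if tried first.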

Delimiter Issues:

  • Symptoms: All data in one column, incorrect column separation
  • Solution: Check the actual delimiter in your file and select the correct option
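
One way to check the actual delimiter is to count candidate characters in the first few lines and keep the one that appears the same number of times on every line. The sketch below is a simple heuristic, not UnDatasIO's detection logic; `guessDelimiter` is a hypothetical name, and the heuristic ignores quoting.

```javascript
// Guess the delimiter by looking for a candidate that occurs a consistent,
// non-zero number of times on each sampled line. Returns null if none fits.
function guessDelimiter(text, candidates = [",", ";", "\t", "|"], sampleSize = 10) {
  const lines = text
    .split(/\r?\n/)
    .filter((line) => line !== "")
    .slice(0, sampleSize);
  let best = null;
  let bestCount = 0;
  for (const candidate of candidates) {
    const counts = lines.map((line) => line.split(candidate).length - 1);
    const consistent = counts.every((count) => count === counts[0]);
    if (consistent && counts[0] > bestCount) {
      best = candidate;
      bestCount = counts[0];
    }
  }
  return best;
}
```

If the result is null, the file may use a custom delimiter or contain quoted values that include the delimiter, and the setting is best chosen by hand.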

Header Problems:

  • Symptoms: Column names appear as data, or data appears as column names
  • Solution: Toggle the “Has Header” option

Large File Issues:

  • Symptoms: Slow processing, timeout errors
  • Solution: Use streaming processing or split large files
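
Splitting a delimited file can be sketched as cutting it into groups of rows and repeating the header in each piece. `splitIntoChunks` below is a hypothetical helper that works on text already in memory; for files too large to load at once, the same idea would need a streaming reader. It also assumes no line breaks inside quoted fields.

```javascript
// Split delimited text into chunks of at most `rowsPerChunk` data rows.
// When hasHeader is true, the header line is repeated at the top of every chunk.
function splitIntoChunks(text, rowsPerChunk, hasHeader = true) {
  if (!Number.isInteger(rowsPerChunk) || rowsPerChunk < 1) {
    throw new RangeError("rowsPerChunk must be a positive integer");
  }
  const lines = text.split(/\r?\n/).filter((line) => line !== "");
  const header = hasHeader ? lines.shift() : undefined;
  const chunks = [];
  for (let i = 0; i < lines.length; i += rowsPerChunk) {
    const body = lines.slice(i, i + rowsPerChunk);
    chunks.push((header !== undefined ? [header, ...body] : body).join("\n"));
  }
  return chunks;
}
```

Because every chunk carries its own header, each piece can be uploaded and parsed with the same configuration as the original file.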

“Invalid file format”

  • Check if the file extension matches the actual content
  • Try manual format selection
  • Verify the file isn’t corrupted

“Parsing failed”

  • Review the error details
  • Check file encoding and format
  • Verify parsing configuration

“File too large”

  • Use file compression
  • Split the file into smaller chunks
  • Contact support for large file processing

Once parsing is successful, you can:

Export the results:

  • Download as CSV: Standard comma-separated format
  • Download as JSON: Structured data format
  • Download as Excel: Spreadsheet format
  • Download as XML: Markup language format

Continue processing the data:

  • Data Cleaning: Remove duplicates, handle missing values
  • Data Transformation: Convert data types, format values
  • Data Validation: Apply business rules and constraints
  • Data Analysis: Generate statistics and insights

Process multiple files simultaneously:

  1. Upload multiple files at once
  2. Select all files in the batch
  3. Apply the same parsing configuration to all files
  4. Process all files together
  5. Download results as a single file or individual files

Save and reuse parsing configurations:

  1. Configure parsing options for your file type
  2. Click “Save as Template”
  3. Name your template (e.g., “Customer Data CSV”)
  4. Use the template for future files of the same type

Use the API for automated file processing:

// Upload file via API
// `file` is a File or Blob you already have; the form field name "file" is an assumption
const formData = new FormData();
formData.append("file", file);

const response = await fetch("https://api.undatasio.com/v1/files/upload", {
  method: "POST",
  headers: {
    Authorization: "Bearer YOUR_API_KEY",
  },
  body: formData,
});

// Parse file via API
// `fileId` is the identifier of the uploaded file, taken from the upload response
const parseResponse = await fetch(
  `https://api.undatasio.com/v1/files/${fileId}/parse`,
  {
    method: "POST",
    headers: {
      Authorization: "Bearer YOUR_API_KEY",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      format: "csv",
      delimiter: "comma",
      hasHeader: true,
    }),
  }
);
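
The two calls can be combined into one function with basic error handling. The sketch below reuses the endpoints shown above, but several details are assumptions rather than documented behavior: the form field name `file`, an upload response of the form `{ id: ... }`, and the helper name `uploadAndParse`. The `fetchImpl` parameter exists only so the function can be exercised without network access.

```javascript
const API_BASE = "https://api.undatasio.com/v1";

// Upload a file, then request parsing, failing fast on non-2xx responses.
async function uploadAndParse(file, apiKey, parseOptions, fetchImpl = fetch) {
  const formData = new FormData();
  formData.append("file", file); // assumed field name

  const upload = await fetchImpl(`${API_BASE}/files/upload`, {
    method: "POST",
    headers: { Authorization: `Bearer ${apiKey}` },
    body: formData,
  });
  if (!upload.ok) {
    throw new Error(`Upload failed with status ${upload.status}`);
  }
  const { id: fileId } = await upload.json(); // assumed response shape: { id: ... }

  const parse = await fetchImpl(`${API_BASE}/files/${fileId}/parse`, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(parseOptions),
  });
  if (!parse.ok) {
    throw new Error(`Parse failed with status ${parse.status}`);
  }
  return parse.json();
}
```

Checking `ok` on each response matters because `fetch` only rejects on network failures; an HTTP error status still resolves normally.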

Set up webhooks for automated processing:

  1. Configure webhook endpoints
  2. Upload files via API or interface
  3. Receive notifications when parsing is complete
  4. Automatically process results in your application
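
On the receiving side, step 3 comes down to validating the notification before acting on it. The webhook payload format is not described here, so the sketch below assumes a hypothetical JSON body of the form `{ "event": "parse.completed", "fileId": "..." }`; the event name, the field names, and `handleWebhook` are all illustrative.

```javascript
// Validate a webhook body and hand the file ID to a callback.
// Returns the HTTP status and body the endpoint would send back.
function handleWebhook(rawBody, onParsed) {
  let payload;
  try {
    payload = JSON.parse(rawBody);
  } catch {
    return { status: 400, body: "Invalid JSON" };
  }
  const isObject = payload !== null && typeof payload === "object";
  // Assumed payload shape; adjust to the real notification format.
  if (!isObject || payload.event !== "parse.completed" || typeof payload.fileId !== "string") {
    return { status: 422, body: "Unsupported payload" };
  }
  onParsed(payload.fileId);
  return { status: 200, body: "OK" };
}
```

Keeping the handler free of any HTTP framework makes it easy to plug into whichever server you use and to test with plain strings.
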

File Preparation:

  1. Clean your data before uploading

    • Remove unnecessary formatting
    • Ensure consistent data types
    • Handle missing values appropriately
  2. Use appropriate file formats

    • CSV for tabular data
    • JSON for structured data
    • Excel for complex spreadsheets
  3. Check file encoding

    • Use UTF-8 for international characters
    • Ensure compatibility with your data source

Performance:

  1. Optimize file size

    • Compress large files
    • Remove unnecessary columns
    • Use efficient data formats
  2. Use batch processing

    • Process multiple files together
    • Reuse parsing configurations
    • Automate repetitive tasks
  3. Monitor processing

    • Check processing times
    • Review error logs
    • Optimize based on performance data

Data Quality:

  1. Validate your data

    • Check for missing values
    • Verify data types
    • Ensure data consistency
  2. Handle errors gracefully

    • Review parsing errors
    • Fix data issues
    • Re-parse as needed
  3. Document your process

    • Save parsing configurations
    • Note any data transformations
    • Keep audit trails

File won’t upload:

  • Check file size limits
  • Verify file format is supported
  • Ensure stable internet connection

Parsing errors:

  • Review file content for issues
  • Check parsing configuration
  • Try different encoding options

Slow processing:

  • Optimize file size
  • Use appropriate file format
  • Consider batch processing

If you encounter issues:

  1. Check the documentation for your specific file format
  2. Review error messages for specific guidance
  3. Contact support with detailed information about your issue
  4. Join the community for help from other users

After successfully uploading and parsing your files:

  1. Explore data processing features to clean and transform your data
  2. Learn about API integration to automate your workflows
  3. Set up data pipelines for recurring processing tasks
  4. Implement data validation to ensure data quality

For more advanced topics, check out: