[User Story] A novice user needs help to convert their data into a format that they can use with the available Python interface #19

@HuangYiran

Description

A production engineer has downloaded logging information from his machines and understands that he can, in theory, predict machine lifetime from the sensor data. He follows the provided instructions to convert his data into a multivariate time series with a column that contains the time until the next detected failure.
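
To make the target of this conversion concrete, here is a minimal sketch of how the "time until the next failure" column could be derived. It uses only the standard library; the function name `add_time_to_failure`, the column names (`timestamp`, `temp`, `vibration`, `time_to_failure_s`) and the sample values are assumptions for illustration, not part of the package.

```python
from bisect import bisect_left
from datetime import datetime


def add_time_to_failure(rows, failure_times):
    """Return copies of `rows` with an added 'time_to_failure_s' column.

    rows: list of dicts, each with a 'timestamp' (datetime) plus sensor values.
    failure_times: datetimes at which a failure was detected.
    Rows after the last known failure get None, because their target is unknown.
    """
    failures = sorted(failure_times)
    labelled = []
    for row in sorted(rows, key=lambda r: r["timestamp"]):
        # Index of the first failure at or after this reading.
        idx = bisect_left(failures, row["timestamp"])
        if idx < len(failures):
            ttf = (failures[idx] - row["timestamp"]).total_seconds()
        else:
            ttf = None
        labelled.append({**row, "time_to_failure_s": ttf})
    return labelled


# Hypothetical sensor log with one failure at 02:00.
rows = [
    {"timestamp": datetime(2024, 1, 1, 0, 0), "temp": 70.1, "vibration": 0.2},
    {"timestamp": datetime(2024, 1, 1, 1, 0), "temp": 74.3, "vibration": 0.5},
    {"timestamp": datetime(2024, 1, 1, 3, 0), "temp": 69.8, "vibration": 0.2},
]
failures = [datetime(2024, 1, 1, 2, 0)]
labelled = add_time_to_failure(rows, failures)
```

In this sketch, readings taken after the last recorded failure keep an empty target instead of a guessed value, so the user can decide whether to drop them.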

A web shop owner understands that he might be able to predict the number of purchases in his web shop within a month. Following the instructions, he has collected a large amount of data over the past years. He now follows the provided instructions to convert his data.
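
For this story, the conversion is mostly an aggregation step: raw purchase records become one row per month. A rough sketch under the assumption that each purchase has a date; the function name `purchases_per_month` and the output columns `month` and `purchases` are hypothetical.

```python
from collections import Counter
from datetime import date


def purchases_per_month(purchase_dates):
    """Count purchases per calendar month, including months with no purchases."""
    counts = Counter((d.year, d.month) for d in purchase_dates)
    if not counts:
        return []
    (year, month), last = min(counts), max(counts)
    series = []
    while (year, month) <= last:
        series.append({
            "month": f"{year:04d}-{month:02d}",
            "purchases": counts.get((year, month), 0),
        })
        # Step to the next calendar month.
        year, month = (year + 1, 1) if month == 12 else (year, month + 1)
    return series


# Hypothetical purchase log.
dates = [date(2023, 11, 3), date(2023, 11, 20), date(2024, 1, 5)]
monthly = purchases_per_month(dates)
```

Months without any purchase are filled with 0 so that the resulting time series has no gaps.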

A manufacturer wants to use the package to predict manufacturing times for his products. He has collected the data of the past years and saved it in a CSV file. Some information is missing for some samples.
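
A first step for this case could be a report that tells the user where values are missing before any prediction is attempted. The sketch below assumes that empty cells and a few common markers count as missing; the set `MISSING_MARKERS`, the function name `missing_value_report` and the sample columns are assumptions.

```python
import csv
import io

# Assumption: these cell contents are treated as "missing".
MISSING_MARKERS = {"", "NA", "NaN", "null"}


def missing_value_report(csv_file):
    """Return (row_count, {column: number_of_missing_values}) for an open CSV file."""
    reader = csv.DictReader(csv_file)
    missing = {name: 0 for name in reader.fieldnames or []}
    total = 0
    for row in reader:
        total += 1
        for name in missing:
            value = row.get(name)
            # Short rows yield None for the columns they lack.
            if value is None or value.strip() in MISSING_MARKERS:
                missing[name] += 1
    return total, missing


# Hypothetical manufacturing data with two gaps.
sample = io.StringIO(
    "product,material,duration_min\n"
    "A,steel,12.5\n"
    "B,,14.0\n"
    "C,wood,NA\n"
)
total, missing = missing_value_report(sample)
```

Such a report could then be the basis for proposals like dropping the affected rows or filling the gaps.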

These situations should be considered:

  • should provide basic methods for users who have multiple files, especially when the features and the target are not in the same database (preferably handled by a software tool rather than by user-written code).
    -- with a merge key
    -- without a merge key
  • should provide basic methods for users who have other types of data: TXT, Excel, JSON, databases (SQL, ...)
  • should automatically check for potential issues in the data
  • should propose solutions for these issues to the user and then execute them
  • should provide methods to deal with large data sets (larger than 10 GB)
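
As a starting point for discussion, a few of the points above could be sketched as follows: merging feature and target rows with and without a merge key, and reading a large CSV file in chunks. All function names (`merge_with_key`, `merge_without_key`, `read_in_chunks`) and sample columns are hypothetical, and only the standard library is used.

```python
import csv
import io
from itertools import islice


def merge_with_key(features, targets, key):
    """Inner-join two lists of dict rows on a shared key column."""
    targets_by_key = {row[key]: row for row in targets}
    return [
        {**row, **targets_by_key[row[key]]}
        for row in features
        if row[key] in targets_by_key
    ]


def merge_without_key(features, targets):
    """Pair rows by position; only valid if both sources share the same row order."""
    if len(features) != len(targets):
        raise ValueError("row counts differ, so rows cannot be paired by position")
    return [{**f, **t} for f, t in zip(features, targets)]


def read_in_chunks(csv_file, chunk_size):
    """Yield lists of at most `chunk_size` rows, so the whole file is never in memory."""
    reader = csv.DictReader(csv_file)
    while True:
        chunk = list(islice(reader, chunk_size))
        if not chunk:
            return
        yield chunk


# With a merge key: row order in the two sources may differ.
features = [{"id": "1", "temp": "70"}, {"id": "2", "temp": "75"}]
targets = [{"id": "2", "lifetime": "300"}, {"id": "1", "lifetime": "120"}]
merged = merge_with_key(features, targets, "id")

# Without a merge key: rows are matched purely by position.
positional = merge_without_key(
    [{"temp": "70"}, {"temp": "75"}],
    [{"lifetime": "120"}, {"lifetime": "300"}],
)

# Chunked reading of a (here tiny) CSV file.
chunks = list(read_in_chunks(io.StringIO("a,b\n1,2\n3,4\n5,6\n"), 2))
```

Merging without a key is only safe when the user can confirm that both sources list the samples in the same order, which is why the sketch refuses to pair sources of different length.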
