[GSoC] DataFrame project bonding

Next Topic
 
classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

[GSoC] DataFrame project bonding

Boost - uBLAS mailing list
Hi, 
My name is Wei Wang, and I am lucky to be able to as the student for GSoC19 Boost::ublas library. My project is to build a data structure work like pandas.DataFrame or dataframe in R. As I found on GSoC's website, my mentor will be Bellot, and I'm very glad. 
I have two questions related to logistics: 
(1) Where should I work for the code? I find an empty organization in Github(https://github.com/BoostGSoC19), but I'm still not sure how I gonna submit them. 
(2) Should I fork the whole ublas project? Or simply start build my own project directly under boost/numeric/ublas? 
Another two questions related to project requirement: 
(1) I have read one implementation from one previous student (https://github.com/BoostGSoC17/data_frame), which is pretty good. But it somehow goes against my idea. Is it okay to start a new project? 
And also I'd like to ask what's your expectation from this project? 
I'm targeting at pandas.DataFrame(though it won't be that full-featured), but the basics are:
- indexing
- slicing
- sort based on col
- relation ops like select, join
- set operations on rows like union, set diff, intersect
- group (possibly)
(2) What should I show in my final submit? Will it be evaluated on whether my code is able to merge? Or simply I will be provided some test case and see if I can pass them? 

Cheers, 
Wei

_______________________________________________
ublas mailing list
[hidden email]
https://lists.boost.org/mailman/listinfo.cgi/ublas
Sent to: [hidden email]
Reply | Threaded
Open this post in threaded view
|

Re: [GSoC] DataFrame project bonding

Boost - uBLAS mailing list
Hi Wei,

I've sent you an email a few minutes ago with some instructions. As we have more ublas students this year, all of you can use the repository https://github.com/BoostGSoC19. If any of the student needs access right, please let me know now and I'll open it for you.

Ideally, all the students can fork the whole project now. At the end of the GSOC, if you are successful, then I will help you merging your work into the main branch on boost.
There was indeed an excellent work done in the past, but I would like to explore your approach too and that's the reason I selected your project so that you can work with me.

For the details, we'll manage that together off-line

Dear Ublas community, please join me to welcome our GSOC'19 student Wei and congratulate him.

Cheers,
David

On Tue, May 7, 2019 at 2:14 PM Wei Wang via ublas <[hidden email]> wrote:
Hi, 
My name is Wei Wang, and I am lucky to be able to as the student for GSoC19 Boost::ublas library. My project is to build a data structure work like pandas.DataFrame or dataframe in R. As I found on GSoC's website, my mentor will be Bellot, and I'm very glad. 
I have two questions related to logistics: 
(1) Where should I work for the code? I find an empty organization in Github(https://github.com/BoostGSoC19), but I'm still not sure how I gonna submit them. 
(2) Should I fork the whole ublas project? Or simply start build my own project directly under boost/numeric/ublas? 
Another two questions related to project requirement: 
(1) I have read one implementation from one previous student (https://github.com/BoostGSoC17/data_frame), which is pretty good. But it somehow goes against my idea. Is it okay to start a new project? 
And also I'd like to ask what's your expectation from this project? 
I'm targeting at pandas.DataFrame(though it won't be that full-featured), but the basics are:
- indexing
- slicing
- sort based on col
- relation ops like select, join
- set operations on rows like union, set diff, intersect
- group (possibly)
(2) What should I show in my final submit? Will it be evaluated on whether my code is able to merge? Or simply I will be provided some test case and see if I can pass them? 

Cheers, 
Wei
_______________________________________________
ublas mailing list
[hidden email]
https://lists.boost.org/mailman/listinfo.cgi/ublas
Sent to: [hidden email]

_______________________________________________
ublas mailing list
[hidden email]
https://lists.boost.org/mailman/listinfo.cgi/ublas
Sent to: [hidden email]
Reply | Threaded
Open this post in threaded view
|

Re: [GSoC] DataFrame project bonding

Boost - uBLAS mailing list
Hi Wei,

I also welcome you to the Boost/ublas community. Once you have forked from the boostorg/ublas repository, I will add you to the BoostGSoC19 gitter channel. Please let me know if you have any questions.

Best
Cem

Am Di., 7. Mai 2019 um 13:28 Uhr schrieb David Bellot via ublas <[hidden email]>:
Hi Wei,

I've sent you an email a few minutes ago with some instructions. As we have more ublas students this year, all of you can use the repository https://github.com/BoostGSoC19. If any of the student needs access right, please let me know now and I'll open it for you.

Ideally, all the students can fork the whole project now. At the end of the GSOC, if you are successful, then I will help you merging your work into the main branch on boost.
There was indeed an excellent work done in the past, but I would like to explore your approach too and that's the reason I selected your project so that you can work with me.

For the details, we'll manage that together off-line

Dear Ublas community, please join me to welcome our GSOC'19 student Wei and congratulate him.

Cheers,
David

On Tue, May 7, 2019 at 2:14 PM Wei Wang via ublas <[hidden email]> wrote:
Hi, 
My name is Wei Wang, and I am lucky to be able to as the student for GSoC19 Boost::ublas library. My project is to build a data structure work like pandas.DataFrame or dataframe in R. As I found on GSoC's website, my mentor will be Bellot, and I'm very glad. 
I have two questions related to logistics: 
(1) Where should I work for the code? I find an empty organization in Github(https://github.com/BoostGSoC19), but I'm still not sure how I gonna submit them. 
(2) Should I fork the whole ublas project? Or simply start build my own project directly under boost/numeric/ublas? 
Another two questions related to project requirement: 
(1) I have read one implementation from one previous student (https://github.com/BoostGSoC17/data_frame), which is pretty good. But it somehow goes against my idea. Is it okay to start a new project? 
And also I'd like to ask what's your expectation from this project? 
I'm targeting at pandas.DataFrame(though it won't be that full-featured), but the basics are:
- indexing
- slicing
- sort based on col
- relation ops like select, join
- set operations on rows like union, set diff, intersect
- group (possibly)
(2) What should I show in my final submit? Will it be evaluated on whether my code is able to merge? Or simply I will be provided some test case and see if I can pass them? 

Cheers, 
Wei
_______________________________________________
ublas mailing list
[hidden email]
https://lists.boost.org/mailman/listinfo.cgi/ublas
Sent to: [hidden email]
_______________________________________________
ublas mailing list
[hidden email]
https://lists.boost.org/mailman/listinfo.cgi/ublas
Sent to: [hidden email]

_______________________________________________
ublas mailing list
[hidden email]
https://lists.boost.org/mailman/listinfo.cgi/ublas
Sent to: [hidden email]