Students from India and Japan assemble to deliberate data science
The Indian finance and technology hub of Gurgaon has recently played host to around 100 students of data science, accredited academicians and seasoned industry captains at the inaugural World Data Science Forum. The event assembled students from the Institute of Technology (IIT) Delhi and the University of Tokyo to deliberate the prospects of the data science industry and how the technology can be applied to solve real-world problems.
With the theme of “The Power Data Science for the Future”, the forum welcomes academicians from India and Japan – as well as veterans within the IT, automotive, telecommunications and healthcare industries from both countries – to discuss how innovations within the Internet of Things (IoT), Artificial Intelligence (AI) and the Blockchain will foster disruption for individuals, businesses and institutions in the years to come.
While the forum allows the academicians and industry captains to share their expert insights with a new generation of data scientists, the students are given the opportunity to engage in a Student Assignment, whereby they attempt to work out the most pressing problems faced by data analysts – Twitter spam. With the pervasiveness of spam Tweets generate on Twitter feeds every day, the students are tasked to apply their data analytics knowledge to distinguish automated tweets from thousands of real ones, particularly from a set of 10,000.
Nimish Joseph (1st Place, IIT Delhi), describes the challenge as being interesting and complex. “I had to identify spammers from a set of 10,000 anonymous Twitter users. Although the questionnaire gave us parameters such as their followers, friends, the number of tweets posted, topics discussed and emotional diversity, such information did not give us any information on whether the profile was real or not. Since the data was unlabelled, I decided to proceed with an unsupervised learning approach. This involved a better anomaly detection mechanism where I had to clean up the dataset provided to ensure that tweets with special characters and null characters are treated properly.”
Prof. Arpan Kar, one of the forum’s speakers, applauds the competition for being insightful and testing the analytical capabilities of the participants. “We received extremely interesting propositions from all the students to the problem presented to them. Although the same data analytics problem can be solved through 50 different approaches, the underpinning idea is not to create the most “suitable” approach, but a workable approach using a mix of different methods. I truly appreciate all the effort put in by these students in solving this case.”
Following the success of the World Data Science Forum, the organizer bitgrit will seek to use it to better realize its vision to build an Asia-centred data scientist network platform that utilizes Blockchain technology to allow for the optimal application of data science and AI within societal contexts.
“Data Analytics is becoming more integral to our daily lives the demand of data scientists to dissect it has never been higher,” says Kazuya Saginawa, Co-Founder & CEO of bitgrit. “By continuing to expand our network and matching experts with stakeholders and future innovators through initiatives such as the World Data Science Forum, we aim to build a data scientists network that is firmly rooted in Asia yet has the capacity to solve tomorrow’s pressing societal challenges.”
See What’s Next in Tech With the Fast Forward Newsletter
Tweets From @varindiamag
Nothing to see here - yet
When they Tweet, their Tweets will show up here.