Sobre
Dr. Karin Breitman is a global executive and independent board director with a track…
Atividades
5 mil seguidores
Experiência
Formação acadêmica
Experiência de voluntariado
-
Practitioners Board Member
ACM, Association for Computing Machinery
- 5 anos 1 mês
Ciência e tecnologia
-
Advisory Board Member
European Union Brazil Cloud Forum
- o momento 11 anos 5 meses
Ciência e tecnologia
A priority policy action for Europe and Brazil cooperation for the 2016 time-frame should be the definition and scoping of a public, open cloud infrastructure that all scientific researchers can use in an integrated way, the so called “Open Science Cloud”. Within this priority area, EUbrasilCloudFORUM is going to contribute in a strategic way to the definition, scoping and eventual creation of an EU-Brazil Open Science Cloud. The project will facilitate the establishment of an organisational…
A priority policy action for Europe and Brazil cooperation for the 2016 time-frame should be the definition and scoping of a public, open cloud infrastructure that all scientific researchers can use in an integrated way, the so called “Open Science Cloud”. Within this priority area, EUbrasilCloudFORUM is going to contribute in a strategic way to the definition, scoping and eventual creation of an EU-Brazil Open Science Cloud. The project will facilitate the establishment of an organisational cooperation model that enables the EU and Brazil to formulate and develop a common strategy and approach for Research & Innovation in Cloud Computing in line with the priorities of each region.
-
-
Publications Board Member
ACM, Association for Computing Machinery
- 2 anos 6 meses
Ciência e tecnologia
-
Director
France Brazil Chamber of Commerce
- 11 meses
Empoderamento econômico
-
Strategic Committee project SUPER
Universidade Federal do Amazonas
- o momento 5 anos 10 meses
Ciência e tecnologia
A collaboration between Samsung and UFAM, this project if focused on STEM capacitation in the Amazon.
http://super.ufam.edu.br/ -
Publicações
Patentes
-
Methods and apparatus for person-centric multichannel opinion mining in data lakes
Expedidas em US 11113306
Ver patentePerson-centric multi-channel opinion mining is performed in a single data repository, such as a data lake. An exemplary method comprises obtaining multi-channel heterogeneous data from a plurality of channels; identifying entities that are targets of opinion information across the plurality of channels; extracting a plurality of user identities from the plurality of channels; aligning the plurality of extracted user identities across the plurality of channels to link common user identities;…
Person-centric multi-channel opinion mining is performed in a single data repository, such as a data lake. An exemplary method comprises obtaining multi-channel heterogeneous data from a plurality of channels; identifying entities that are targets of opinion information across the plurality of channels; extracting a plurality of user identities from the plurality of channels; aligning the plurality of extracted user identities across the plurality of channels to link common user identities; identifying the entities that are targets of the opinion information of the extracted user identities; linking opinion information of the extracted user identities with a user identity associated with an opinion holder that expressed the opinion information; determining whether the opinion information comprises a positive or negative opinion; and providing a summary of the opinion information of a given opinion holder. Sentiment polarity classification algorithms optionally determine whether opinion information comprises a positive or negative opinion and assign a polarity score. An influencer score of the opinion holder is optionally assigned to the opinion information.
-
METHODS AND APPARATUS FOR AUTOMATIC MEDIA FILE TRANSCODING
Expedidas em US 10757360
Ver patenteMethods and apparatus are provided for automatically transcoding media files. An exemplary method comprises obtaining an input media file having an input file format and encoded with a codec of a first type; automatically determining output media file formats for transcoding the input media file based on statistics of previously transcoded files and statistics of trending media formats for previously downloaded files; and transcoding the input media file into transcoded output media files using…
Methods and apparatus are provided for automatically transcoding media files. An exemplary method comprises obtaining an input media file having an input file format and encoded with a codec of a first type; automatically determining output media file formats for transcoding the input media file based on statistics of previously transcoded files and statistics of trending media formats for previously downloaded files; and transcoding the input media file into transcoded output media files using a codec of a second type to obtain the determined output media file formats. The output media file formats can be automatically determined using a weighting scheme. Transcoding algorithms are optionally automatically selected based on transcoding algorithms previously used to transcode proximally similar files as the input media file. Input files can be dynamically prioritized for transcoding based on a complexity rating of the determined output media file formats and a ranking of selected transcoding algorithms Metadata is optionally generated for transcoded output media files and stored in the transcoded output media files and/or a separate media catalogue.
-
Architecture for a converged compute and file system within network-attached storage clusters
Expedidas em US 10419537
Scale-out network attached storage (NAS) file systems can employ an Ingest, Transform, Store (ITS) framework for data processing. In one aspect, the ITS-NAS file systems comprise NAS nodes and high performance computing (HPC) nodes that operate under a common operating system and that are coupled to each other via a common high-bandwidth, low-latency private network infrastructure. The NAS nodes can present data to the HPC nodes as well as dispatch the execution of transform services to the HPC…
Scale-out network attached storage (NAS) file systems can employ an Ingest, Transform, Store (ITS) framework for data processing. In one aspect, the ITS-NAS file systems comprise NAS nodes and high performance computing (HPC) nodes that operate under a common operating system and that are coupled to each other via a common high-bandwidth, low-latency private network infrastructure. The NAS nodes can present data to the HPC nodes as well as dispatch the execution of transform services to the HPC nodes. The ITS-NAS file systems enable massive parallelization of operations on files, for example, complex distributed operations on large files and/or simple parallel operations on large collections of small files, all within the same hardware and software architecture
Outros inventoresVer patente -
SEMANTIC SEARCH INTERFACE FOR DATA COLLECTIONS II
Expedidas em US 10402442
Described herein are technologies pertaining to automatically summarizing contents of a dataset and visualizing a summary of the dataset together with summaries of other datasets. A schema that defines the structure and content of a dataset is received, and pre-processing is undertaken on the schema to generate an enriched schema. Portions of the enriched schema are selected to generate a semantic summary of the schema, which is included with at least one exemplary entry of the dataset to…
Described herein are technologies pertaining to automatically summarizing contents of a dataset and visualizing a summary of the dataset together with summaries of other datasets. A schema that defines the structure and content of a dataset is received, and pre-processing is undertaken on the schema to generate an enriched schema. Portions of the enriched schema are selected to generate a semantic summary of the schema, which is included with at least one exemplary entry of the dataset to generate a summary of the dataset.
Outros inventoresVer patente -
Systems and methods for file triggers in a converged compute and file system
Expedidas em US 10331630
A hot folder mechanism is employed to provide a truly integrated architecture for easy-to-use, easy-to-deploy scale-out computation and scale-out storage. A folder of an Ingest, Transform, Store (ITS)-Network attached storage (NAS) system can be configured as “hot.” The configured hot folder can then detect changes on its content, analyze such content, perform transform services on the content, and output the computation results as files on other specified output folders. In one aspect, file…
A hot folder mechanism is employed to provide a truly integrated architecture for easy-to-use, easy-to-deploy scale-out computation and scale-out storage. A folder of an Ingest, Transform, Store (ITS)-Network attached storage (NAS) system can be configured as “hot.” The configured hot folder can then detect changes on its content, analyze such content, perform transform services on the content, and output the computation results as files on other specified output folders. In one aspect, file system nodes of the ITS-NAS can present the content to high performance computing (HPC) compute nodes of the ITS-NAS as well as to dispatch the execution of transform services to the HPC compute nodes.
Outros inventoresVer patente -
Integration of heterogenous data using omni-channel ontologies
Expedidas em US 10296913
Methods and apparatus are provided for integrating heterogeneous multi-channel data using ontologies. An exemplary method for integrating multi-channel heterogeneous data comprises obtaining a domain-specific mediator ontology; identifying a plurality of target channels; identifying entities pertinent to each of the plurality of channels; describing the entities pertinent to each of the plurality of channels using an ontology description language to generate a plurality of channel specific…
Methods and apparatus are provided for integrating heterogeneous multi-channel data using ontologies. An exemplary method for integrating multi-channel heterogeneous data comprises obtaining a domain-specific mediator ontology; identifying a plurality of target channels; identifying entities pertinent to each of the plurality of channels; describing the entities pertinent to each of the plurality of channels using an ontology description language to generate a plurality of channel specific ontologies; aligning the channel specific ontologies with the domain-specific mediator ontology to generate aligned channel specific and domain-specific mediator ontologies; extracting a plurality of user identities from the plurality of channels; aligning the plurality of extracted user identities across the plurality of channels to link common user identities; generating at least one user profile for at least one of the aligned user identities; and correlating at least one user profile with the aligned channel specific and domain-specific mediator ontologies to generate an omni-channel ontology that integrates the multi-channel heterogeneous data
Outros inventoresVer patente -
SEMANTIC SEARCH INTERFACE FOR DATA COLLECTIONS
Expedidas em US 20120310990
Technologies pertaining to automatically summarizing contents of a dataset and visualizing a summary of the dataset together with summaries of other datasets
Outros inventoresVer patente -
Methods and apparatus for a semantic multi-database data lake
US 10,901,973
Ver patenteMethods and apparatus are provided for integrating a plurality of different database types in a semantic multi-database data lake. An exemplary method comprises providing a plurality of databases having different database types; translating ontology definition language database commands obtained from a user into a plurality of data definition language and/or data manipulation language commands supported by the different database types in order to replicate data from the user to each of the…
Methods and apparatus are provided for integrating a plurality of different database types in a semantic multi-database data lake. An exemplary method comprises providing a plurality of databases having different database types; translating ontology definition language database commands obtained from a user into a plurality of data definition language and/or data manipulation language commands supported by the different database types in order to replicate data from the user to each of the different database types; obtaining a query specified in a query language of a given database; and delegating the query to the given database. A plurality of cluster gateways optionally manage a corresponding plurality of clusters of database instances and wherein queries are delegated to a given database instance by delegating the queries to the appropriate cluster gateway. Dark data that was not queried by any supported query language in a predefined period of time can be detected.
Idiomas
-
English
Nível nativo ou bilíngue
-
French
Nível avançado
-
Portuguese
Nível nativo ou bilíngue
Recomendações recebidas
-
Usuário do LinkedIn
3 pessoas recomendaram Karin
Cadastre-se agora para visualizar